Documentation Index
Fetch the complete documentation index at: https://docs.scrapegraphai.com/llms.txt
Use this file to discover all available pages before exploring further.
POST /api/crawl.
Path parameters
The crawl job UUID returned by
POST /api/crawl.Example request
Example response
| Field | Description |
|---|---|
status | "running", "completed", "failed", or "stopped". |
total / finished | Progress counters. |
pages[] | Per-page results, ordered by crawl time. |
pages[].scrapeRefId | UUID of the underlying Scrape call — pass to GET /api/history/:id to fetch the formatted content (markdown, HTML, JSON, screenshot, etc.). |
Poll at a reasonable cadence (every 1–5 seconds) until
status is "completed", "failed", or "stopped". Or use Monitor with a webhook to avoid polling entirely.Fetching page content
The crawl response intentionally returns lightweight metadata (url, depth, scrapeRefId, etc.) rather than embedding every page’s full body. Use GET /api/history/:id with each scrapeRefId to fetch the formatted content the underlying scrape produced:
HistoryEntry with the full result payload, e.g. result.results.markdown.data[0] for markdown. See the History endpoint reference for the entry shape and a complete crawl-to-content example.
Related
- Start a job:
POST /api/crawl - Stop / resume / delete: Manage crawl jobs
- Fetch each page’s content: History