Many websites spread their content across multiple pages — product listings, search results, articles. In v2 there are two approaches: letDocumentation Index
Fetch the complete documentation index at: https://docs.scrapegraphai.com/llms.txt
Use this file to discover all available pages before exploring further.
crawl.start follow links for you, or iterate page URLs manually with extract.
Using Crawl for multi-page extraction
crawl.start is the recommended service when you want to follow links automatically. It runs asynchronously — start a job and poll for the result.
Iterating page URLs with Extract
If you know the URL pattern for each page, callextract on each URL and aggregate results:
Tips
- Prefer
crawl.startwhen the number or pattern of pages is unknown — it handles link discovery for you. - Use manual iteration when URLs follow a predictable pattern (
?page=N) and you want tight control. - Add delays between pages in manual mode to avoid triggering rate limits on the target website.
- Stop early when the extracted list is empty or a “no more results” marker appears.
- For infinite-scroll pages, use
FetchConfig(scrolls=N)instead of pagination.