How ScrapeGraphAI handles JS pages
ScrapeGraphAI uses a headless browser internally to render JavaScript before extracting content. For most sites this happens automatically.
Use wait_ms for delayed content
If the content loads after a short delay (lazy loading, carousels, infinite scroll), add a wait time before extraction starts:
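A minimal sketch of a request body that sets a wait time, for illustration only. The wait_ms name comes from this page; the website_url and user_prompt field names and the build_scrape_request helper are assumptions, so check the API reference for the exact request shape before relying on them.

```python
import json


def build_scrape_request(url: str, prompt: str, wait_ms: int = 0) -> dict:
    """Build a request body that asks the renderer to pause before extraction.

    The field names "website_url" and "user_prompt" are assumed for this sketch.
    """
    if wait_ms < 0:
        raise ValueError("wait_ms must be non-negative")
    body = {"website_url": url, "user_prompt": prompt}
    if wait_ms:
        # Only include the field when a delay is actually requested.
        body["wait_ms"] = wait_ms
    return body


request_body = build_scrape_request(
    "https://example.com/products",
    "List product names and prices",
    wait_ms=3000,  # wait 3 seconds for lazy-loaded content
)
print(json.dumps(request_body, indent=2))
```

Starting with a small value and raising it only when content is still missing keeps each request as fast as possible.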
Tips for specific scenarios
Infinite scroll / paginated lists
Infinite scroll pages only show a subset of items on initial load. Use the pagination parameter to iterate through pages, or use SmartCrawler to follow paginated links automatically.
Login-gated content
If the data requires authentication:
- Pass the required cookies or session tokens via the headers parameter.
- Alternatively, export a logged-in session cookie from your browser and include it in the Cookie header.
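The two options above can be sketched as follows. This is an illustration under stated assumptions: the headers parameter and the Cookie header come from this page, while the website_url and user_prompt field names, the helper functions, and the cookie names and values are hypothetical placeholders.

```python
def cookie_header(cookies: dict[str, str]) -> str:
    """Join cookie name/value pairs into a single Cookie header value."""
    return "; ".join(f"{name}={value}" for name, value in cookies.items())


def build_authenticated_request(url: str, prompt: str, cookies: dict[str, str]) -> dict:
    """Attach session cookies to a request body via a headers field.

    "website_url" and "user_prompt" are assumed field names for this sketch.
    """
    return {
        "website_url": url,
        "user_prompt": prompt,
        "headers": {"Cookie": cookie_header(cookies)},
    }


# Placeholder values; in practice, copy these from a logged-in browser session.
session_cookies = {"sessionid": "abc123", "csrftoken": "xyz789"}
request_body = build_authenticated_request(
    "https://example.com/account/orders",
    "Extract the order history",
    session_cookies,
)
print(request_body["headers"]["Cookie"])
```

Session cookies expire, so a request that worked earlier may start returning the login page; refreshing the exported cookie is usually the first thing to try.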
Single Page Applications (SPAs)
SPAs render content client-side after the initial load. Increasing wait_ms usually resolves extraction issues. If it does not, check whether the data is available through the site’s own API (visible in the Network tab in DevTools); calling that API directly may be easier.
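One way to apply the "increase wait_ms" advice is to retry with progressively longer waits and stop at the first non-empty result. The sketch below assumes a hypothetical scrape callable that takes a wait_ms value and returns the extracted data (or nothing); it stands in for whatever client call you actually use and is not part of the ScrapeGraphAI API.

```python
from typing import Callable, Optional, Sequence


def extract_with_increasing_wait(
    scrape: Callable[[int], Optional[dict]],
    waits_ms: Sequence[int] = (0, 2000, 5000),
) -> Optional[dict]:
    """Call scrape with each wait time in turn; return the first non-empty result."""
    for wait_ms in waits_ms:
        result = scrape(wait_ms)
        if result:
            return result
    # Nothing was extracted even at the longest wait.
    return None


def fake_scrape(wait_ms: int) -> Optional[dict]:
    """Stand-in for a real request: content "appears" only after 2 seconds."""
    return {"title": "Dashboard"} if wait_ms >= 2000 else {}


print(extract_with_increasing_wait(fake_scrape))
```

If the function returns None even at the longest wait, that is a reasonable point to look for the site's own API instead.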