> ## Documentation Index > Fetch the complete documentation index at: https://docs.scrapegraphai.com/llms.txt > Use this file to discover all available pages before exploring further. # Extract > Natural-language structured extraction from a URL, HTML, or markdown. ```http theme={null} POST https://v2-api.scrapegraphai.com/api/extract ``` Replaces the v1 `smartscraper` endpoint. Provide a prompt (and optionally a JSON schema) and get typed JSON back — no selectors or post-processing needed. ## Request body Exactly one of `url`, `html`, or `markdown` must be supplied as the source. URL of the page to extract from. Raw HTML content to extract from (max 2 MB). Markdown content to extract from (max 2 MB). Natural-language description of what to extract. JSON schema describing the desired output shape. When provided, the LLM is constrained to match it. HTML pre-processing mode: `"normal"`, `"reader"`, or `"prune"`. Fetch-time options. See the [Scrape endpoint](/api-reference/endpoint/scrape#request-body) for the full field list (`mode`, `stealth`, `headers`, `cookies`, `scrolls`, `wait`, `timeout`, `country`). Ignored when `html` or `markdown` is supplied. ## Example request ```bash theme={null} curl -X POST https://v2-api.scrapegraphai.com/api/extract \ -H "SGAI-APIKEY: $SGAI_API_KEY" \ -H "Content-Type: application/json" \ -d '{ "url": "https://example.com", "prompt": "What is the title of this page?" }' ``` ## Example response ```json theme={null} { "id": "8c34fc03-17be-4fcc-a7ce-6ebcab23ad43", "raw": null, "json": { "title": "Example Domain" }, "usage": { "promptTokens": 361, "completionTokens": 92 }, "metadata": { "chunker": { "chunks": [{ "size": 33 }] }, "fetch": {} } } ``` | Field | Description | | ----------------------------------------------- | ------------------------------------------------------------------------------------- | | `id` | UUID for this extract call. | | `json` | Structured output matching the schema (or free-form JSON when no schema is supplied). | | `raw` | Raw model output before JSON parsing, when available. | | `usage.promptTokens` / `usage.completionTokens` | LLM token accounting. | | `metadata.chunker` | How the source content was split before extraction. | | `metadata.fetch` | Fetch diagnostics (populated when the page was fetched by the API). | ## With a schema ```bash theme={null} curl -X POST https://v2-api.scrapegraphai.com/api/extract \ -H "SGAI-APIKEY: $SGAI_API_KEY" \ -H "Content-Type: application/json" \ -d '{ "url": "https://example.com", "prompt": "Extract the page title and description", "schema": { "type": "object", "properties": { "title": { "type": "string" }, "description": { "type": "string" } }, "required": ["title"] } }' ``` ## Extract from HTML or markdown ```bash theme={null} curl -X POST https://v2-api.scrapegraphai.com/api/extract \ -H "SGAI-APIKEY: $SGAI_API_KEY" \ -H "Content-Type: application/json" \ -d '{ "html": "

Widget

$9.99

", "prompt": "Extract product name and price" }' ``` ## Related * Service overview: [Extract](/services/extract) * SDK wrappers: [Python](/sdks/python) · [JavaScript](/sdks/javascript)