Mocking & Testing - ScrapeGraphAI

Mock Mode

Test your code without making real API calls

Custom Responses

Override responses for specific endpoints

Overview

A mock environment is an isolated test environment. You can use mock mode to test ScrapeGraphAI functionality in your application, and experiment with new features without affecting your live integration or consuming API credits. For example, when testing in mock mode, the scraping requests you create aren’t processed by our servers or counted against your credit usage.

Use cases

Mock mode provides an environment for testing various functionalities and scenarios without the implications of real API calls. Below are some common use cases for mocking in your ScrapeGraphAI integrations:

Scenario	Description
Simulate scraping responses to test without real API calls	Use mock mode to test scraping functionality without real API calls. Create mock responses in your application to test data processing logic or use custom handlers to simulate various response scenarios.
Scale isolated testing for teams	Your team can test in separate mock environments to make sure that data and actions are completely isolated from other tests. Changes made in one mock configuration don’t interfere with changes in another.
Test without API key requirements	You can test your integration without providing real API keys, making it easier for external developers, implementation partners, or design agencies to work with your code without access to your live API credentials.
Test in development or CI/CD pipelines	Access mock mode from your development environment or continuous integration pipelines. Test ScrapeGraphAI functionality directly in your code or use familiar testing frameworks and fixtures.

Test in mock mode

You can simulate scraping responses and use mock data to test your integration without consuming API credits. Learn more about using mock responses to confirm that your integration works correctly.

Basic Mock Usage

Enable mock mode by setting mock=True when initializing the client:

from scrapegraph_py import Client
from scrapegraph_py.logger import sgai_logger

# Set logging level for better visibility
sgai_logger.set_logging(level="INFO")

def basic_mock_usage():
    # Initialize the client with mock mode enabled
    client = Client.from_env(mock=True)

    print("\n-- get_credits (mock) --")
    print(client.get_credits())

    print("\n-- markdownify (mock) --")
    md = client.markdownify(website_url="https://example.com")
    print(md)

    print("\n-- get_markdownify (mock) --")
    md_status = client.get_markdownify("00000000-0000-0000-0000-000000000123")
    print(md_status)

    print("\n-- smartscraper (mock) --")
    ss = client.smartscraper(user_prompt="Extract title", website_url="https://example.com")
    print(ss)

if __name__ == "__main__":
    basic_mock_usage()

When mock mode is enabled, all API calls return predefined mock responses instead of making real HTTP requests. This ensures your tests run quickly and don’t consume API credits.

Custom Response Overrides

You can override specific endpoint responses using the mock_responses parameter:

def mock_with_path_overrides():
    # Initialize the client with mock mode and custom responses
    client = Client.from_env(
        mock=True,
        mock_responses={
            "/v1/credits": {"remaining_credits": 42, "total_credits_used": 58, "mock": true}
        },
    )

    print("\n-- get_credits with override (mock) --")
    print(client.get_credits())

Available Endpoint Overrides

You can override responses for any endpoint by providing the path and expected response:

client = Client.from_env(
    mock=True,
    mock_responses={
        "/v1/credits": {
            "remaining_credits": 100,
            "total_credits_used": 0,
            "mock": true
        },
        "/v1/smartscraper/start": {
            "job_id": "mock-job-123",
            "status": "processing",
            "mock": true
        },
        "/v1/smartscraper/status/mock-job-123": {
            "job_id": "mock-job-123",
            "status": "completed",
            "result": {
                "title": "Mock Title",
                "content": "Mock content from the webpage",
                "mock": true
            }
        },
        "/v1/markdownify/start": {
            "job_id": "mock-markdown-456",
            "status": "processing",
            "mock": true
        },
        "/v1/markdownify/status/mock-markdown-456": {
            "job_id": "mock-markdown-456",
            "status": "completed",
            "result": "# Mock Markdown\n\nThis is mock markdown content.",
            "mock": true
        }
    }
)

Custom Handler Functions

For more complex mocking scenarios, you can provide a custom handler function:

def mock_with_custom_handler():
    def handler(method, url, kwargs):
        return {"handled_by": "custom_handler", "method": method, "url": url}

    # Initialize the client with mock mode and custom handler
    client = Client.from_env(mock=True, mock_handler=handler)

    print("\n-- searchscraper via custom handler (mock) --")
    resp = client.searchscraper(user_prompt="Search something")
    print(resp)

Advanced Custom Handler

Create sophisticated mock responses based on request parameters:

def advanced_custom_handler():
    def smart_handler(method, url, kwargs):
        # Handle different endpoints with custom logic
        if "/v1/credits" in url:
            return {
                "remaining_credits": 50,
                "total_credits_used": 50,
                "mock": true
            }
        elif "/v1/smartscraper" in url:
            # Extract user_prompt from kwargs to create contextual responses
            user_prompt = kwargs.get("user_prompt", "")
            if "title" in user_prompt.lower():
                return {
                    "job_id": "mock-title-job",
                    "status": "completed",
                    "result": {
                        "title": "Extracted Title",
                        "content": "This is the extracted content",
                        "mock": true
                    }
                }
            else:
                return {
                    "job_id": "mock-generic-job",
                    "status": "completed",
                    "result": {
                        "data": "Generic extracted data",
                        "mock": true
                    }
                }
        else:
            return {"error": "Unknown endpoint", "url": url}

    client = Client.from_env(mock=True, mock_handler=smart_handler)
    
    # Test different scenarios
    print("Credits:", client.get_credits())
    print("Title extraction:", client.smartscraper(
        website_url="https://example.com",
        user_prompt="Extract the title"
    ))
    print("Generic extraction:", client.smartscraper(
        website_url="https://example.com",
        user_prompt="Extract some data"
    ))

Testing Best Practices

Unit Testing with Mocks

import unittest
from unittest.mock import patch
from scrapegraph_py import Client

class TestScrapeGraphAI(unittest.TestCase):
    def setUp(self):
        self.client = Client.from_env(mock=True)
    
    def test_get_credits(self):
        credits = self.client.get_credits()
        self.assertIn("remaining_credits", credits)
        self.assertIn("total_credits_used", credits)
    
    def test_smartscraper_with_schema(self):
        from pydantic import BaseModel, Field
        
        class TestSchema(BaseModel):
            title: str = Field(description="Page title")
            content: str = Field(description="Page content")
        
        response = self.client.smartscraper(
            website_url="https://example.com",
            user_prompt="Extract title and content",
            output_schema=TestSchema
        )
        
        self.assertIsInstance(response, TestSchema)
        self.assertIsNotNone(response.title)
        self.assertIsNotNone(response.content)

if __name__ == "__main__":
    unittest.main()

Integration Testing

def test_integration_flow():
    """Test a complete workflow using mocks"""
    client = Client.from_env(
        mock=True,
        mock_responses={
            "/v1/credits": {"remaining_credits": 10, "total_credits_used": 90, "mock": true},
            "/v1/smartscraper/start": {
                "job_id": "test-job-123",
                "status": "processing",
                "mock": true
            },
            "/v1/smartscraper/status/test-job-123": {
                "job_id": "test-job-123",
                "status": "completed",
                "result": {
                    "title": "Test Page",
                    "content": "Test content",
                    "mock": true
                }
            }
        }
    )
    
    # Test the complete flow
    credits = client.get_credits()
    assert credits["remaining_credits"] == 10
    
    # Start a scraping job
    job = client.smartscraper(
        website_url="https://example.com",
        user_prompt="Extract title and content"
    )
    
    # Check job status
    status = client.get_smartscraper("test-job-123")
    assert status["status"] == "completed"
    assert "title" in status["result"]

Environment Variables

You can also control mocking through environment variables:

# Enable mock mode via environment variable
export SGAI_MOCK=true

# Set custom mock responses (JSON format)
export SGAI_MOCK_RESPONSES='{"\/v1\/credits": {"remaining_credits": 100, "mock": true}}'

# The client will automatically detect mock mode from environment
client = Client.from_env()  # Will use mock mode if SGAI_MOCK=true

Async Mocking

Mocking works seamlessly with async clients:

import asyncio
from scrapegraph_py import AsyncClient

async def async_mock_example():
    async with AsyncClient(mock=True) as client:
        # All async methods work with mocks
        credits = await client.get_credits()
        print(f"Mock credits: {credits}")
        
        response = await client.smartscraper(
            website_url="https://example.com",
            user_prompt="Extract data"
        )
        print(f"Mock response: {response}")

# Run the async example
asyncio.run(async_mock_example())

HTTP Method Mocking with cURL

You can also test ScrapeGraphAI endpoints directly using cURL with mock responses. This is useful for testing API integrations without using SDKs.

Basic cURL Mock Usage

# Enable mock mode via environment variable
export SGAI_MOCK=true

# Test credits endpoint with mock
curl -X GET "https://api.scrapegraph.ai/v1/credits" \
  -H "Authorization: Bearer $SGAI_API_KEY" \
  -H "Content-Type: application/json"

Custom Mock Responses with cURL

# Set custom mock responses via environment variable
export SGAI_MOCK_RESPONSES='{
  "/v1/credits": {
    "remaining_credits": 100,
    "total_credits_used": 0,
    "mock": true
  },
}'

# Test smartscraper endpoint
curl -X POST "https://api.scrapegraph.ai/v1/smartscraper/" \
  -H "Authorization: Bearer $SGAI_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "website_url": "https://example.com",
    "user_prompt": "Extract title and content"
    "mock": true
  }'

Testing Different HTTP Methods

# POST request - to smartCrawler
curl --location 'https://api.scrapegraphai.com/v1/smartscraper' \
--data '{
    "website_url": "https://www.scrapegraphai.com//",
    "user_prompt": "Extract founder info ",
    "mock":true
}'

# POST request - to Markdownify
curl --location 'https://api.scrapegraphai.com/v1/markdownify' \
--data '{
    "website_url": "https://www.scrapegraphai.com//",
    "mock":true
}'

# POST request - to SearchScraper
curl --location 'https://api.scrapegraphai.com/v1/searchscraper' \
--data '{
    "website_url": "https://www.scrapegraphai.com//",
    "mock":true
    "output_schema":{},
    "num_results":3, 
}'

JavaScript SDK Mocking

The JavaScript SDK also supports comprehensive mocking capabilities:

Basic Mock Usage

import { 
  scrape, 
  smartScraper,
  getCredits,
  enableMock 
} from 'scrapegraph-js';

// Enable mock mode globally
enableMock();

async function basicMockUsage() {
  console.log('\n=== Basic Mock Usage ===');
  
  try {
    // Test scrape endpoint
    const scrapeResult = await scrape(API_KEY, 'https://example.com', { renderHeavyJs: true });
    console.log('Scrape result:', scrapeResult);
    
    // Test smartScraper endpoint
    const smartResult = await smartScraper(API_KEY, 'https://example.com', 'Extract the title');
    console.log('SmartScraper result:', smartResult);
    
    // Test getCredits endpoint
    const credits = await getCredits(API_KEY);
    console.log('Credits:', credits);
    
  } catch (error) {
    console.error('Error in basic mock usage:', error.message);
  }
}

Custom Mock Responses

import { setMockResponses } from 'scrapegraph-js';

// Set custom responses for specific endpoints
setMockResponses({
  '/v1/credits': {
    remaining_credits: 42,
    total_credits_used: 58,
    custom_field: 'This is a custom response',
    mock: true
  },
  '/v1/smartscraper': () => ({
    request_id: 'custom-mock-request-id',
    custom_data: 'Generated by custom function',
    mock: true
  })
});

async function mockWithCustomResponses() {
  try {
    // Test credits with custom response
    const credits = await getCredits(API_KEY);
    console.log('Custom credits:', credits);
    
    // Test smartScraper with custom response
    const smartResult = await smartScraper(API_KEY, 'https://example.com', 'Extract data');
    console.log('Custom smartScraper result:', smartResult);
    
  } catch (error) {
    console.error('Error in custom responses:', error.message);
  }
}

Custom Mock Handler

import { setMockHandler } from 'scrapegraph-js';

// Set a custom handler that overrides all responses
setMockHandler((method, url) => {
  return {
    custom_handler: true,
    method: method,
    url: url,
    timestamp: new Date().toISOString(),
    message: 'This response was generated by a custom handler',
    mock: true
  };
});

async function mockWithCustomHandler() {
  try {
    const scrapeResult = await scrape(API_KEY, 'https://example.com');
    console.log('Scrape with custom handler:', scrapeResult);
    
    const smartResult = await smartScraper(API_KEY, 'https://example.com', 'Test prompt');
    console.log('SmartScraper with custom handler:', smartResult);
    
  } catch (error) {
    console.error('Error in custom handler:', error.message);
  }
}

Per-Request Mock Mode

import { disableMock } from 'scrapegraph-js';

// Disable global mock mode
disableMock();

async function perRequestMockMode() {
  try {
    // Test individual requests with mock enabled
    const scrapeResult = await scrape(API_KEY, 'https://example.com', { mock: true });
    console.log('Per-request mock scrape:', scrapeResult);
    
    const smartResult = await smartScraper(API_KEY, 'https://example.com', 'Test', null, null, null, null, { mock: true });
    console.log('Per-request mock smartScraper:', smartResult);
    
  } catch (error) {
    console.error('Error in per-request mock mode:', error.message);
  }
}

Environment Variable Activation

// Set environment variable to enable mock mode
// SGAI_MOCK=1 node your-script.js

async function testEnvironmentActivation() {
  console.log('Current SGAI_MOCK value:', process.env.SGAI_MOCK || 'not set');
  
  // Mock mode will be automatically enabled if SGAI_MOCK=1
  const credits = await getCredits(API_KEY);
  console.log('Credits with env check:', credits);
}

Complete JavaScript Mock Example

Here’s a complete example showing all JavaScript mocking features:

/**
 * Complete JavaScript Mock Mode Example
 * 
 * This example demonstrates:
 * 1. Global mock mode activation
 * 2. Custom mock responses
 * 3. Custom mock handlers
 * 4. Per-request mock mode
 * 5. Environment variable activation
 * 6. Testing all available endpoints
 */

import { 
  scrape, 
  getScrapeRequest,
  smartScraper,
  getSmartScraperRequest,
  searchScraper,
  getSearchScraperRequest,
  markdownify,
  getMarkdownifyRequest,
  crawl,
  getCrawlRequest,
  agenticScraper,
  getAgenticScraperRequest,
  getCredits,
  sendFeedback,
  initMockConfig, 
  enableMock, 
  disableMock, 
  setMockResponses, 
  setMockHandler 
} from 'scrapegraph-js';

const API_KEY = process.env.SGAI_API_KEY || 'sgai-00000000-0000-0000-0000-000000000000';

async function completeMockDemo() {
  console.log('🧪 ScrapeGraph AI JavaScript SDK - Mock Mode Demo');
  console.log('================================================');
  
  // 1. Basic mock usage
  console.log('\n1. Basic Mock Usage:');
  enableMock();
  
  const credits = await getCredits(API_KEY);
  console.log('✅ Credits:', credits);
  
  const scrapeResult = await scrape(API_KEY, 'https://example.com');
  console.log('✅ Scrape:', scrapeResult.request_id ? 'Success' : 'Failed');
  
  // 2. Custom responses
  console.log('\n2. Custom Mock Responses:');
  setMockResponses({
    '/v1/credits': {
      remaining_credits: 100,
      total_credits_used: 0,
      custom_message: 'Custom mock response',
      mock: true
    }
  });
  
  const customCredits = await getCredits(API_KEY);
  console.log('✅ Custom Credits:', customCredits);
  
  // 3. Custom handler
  console.log('\n3. Custom Mock Handler:');
  setMockHandler((method, url) => ({
    custom_handler: true,
    method,
    url,
    timestamp: new Date().toISOString(),
    mock: true
  }));
  
  const handlerResult = await smartScraper(API_KEY, 'https://example.com', 'Test');
  console.log('✅ Custom Handler Result:', handlerResult);
  
  // 4. Per-request mock
  console.log('\n4. Per-Request Mock Mode:');
  disableMock();
  
  const perRequestResult = await scrape(API_KEY, 'https://example.com', { mock: true });
  console.log('✅ Per-Request Mock:', perRequestResult.request_id ? 'Success' : 'Failed');
  
  console.log('\n🎉 Complete mock demo finished!');
}

// Run the demo
completeMockDemo().catch(console.error);

SDK Comparison

Python SDK

Client(mock=True) initialization
mock_responses parameter for overrides
mock_handler for custom logic
Environment variable: SGAI_MOCK=true

JavaScript SDK

enableMock() global activation
setMockResponses() for overrides
setMockHandler() for custom logic
Environment variable: SGAI_MOCK=1

cURL/HTTP

Environment variable: SGAI_MOCK=true
SGAI_MOCK_RESPONSES for custom responses
Direct HTTP method testing
No SDK dependencies required

Feature Comparison

Feature	Python SDK	JavaScript SDK	cURL/HTTP
Global Mock Mode	`Client(mock=True)`	`enableMock()`	`SGAI_MOCK=true`
Per-Request Mock	`{mock: True}` in params	`{mock: true}` in options	N/A
Custom Responses	`mock_responses` dict	`setMockResponses()`	`SGAI_MOCK_RESPONSES`
Custom Handler	`mock_handler` function	`setMockHandler()`	N/A
Environment Variable	`SGAI_MOCK=true`	`SGAI_MOCK=1`	`SGAI_MOCK=true`
Async Support	`AsyncClient(mock=True)`	Native async/await	N/A
HTTP Methods	SDK abstraction	SDK abstraction	Direct GET/POST/PUT/DELETE
Dependencies	Python SDK required	JavaScript SDK required	None

Limitations

You can’t test real-time scraping performance in mock mode.
Mock responses don’t reflect actual website changes or dynamic content.
Rate limiting and credit consumption are not simulated in mock mode.
Some advanced features may behave differently in mock mode compared to live mode.

Troubleshooting

Common Issues

Mock responses not working

Ensure mock=True is set when initializing the client
Check that your mock response paths match the actual API endpoints
Verify the response format matches the expected schema

Custom handler not being called

Make sure you’re passing the mock_handler parameter correctly
Check that your handler function accepts the correct parameters: (method, url, kwargs)
Ensure the handler returns a valid response object

Schema validation errors

Mock responses must match the expected Pydantic schema structure
Use the same field names and types as defined in your schema
Test your mock responses with the actual schema classes

Examples

Complete Mock Example

Here’s a complete example showing all mocking features:

from scrapegraph_py import Client
from scrapegraph_py.logger import sgai_logger
from pydantic import BaseModel, Field
from typing import List

# Set up logging
sgai_logger.set_logging(level="INFO")

class ProductInfo(BaseModel):
    name: str = Field(description="Product name")
    price: str = Field(description="Product price")
    features: List[str] = Field(description="Product features")

def complete_mock_demo():
    # Initialize with comprehensive mock responses
    client = Client.from_env(
        mock=True,
        mock_responses={
            "/v1/credits": {
                "remaining_credits": 25,
                "total_credits_used": 75,
                "mock": true
            },
            "/v1/smartscraper/start": {
                "job_id": "demo-job-789",
                "status": "processing",
                "mock": true
            },
            "/v1/smartscraper/status/demo-job-789": {
                "job_id": "demo-job-789",
                "status": "completed",
                "result": {
                    "name": "iPhone 15 Pro",
                    "price": "$999",
                    "features": [
                        "A17 Pro chip",
                        "48MP camera system",
                        "Titanium design",
                        "Action Button"
                    ],
                    "mock": true
                }
            }
        }
    )
    
    print("=== ScrapeGraphAI Mock Demo ===\n")
    
    # Test credits endpoint
    print("1. Checking credits:")
    credits = client.get_credits()
    print(f"   Remaining: {credits['remaining_credits']}")
    print(f"   Used: {credits['total_credits_used']}\n")
    
    # Test smartscraper with schema
    print("2. Extracting product information:")
    product = client.smartscraper(
        website_url="https://apple.com/iphone-15-pro",
        user_prompt="Extract product name, price, and key features",
        output_schema=ProductInfo
    )
    
    print(f"   Product: {product.name}")
    print(f"   Price: {product.price}")
    print("   Features:")
    for feature in product.features:
        print(f"     - {feature}")
    
    print("\n3. Testing markdownify:")
    markdown = client.markdownify(website_url="https://example.com")
    print(f"   Markdown length: {len(markdown)} characters")
    
    print("\n=== Demo Complete ===")

if __name__ == "__main__":
    complete_mock_demo()

Support

GitHub Issues

Report bugs or request features

Documentation

Python SDK documentation

Need help with mocking? Check out our Python SDK documentation or join our Discord community for support.

Get Started

Services

Official SDKs

Integrations

Contribute

Resources

Mock Mode

Custom Responses

​Overview

​Use cases

​Test in mock mode

​Basic Mock Usage

​Custom Response Overrides

​Custom Handler Functions

​Testing Best Practices

​Unit Testing with Mocks

​Integration Testing

​Environment Variables

​Async Mocking

​HTTP Method Mocking with cURL

​Basic cURL Mock Usage

​Custom Mock Responses with cURL

​Testing Different HTTP Methods

​JavaScript SDK Mocking

​Basic Mock Usage

​Custom Mock Responses

​Custom Mock Handler

​Per-Request Mock Mode

​Environment Variable Activation

​SDK Comparison

Python SDK

JavaScript SDK

cURL/HTTP

​Feature Comparison

​Limitations

​Troubleshooting

​Mock responses not working

​Custom handler not being called

​Schema validation errors

​Examples

​Support

GitHub Issues

Documentation

Overview

Use cases

Test in mock mode

Basic Mock Usage

Custom Response Overrides

Custom Handler Functions

Testing Best Practices

Unit Testing with Mocks

Integration Testing

Environment Variables

Async Mocking

HTTP Method Mocking with cURL

Basic cURL Mock Usage

Custom Mock Responses with cURL

Testing Different HTTP Methods

JavaScript SDK Mocking

Basic Mock Usage

Custom Mock Responses

Custom Mock Handler

Per-Request Mock Mode

Environment Variable Activation

SDK Comparison

Feature Comparison

Limitations

Troubleshooting

Mock responses not working

Custom handler not being called

Schema validation errors

Examples

Support