Find answers to common questions about using the WebLinq API.

General Questions

WebLinq is a powerful web scraping and content extraction API that helps you:
  • Extract clean, structured content from any webpage
  • Take high-quality screenshots
  • Generate PDFs
  • Extract structured data using AI
  • Monitor websites for changes
Our API is designed to be reliable, scalable, and easy to use.
WebLinq offers flexible pricing plans:
  • Free Tier: 1,000 credits/month, perfect for testing
  • Developer: $49/month for 10,000 credits
  • Business: $199/month for 50,000 credits
  • Enterprise: Custom pricing for high-volume needs
One credit typically equals one API request. Some advanced features like AI extraction may use more credits.
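As a rough planning aid, the credit arithmetic can be sketched in Python. The plan allowances are the ones listed in this FAQ. The credit cost of an AI extraction request is not stated here, so `ai_cost` is a placeholder assumption, and the function names are purely illustrative.

```python
# Monthly credit allowances for the fixed plans listed in this FAQ.
PLAN_CREDITS = {"Free": 1_000, "Developer": 10_000, "Business": 50_000}


def credits_needed(basic_requests, ai_requests=0, ai_cost=5):
    """Estimate monthly credits.

    Assumes 1 credit per basic request. `ai_cost` (credits per AI
    extraction request) is a made-up placeholder, not a documented value.
    """
    return basic_requests + ai_requests * ai_cost


def smallest_plan(credits):
    """Return the smallest listed plan covering `credits`, else 'Enterprise'."""
    for name, allowance in sorted(PLAN_CREDITS.items(), key=lambda kv: kv[1]):
        if credits <= allowance:
            return name
    return "Enterprise"


if __name__ == "__main__":
    need = credits_needed(basic_requests=8_000, ai_requests=300)
    print(need, smallest_plan(need))
```

Replace `ai_cost` with the real per-request figure once you know it; the plan choice can change noticeably when AI extraction makes up a large share of traffic.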
Yes! You can sign up for a free account and get 1,000 credits to test the API. No credit card required. The free tier includes:
  • Full API access
  • Basic rate limits
  • Community support
  • Core features and endpoints
WebLinq is a REST API, so it works with any programming language that can make HTTP requests. We also provide official SDKs.

Technical Questions

We provide several options for JavaScript-heavy sites:
  1. Wait Until Options:
    {
      "wait_until": "networkidle0"  // Wait for network to be idle
    }
    
  2. JavaScript Execution:
    {
      "javascript": true,  // Enable JavaScript execution
      "wait_time": 5000   // Wait 5 seconds after load
    }
    
  3. Custom Triggers:
    {
      "wait_for": ".dynamic-content",  // Wait for element
      "timeout": 10000                 // Max wait time
    }
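The options above can be sent together in one JSON request body. The following Python sketch only builds the request and does not send it. The endpoint URL, the `url` field name, and the `Authorization: Bearer` header are assumptions for illustration, not confirmed parts of the API, and it is also assumed that these options may be combined in a single request.

```python
import json
import urllib.request

# Hypothetical endpoint; replace with the one from the official API reference.
API_URL = "https://api.example.com/v1/extract"


def build_request(target_url, api_key, **options):
    """Build (but do not send) a POST request carrying rendering options."""
    body = {"url": target_url, **options}  # "url" field name is assumed
    return urllib.request.Request(
        API_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",  # assumed auth scheme
        },
        method="POST",
    )


req = build_request(
    "https://example.com/app",
    api_key="YOUR_API_KEY",
    javascript=True,
    wait_until="networkidle0",
    wait_for=".dynamic-content",
    timeout=10000,
)
print(req.get_method(), json.loads(req.data))
# To actually send it: urllib.request.urlopen(req, timeout=30)
```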
    
Rate limits vary by plan:
  • Free: 60 requests/minute
  • Developer: 300 requests/minute
  • Business: 1,000 requests/minute
  • Enterprise: Custom limits
We provide rate limit info in response headers:
    X-RateLimit-Limit: 60
    X-RateLimit-Remaining: 45
    X-RateLimit-Reset: 1625097600
Implement exponential backoff for rate-limited requests.
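One possible shape for that backoff logic is sketched below in Python. It assumes that `X-RateLimit-Reset` holds a Unix timestamp in seconds (the sample value looks like one, but this FAQ does not say so) and that rate-limited responses use status 429. The `send` callable is a stand-in for whatever HTTP client you use; all function names are illustrative.

```python
import random
import time


def backoff_delay(attempt, base=1.0, cap=60.0, jitter=True):
    """Seconds to wait before retry `attempt` (0-based): base * 2**attempt, capped."""
    delay = min(cap, base * 2 ** attempt)
    # "Full jitter": pick a random point in [0, delay] to avoid retry bursts.
    return random.uniform(0, delay) if jitter else delay


def seconds_until_reset(headers, now=None):
    """Seconds until X-RateLimit-Reset, ASSUMING it is a Unix timestamp."""
    reset = headers.get("X-RateLimit-Reset")
    if reset is None:
        return None
    now = time.time() if now is None else now
    return max(0.0, float(reset) - now)


def call_with_backoff(send, max_attempts=5, sleep=time.sleep):
    """Call `send()` until it stops returning 429 or attempts run out.

    `send` is any zero-argument callable returning (status, headers, body).
    """
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    for attempt in range(max_attempts):
        status, headers, body = send()
        if status != 429 or attempt == max_attempts - 1:
            return status, body
        wait = backoff_delay(attempt)
        hint = seconds_until_reset(headers)
        if hint is not None:
            wait = max(wait, hint)  # never retry before the window resets
        sleep(wait)
```

Passing `sleep` as a parameter keeps the helper easy to test without real waiting.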
We support multiple authentication methods for target websites:
  1. Cookie-based:
    {
      "headers": {
        "Cookie": "session=abc123"
      }
    }
    
  2. Header-based:
    {
      "headers": {
        "Authorization": "Bearer token123"
      }
    }
    
  3. Form Login:
    {
      "authentication": {
        "type": "form",
        "loginUrl": "https://example.com/login",
        "username": "user",
        "password": "pass"
      }
    }
    
We employ several strategies to handle anti-bot measures:
  1. Rotating IP addresses
  2. Browser fingerprint randomization
  3. Human-like behavior patterns
  4. Automatic CAPTCHA solving (Enterprise plans)
Some websites may explicitly prohibit scraping. Always check terms of service.

Features & Capabilities

We support multiple output formats:
  • Text: Clean, formatted text
  • HTML: Raw or cleaned HTML
  • JSON: Structured data
  • Markdown: Formatted markdown
  • PDF: High-quality PDFs
  • Images: PNG, JPEG, WebP
Specify the format in your request:
{
  "format": "markdown",
  "clean": true  // Remove ads, nav, etc.
}
Yes! Use CSS selectors to target specific elements:
{
  "selectors": {
    "title": "h1.article-title",
    "author": ".author-bio h3",
    "content": "article.main-content",
    "date": "time.published-date"
  }
}
We also support:
  • XPath selectors
  • Regular expressions
  • AI-powered content extraction
We offer several pagination strategies:
  1. Auto-pagination:
    {
      "pagination": {
        "enabled": true,
        "maxPages": 5
      }
    }
    
  2. Custom Selectors:
    {
      "pagination": {
        "nextButton": ".pagination .next",
        "maxPages": 5
      }
    }
    
  3. URL Patterns:
    {
      "pagination": {
        "pattern": "page={i}",
        "start": 1,
        "end": 5
      }
    }
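To make the URL pattern strategy concrete, here is a small Python sketch of how a pattern such as `page={i}` could expand into page URLs. It assumes that `end` is inclusive and that the pattern is appended as a query parameter; neither detail is confirmed by this FAQ, and `expand_pages` is an illustrative helper, not part of the API.

```python
def expand_pages(base_url, pattern="page={i}", start=1, end=5):
    """Return one URL per page, substituting {i} with start..end (inclusive)."""
    # Use "&" if the base URL already has a query string, otherwise "?".
    sep = "&" if "?" in base_url else "?"
    urls = []
    for i in range(start, end + 1):
        query = pattern.replace("{i}", str(i))
        urls.append(base_url + sep + query)
    return urls


if __name__ == "__main__":
    for url in expand_pages("https://example.com/articles", end=3):
        print(url)
```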
    
We support geolocation-specific requests:
{
  "geolocation": {
    "country": "US",
    "city": "New York"
  }
}
Enterprise customers can also:
  • Use dedicated IP ranges
  • Specify residential proxies
  • Set custom User-Agent strings

Troubleshooting

Common error codes and solutions:
  • 401: Invalid API key
  • 403: Permission denied
  • 429: Rate limit exceeded
  • 500: Server error
  • 504: Timeout
See our Error Handling guide for details.
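A client might turn the list above into a simple lookup, as in this Python sketch. The messages come from the list; the decision about which codes are worth retrying (429, 500, 504) is an assumption based on common HTTP practice, not something this FAQ specifies.

```python
# status -> (meaning from the FAQ, whether a retry is plausibly useful)
ERRORS = {
    401: ("Invalid API key", False),      # fix credentials instead of retrying
    403: ("Permission denied", False),    # check plan or permissions
    429: ("Rate limit exceeded", True),   # retry after backing off
    500: ("Server error", True),          # assumed transient
    504: ("Timeout", True),               # retry, possibly with a longer timeout
}


def describe(status):
    """Return (message, should_retry) for an HTTP status code."""
    if 200 <= status < 300:
        return ("OK", False)
    return ERRORS.get(status, (f"Unexpected status {status}", False))


if __name__ == "__main__":
    for code in (200, 401, 429, 418):
        print(code, describe(code))
```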
Timeouts can occur for several reasons:
  1. Slow Website:
    • Increase timeout: "timeout": 30000
    • Use wait_until: "networkidle0"
  2. Complex JavaScript:
    • Enable JS: "javascript": true
    • Add wait time: "wait_time": 5000
  3. Large Pages:
    • Use selective extraction
    • Implement pagination
Try these troubleshooting steps:
  1. Check Selectors:
    • Use browser dev tools to verify
    • Try multiple selector options
  2. JavaScript Content:
    • Enable JS execution
    • Add appropriate wait conditions
  3. Dynamic Content:
    • Use wait_for with specific elements
    • Implement retry logic
If you’re getting blocked:
  1. Implement Delays:
    {
      "delay": {
        "min": 2000,
        "max": 5000
      }
    }
    
  2. Rotate User Agents:
    {
      "headers": {
        "User-Agent": "custom-agent"
      }
    }
    
  3. Use Enterprise Features:
    • Residential proxies
    • IP rotation
    • Custom headers
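The first two steps can be combined on the client side, for example by cycling through a list of User-Agent strings while keeping the same delay range on every request. The Python sketch below assumes that `delay` and `headers` may appear together in one request body, which this FAQ does not confirm; the agent strings are placeholders.

```python
import itertools

# Placeholder values; substitute real User-Agent strings.
USER_AGENTS = ["example-agent/1.0", "example-agent/2.0"]


def rotating_options(user_agents, min_ms=2000, max_ms=5000):
    """Yield request option dicts forever, cycling through `user_agents`."""
    if min_ms > max_ms:
        raise ValueError("min_ms must not exceed max_ms")
    for agent in itertools.cycle(user_agents):
        yield {
            "delay": {"min": min_ms, "max": max_ms},
            "headers": {"User-Agent": agent},
        }


if __name__ == "__main__":
    for options in itertools.islice(rotating_options(USER_AGENTS), 3):
        print(options)
```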

Business & Support

Yes! Enterprise plans include:
  • Dedicated support team
  • Custom feature development
  • SLA guarantees
  • Technical consulting
  • Custom integrations
Contact us for details.
We take data privacy seriously:
  • Data encrypted in transit and at rest
  • No persistent storage of scraped content
  • GDPR and CCPA compliant
  • Regular security audits
  • SOC 2 Type II certified
See our Privacy Policy for details.
Yes! Our API is production-ready with:
  • 99.9% uptime guarantee
  • Global CDN distribution
  • Automatic scaling
  • Real-time monitoring
  • Production support
Many companies use WebLinq in production for:
  • Content aggregation
  • Price monitoring
  • Market research
  • Data analytics
We offer multiple support channels. Enterprise customers get priority support via dedicated channels.
