Web Unlocker API

Web Unlocker is a powerful scraping API that allows access to any website while bypassing sophisticated bot protections. You can retrieve clean HTML/JSON responses with a single API call without managing complex anti-bot infrastructure.

import requests

API_URL = "https://api.brightdata.com/request"
API_TOKEN = "INSERT_YOUR_API_TOKEN"
ZONE_NAME = "INSERT_YOUR_WEB_UNLOCKER_ZONE_NAME"
TARGET_URL = "http://lumtest.com/myip.json"

headers = {
    "Content-Type": "application/json",
    "Authorization": f"Bearer {API_TOKEN}"
}

payload = {
    "zone": ZONE_NAME,
    "url": TARGET_URL,
    "format": "raw"
}

response = requests.post(API_URL, headers=headers, json=payload)

if response.status_code == 200:
    print("Success:", response.text)
else:
    print(f"Error {response.status_code}: {response.text}")

Native Proxy-based Access

Alternative method using proxy-based routing.

Example: cURL Command

curl "http://lumtest.com/myip.json" \
--proxy "brd.superproxy.io:33335" \
--proxy-user "brd-customer-<CUSTOMER_ID>-zone-<ZONE_NAME>:<ZONE_PASSWORD>"

Required credentials:

Customer ID: Found in Account settings
Web Unlocker API zone name: Found in the overview tab
Web Unlocker API password: Found in the overview tab

Example: Python Script

import requests

customer_id = "<customer_id>"
zone_name = "<zone_name>"
zone_password = "<zone_password>"

host = "brd.superproxy.io"
port = 33335
proxy_url = f"http://brd-customer-{customer_id}-zone-{zone_name}:{zone_password}@{host}:{port}"

proxies = {"http": proxy_url, "https": proxy_url}

response = requests.get("http://lumtest.com/myip.json", proxies=proxies)

if response.status_code == 200:
    print(response.json())
else:
    print(f"Error: {response.status_code}")

Practical Example: Scraping G2 Reviews

Let's see how to scrape reviews from G2.com, a site heavily protected by Cloudflare.

Basic Request (Without Web Unlocker)

Using a simple Python script to scrape G2 reviews:

import requests
from bs4 import BeautifulSoup

url = 'https://www.g2.com/products/mongodb/reviews'
response = requests.get(url)

if response.status_code == 200:
    soup = BeautifulSoup(response.text, "lxml")
    headings = soup.find_all('h2')
    
    if headings:
        print("\nHeadings Found:")
        for heading in headings:
            print(f"- {heading.get_text(strip=True)}")
    else:
        print("No headings found")
else:
    print("Request blocked")

Result: The script fails (403 error) due to Cloudflare’s anti-bot measures.

Enhanced Request (With Web Unlocker)

To bypass such restrictions, use Web Unlocker. Below is a Python implementation:

Direct API Access

import requests
from bs4 import BeautifulSoup

API_URL = "https://api.brightdata.com/request"
API_TOKEN = "INSERT_YOUR_API_TOKEN"
ZONE_NAME = "INSERT_YOUR_ZONE"
TARGET_URL = "https://www.g2.com/products/mongodb/reviews"

headers = {
    "Content-Type": "application/json",
    "Authorization": f"Bearer {API_TOKEN}"
}
payload = {"zone": ZONE_NAME, "url": TARGET_URL, "format": "raw"}

response = requests.post(API_URL, headers=headers, json=payload)

if response.status_code == 200:
    soup = BeautifulSoup(response.text, "lxml")
    headings = [h.get_text(strip=True) for h in soup.find_all('h2')]
    print("\nExtracted Headings:", headings)
else:
    print(f"Error {response.status_code}: {response.text}")

Result: Successfully bypasses protection, retrieves content with status 200.

Proxy-Based Access

Alternatively, use the proxy-based method:

import requests
from bs4 import BeautifulSoup

proxy_url = "http://brd-customer-<customer_id>-zone-<zone_name>:<zone_password>@brd.superproxy.io:33335"
proxies = {"http": proxy_url, "https": proxy_url}

url = "https://www.g2.com/products/mongodb/reviews"
response = requests.get(url, proxies=proxies, verify=False)

if response.status_code == 200:
    soup = BeautifulSoup(response.text, "lxml")
    headings = [h.get_text(strip=True) for h in soup.find_all('h2')]
    print("\nExtracted Headings:", headings)
else:
    print(f"Error {response.status_code}: {response.text}")

Note: Suppress SSL certificate warnings by adding:

from requests.packages.urllib3.exceptions import InsecureRequestWarning
requests.packages.urllib3.disable_warnings(InsecureRequestWarning)

Waiting for Specific Elements

Use the x-unblock-expect header to wait for specific elements or text:

headers["x-unblock-expect"] = '{"element": ".star-wrapper__desc"}'
# or
headers["x-unblock-expect"] = '{"text": "reviews"}'

👉 You can find the complete code in g2_wait.py

Mobile User-Agent Targeting

To use mobile user agents instead of desktop ones, append -ua-mobile to your username:

username = f"brd-customer-{customer_id}-zone-{zone_name}-ua-mobile"

👉 You can find the complete code in g2_mobile.py

Geolocation Targeting

While Web Unlocker automatically selects optimal IP locations, you can specify target locations:

username = f"brd-customer-{customer_id}-zone-{zone_name}-country-us"
username = f"brd-customer-{customer_id}-zone-{zone_name}-country-us-city-sanfrancisco"

👉 You can learn more here.

Debugging Requests

Enable detailed debugging information by adding the -debug-full flag:

username = f"brd-customer-{customer_id}-zone-{zone_name}-debug-full"

👉 You can find the complete code in g2_debug.py

Success Rate Statistics

Monitor API success rates for specific domains:

import requests

API_TOKEN = "INSERT_YOUR_API_TOKEN"

def get_success_rate(domain):
    url = f"https://api.brightdata.com/unblocker/success_rate/{domain}"
    headers = {
        "Content-Type": "application/json",
        "Authorization": f"Bearer {API_TOKEN}"
    }
    response = requests.get(url, headers=headers)
    print(response.json() if response.status_code == 200 else response.text)

get_success_rate("g2.com") # Get statistics for specific domain
get_success_rate("g2.*") # Get statistics for all top-level domains

Final Notes

Web Unlocker lets you scrape even the most protected websites effortlessly. Key points to remember:

Not Compatible With:
- Browsers (Chrome, Firefox, Edge)
- Anti-detect browsers (Adspower, Multilogin)
- Automation tools (Puppeteer, Playwright, Selenium)
Use Scraping Browser:
For browser-based automation, use Bright Data’s Scraping Browser.
Premium Domains:
Access challenging sites with premium domain features.
CAPTCHA Solving:
Solved automatically, but can be disabled. Learn more about Bright Data's CAPTCHA Solver.
Custom Headers & Cookies:
Send your own to target specific site versions. Learn more.

Visit the official documentation for more details.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
src		src
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Web Unlocker API

Table of Contents

Features

Getting Started

Direct API Access

Native Proxy-based Access

Practical Example: Scraping G2 Reviews

Basic Request (Without Web Unlocker)

Enhanced Request (With Web Unlocker)

Direct API Access

Proxy-Based Access

Waiting for Specific Elements

Mobile User-Agent Targeting

Geolocation Targeting

Debugging Requests

Success Rate Statistics

Final Notes

About

Languages

luminati-io/web-unlocker-api

Folders and files

Latest commit

History

Repository files navigation

Web Unlocker API

Table of Contents

Features

Getting Started

Direct API Access

Native Proxy-based Access

Practical Example: Scraping G2 Reviews

Basic Request (Without Web Unlocker)

Enhanced Request (With Web Unlocker)

Direct API Access

Proxy-Based Access

Waiting for Specific Elements

Mobile User-Agent Targeting

Geolocation Targeting

Debugging Requests

Success Rate Statistics

Final Notes

About

Topics

Resources

Stars

Watchers

Forks

Languages