The Browser API
Built for AI Agents

Control real browsers through a simple REST API. Get structured page data, stable element refs, and change diffs instead of raw HTML.

Stability detection built in
100x fewer tokens per page
Diffs after every action
No credit card required. 1 hour of free runtime included.

See the API in Action

Six steps to show the full lifecycle: create a session, observe the page, fill forms, extract data, scroll, and screenshot.

POST /v1/sessions
{
  "url": "https://app.example.com/login",
  "viewport": {
    "width": 1280,
    "height": 720
  },
  "auto_dismiss_blockers": true
}
201 Created
{
  "session_id": "ses_abc123def456",
  "page": {
    "url": "https://app.example.com/login",
    "title": "Log In",
    "stable": true,
    "markdown": {
      "content": "# Log In\n\nWelcome back..."
    },
    "interactive_elements": [
      { "ref": "e1", "tag": "input",
        "label": "Email" },
      { "ref": "e2", "tag": "input",
        "label": "Password" },
      { "ref": "e3", "tag": "button",
        "label": "Sign In" }
    ],
    "forms": [
      { "id": "login", "fields": 2 }
    ]
  },
  "blockers_dismissed": ["cookie_consent"]
}
Without Browserbeam

Launch Puppeteer, set viewport, navigate, wait for networkidle, detect and dismiss cookie banner (2-3 extra actions), call page.content(), parse 15,000+ character HTML with cheerio, manually extract form fields.

~25 lines of code. ~3,500 tokens for the raw HTML alone.

With Browserbeam

One POST request. Navigate, auto-dismiss the cookie banner, return markdown content, element refs, and form structures. The page is ready for your agent to read and act on.

1 API call. Markdown + refs + forms. ~150 tokens.

An Intelligence Layer for the Browser

Six capabilities that sit between your agent and the page, so the LLM spends tokens on the task, not on browser overhead.

Stability Detection

Every response includes a stability signal that tells your agent when the page is fully loaded and ready. No more guessing wait times or burning tokens on premature reads.

Element Registry

Interactive elements get short, stable refs like e1, e2, e3. Your agent clicks by ref instead of constructing fragile CSS selectors.

Diff Tracking

After each action, the API returns only what changed: elements added, removed, or modified. Your agent reads a 30-token diff instead of re-parsing the entire page.

Blocker Dismissal

Cookie banners, newsletter popups, and chat widgets are detected and dismissed automatically. Your agent never wastes actions on interruptions irrelevant to the task.

Semantic Extraction

Pages are compressed into a structured, token-efficient representation: interactive elements, headings, and visible text. Thousands of DOM nodes become a compact JSON object.

Error Enrichment

When an action fails, you get context, not just "element not found." The API tells you if an overlay is blocking the target, if a CAPTCHA appeared, and what to do next.

What Will You Build?

One API, many possibilities. From autonomous agents to data pipelines, Browserbeam gives your code a browser it can see through.

Give your AI agent a real browser it can see and control. Structured page data, interactive element refs, and markdown content your model can read.

Request
POST /v1/sessions

{
  "url": "https://app.example.com/login",
  "viewport": {
    "width": 1280,
    "height": 720
  },
  "auto_dismiss_blockers": true
}
Response
{
  "session_id": "ses_abc123def456",
  "page": {
    "url": "https://app.example.com/login",
    "title": "Log In",
    "stable": true,
    "markdown": {
      "content": "# Log In\n\nWelcome back..."
    },
    "interactive_elements": [
      { "ref": "e1", "tag": "input", "label": "Email" },
      { "ref": "e2", "tag": "input", "label": "Password" },
      { "ref": "e3", "tag": "button", "label": "Sign In" }
    ],
    "forms": [
      { "id": "login", "fields": 2 }
    ]
  },
  "blockers_dismissed": ["cookie_consent"]
}

Simple Runtime Billing

Pay for the time your sessions are open. Start with a 1-hour free trial. No credit card needed.

Starter

For individuals and side projects

$29 / month
  • 100 hours runtime / month
  • 25 concurrent sessions
  • 15-minute max session length
  • $0.10/hr overage
Get Started
Best Value

Pro

For teams and production use

$99 / month
  • 500 hours runtime / month
  • 100 concurrent sessions
  • 30-minute max session length
  • $0.08/hr overage
Get Started

Scale

For agencies and high-volume use

$199 / month
  • 1,500 hours runtime / month
  • 200 concurrent sessions
  • 1-hour max session length
  • $0.06/hr overage
Get Started
Billed monthly by runtime. Cancel anytime.
Prices exclude VAT where applicable.

Frequently Asked Questions

How does the browser API work?

Browserbeam provides a REST API that gives your LLM or automation code control over a real browser. Create a session, navigate to any URL, observe the page as structured JSON, interact with elements using short refs, and get change diffs after every action. You send HTTP requests. We handle the browser infrastructure.

How is this different from Browserless or Browserbase?

Those services give you hosted Playwright or Puppeteer. You still get raw HTML, guess at wait times, and parse the DOM yourself. Browserbeam adds an intelligence layer on top: stability detection, semantic extraction, element refs, diff tracking, and blocker dismissal. Your LLM gets structured, compact data instead of a wall of HTML.

What can I build with Browserbeam?

LLM-powered web agents, automated form filling, web scraping, QA testing, screenshot and PDF generation, and any workflow where AI needs to see and interact with web pages. Browserbeam handles the browser so your code stays simple.

How does pricing work?

You pay for runtime: the wall-clock time your browser sessions are open. No credits, no units, no bandwidth metering. The free trial gives you 1 hour of total runtime to test the full API. Paid plans include monthly runtime allowances with per-hour overage billing, so you never get cut off mid-session.

Is my data private and secure?

Every session runs in an isolated browser context with separate cookies, storage, and cache. No CDP port is exposed. Sessions are destroyed when closed and we never store page content beyond the session lifetime. API keys are hashed. Your usage data is accessible only to you.

Do I need to manage Playwright or Puppeteer?

No. Browserbeam is a REST API. You send JSON, you get JSON back. No browser libraries to install, no Chrome binaries to maintain, no WebSocket connections to manage. If your code can make HTTP requests, it can use Browserbeam.

Give Your LLM a Browser It Can Actually Use

Structured page data, stable element refs, and change diffs. One REST API.

Stability detection built in
100x fewer tokens per page
Diffs after every action
No credit card required. 1 hour of free runtime included.