MCP

Automate Web Interactions with Cloud Browsers

Browserbase MCP server for LLMs to automate cloud browser tasks, extract data, and take screenshots.

Works with Browserbase, Smithery, Playwright, Puppeteer

Spark score: 9 out of 100
Updated 6 months ago
Version 1.0.0

Why it matters

Leverage cloud browser automation to enable LLMs to interact with web pages, extract structured data, and perform automated actions with precision.

Outcomes

What it gets done

01. Control cloud browsers via Browserbase for web automation.
02. Extract structured data from any webpage.
03. Navigate, click, and fill forms on web pages.
04. Take full-page and element screenshots.

Install

Add it to your toolbox

Run in your project directory:

curl -fsSL https://spark.entire.vc/get/vb-browserbase | bash

Capabilities

Tools your agent gets

navigate

Navigate to a specified URL in the browser

take_screenshot

Capture a full-page or element screenshot from the browser

extract_data

Extract structured data from a webpage

click

Click on an element in the browser

fill_form

Fill form fields with specified values

create_session

Create a new browser session

close_session

Close an active browser session

manage_session

Manage and control browser sessions
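
Under the Model Context Protocol, a client invokes a tool by sending a JSON-RPC 2.0 `tools/call` request. The sketch below builds such a request for the `navigate` tool. The tool names come from the list above; the helper `build_tool_call` and the `url` argument name are assumptions for illustration, since this page does not list the server's input schemas (a client can discover them with a `tools/list` request).

```python
import json
from itertools import count

# Tool names taken from the capabilities list above.
KNOWN_TOOLS = {
    "navigate", "take_screenshot", "extract_data", "click",
    "fill_form", "create_session", "close_session", "manage_session",
}

_request_ids = count(1)  # each JSON-RPC request on a connection needs its own id


def build_tool_call(name: str, arguments: dict) -> dict:
    """Build an MCP `tools/call` request (JSON-RPC 2.0) for one tool.

    Hypothetical helper: only the tool name is checked. The argument keys
    are NOT validated against the server's real input schema.
    """
    if name not in KNOWN_TOOLS:
        raise ValueError(f"unknown tool: {name}")
    return {
        "jsonrpc": "2.0",
        "id": next(_request_ids),
        "method": "tools/call",
        "params": {"name": name, "arguments": arguments},
    }


if __name__ == "__main__":
    # "url" is an assumed argument name for the navigate tool.
    request = build_tool_call("navigate", {"url": "https://example.com"})
    print(json.dumps(request, indent=2))
```

In practice an MCP client library sends this envelope for you; the sketch only shows what travels over the wire.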

Overview

Browserbase MCP

What it does

The Browserbase MCP server provides cloud browser automation capabilities for LLMs. It allows AI models to interact with web pages, perform actions like navigation and form filling, extract structured data, and capture full-page or element-specific screenshots. It works with multiple model providers (OpenAI, Claude, Gemini, and others) and can annotate screenshots to help models handle complex DOMs.

How it connects

Use the Browserbase MCP server when you need LLMs to perform automated tasks on websites, extract specific information from web pages, or drive a browser on behalf of your application. It suits tasks that require precise web navigation, data scraping, and automated form submission.

Source README

The Browserbase MCP server provides cloud browser automation capabilities using Browserbase and Stagehand. It enables LLMs to interact with web pages, take screenshots, extract information, and perform automated actions with atomic precision.

Installation

npm install -g @browserbasehq/mcp-server-browserbase

Configuration

Remote Server (Recommended)

Get a hosted URL from Smithery with LLM costs included:

{
  "mcpServers": {
    "browserbase": {
      "type": "http",
      "url": "your-smithery-url.com"
    }
  }
}

Local Server

{
  "mcpServers": {
    "browserbase": {
      "command": "npx",
      "args": ["@browserbasehq/mcp-server-browserbase"],
      "env": {
        "BROWSERBASE_API_KEY": "your-api-key",
        "BROWSERBASE_PROJECT_ID": "your-project-id",
        "GEMINI_API_KEY": "your-gemini-key"
      }
    }
  }
}
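
To keep real keys out of a checked-in config file, the local server entry can be generated from environment variables. The following is a minimal sketch, not part of the server itself: `local_server_config` is a hypothetical helper, and treating all three variables as required is an assumption (the Gemini key may only be needed when the default Gemini model is used).

```python
import json
import os

# Variable names copied from the local server config above.
# Assumption: all three are treated as required here.
REQUIRED_ENV = ("BROWSERBASE_API_KEY", "BROWSERBASE_PROJECT_ID", "GEMINI_API_KEY")


def local_server_config(env: dict) -> dict:
    """Return the `mcpServers` block for a local server, filled from `env`."""
    missing = [key for key in REQUIRED_ENV if not env.get(key)]
    if missing:
        raise KeyError(f"missing environment variables: {', '.join(missing)}")
    return {
        "mcpServers": {
            "browserbase": {
                "command": "npx",
                "args": ["@browserbasehq/mcp-server-browserbase"],
                "env": {key: env[key] for key in REQUIRED_ENV},
            }
        }
    }


if __name__ == "__main__":
    try:
        print(json.dumps(local_server_config(dict(os.environ)), indent=2))
    except KeyError as err:
        print(f"cannot build config: {err}")
```

Failing early on a missing variable gives a clearer error than letting the server start with an empty key.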

Features

| Feature | Description |
| --- | --- |
| Browser Automation | Control cloud browsers via Browserbase |
| Data Extraction | Extract structured data from any webpage |
| Web Interaction | Navigate, click, and fill forms |
| Screenshots | Full-page and element screenshots |
| Model Flexibility | Supports OpenAI, Claude, Gemini, and more |
| Vision Support | Annotated screenshots for complex DOMs |
| Session Management | Create, manage, and close browser sessions |
| High Performance | 20-40% faster with automatic caching (v3) |

Stagehand v3 Features

  • Targeted extraction across iframes and shadow roots
  • CSS selector support with improved element targeting
  • Multi-browser support (Playwright, Puppeteer, Patchright)
  • Built-in primitives: page, locator, frameLocator, deepLocator
  • Experimental features with --experimental flag

Configuration Options

| Flag | Description |
| --- | --- |
| --proxies | Enable Browserbase proxies |
| --advancedStealth | Enable advanced stealth mode (Scale Plan) |
| --keepAlive | Enable keep-alive sessions |
| --contextId | Specify a Browserbase Context ID |
| --browserWidth | Viewport width (default: 1024) |
| --browserHeight | Viewport height (default: 768) |
| --modelName | AI model (default: gemini-2.0-flash) |
| --experimental | Enable experimental features |
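
These flags would be appended to the `args` array of the local server config. The sketch below builds that array from keyword options. The flag names come from the options listed above; the helper `build_args` is hypothetical, and two details are assumptions to verify against the server's documentation: that the "Enable ..." flags are bare switches, and that values are passed as a separate argument (`--browserWidth 1280`).

```python
PACKAGE = "@browserbasehq/mcp-server-browserbase"

# Assumption: "Enable ..." flags are bare switches with no value.
BOOLEAN_FLAGS = ("proxies", "advancedStealth", "keepAlive", "experimental")
# Assumption: these flags take their value as the next argument.
VALUE_FLAGS = ("contextId", "browserWidth", "browserHeight", "modelName")


def build_args(**options) -> list:
    """Build the `args` list for the local server from keyword options."""
    args = [PACKAGE]
    for name, value in options.items():
        if name in BOOLEAN_FLAGS:
            if value:  # a falsy value simply omits the switch
                args.append(f"--{name}")
        elif name in VALUE_FLAGS:
            args.extend([f"--{name}", str(value)])
        else:
            raise ValueError(f"unknown option: {name}")
    return args


if __name__ == "__main__":
    print(build_args(proxies=True, browserWidth=1280, browserHeight=800))
```

Rejecting unknown option names catches typos such as `browserwidth` before they reach the server.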

Usage Example

Claude, navigate to example.com, take a screenshot,
and extract all product prices from the page.
