Automate Web Interactions with Cloud Browsers
Browserbase MCP server for LLMs to automate cloud browser tasks, extract data, and take screenshots.
Why it matters
Leverage cloud browser automation to enable LLMs to interact with web pages, extract structured data, and perform automated actions with precision.
Outcomes
What it gets done
Control cloud browsers via Browserbase for web automation.
Extract structured data from any webpage.
Navigate, click, and fill forms on web pages.
Take full-page and element screenshots.
Install
Add it to your toolbox
Run in your project directory:
curl -fsSL https://spark.entire.vc/get/vb-browserbase | bash Capabilities
Tools your agent gets
Navigate to a specified URL in the browser
Capture a full-page or element screenshot from the browser
Extract structured data from a webpage
Click on an element in the browser
Fill form fields with specified values
Create a new browser session
Close an active browser session
Manage and control browser sessions
Overview
Browserbase MCP
What it does
The Browserbase MCP server provides cloud browser automation capabilities for LLMs. It allows AI models to interact with web pages, perform actions like navigation and form filling, extract structured data, and capture full-page or element-specific screenshots. It supports multiple AI models and offers features like vision support for complex DOMs.
How it connects
Use the Browserbase MCP server when you need to enable LLMs to perform automated tasks on websites, extract specific information from web pages, or require AI-driven browser interaction for your applications. It's suitable for tasks requiring precise web navigation, data scraping, and automated form submissions.
Source README
The Browserbase MCP server provides cloud browser automation capabilities using Browserbase and Stagehand. It enables LLMs to interact with web pages, take screenshots, extract information, and perform automated actions with atomic precision.
Installation
npm install -g @browserbasehq/mcp-server-browserbase
Configuration
Remote Server (Recommended)
Get a hosted URL from Smithery with LLM costs included:
{
"mcpServers": {
"browserbase": {
"type": "http",
"url": "your-smithery-url.com"
}
}
}
Local Server
{
"mcpServers": {
"browserbase": {
"command": "npx",
"args": ["@browserbasehq/mcp-server-browserbase"],
"env": {
"BROWSERBASE_API_KEY": "your-api-key",
"BROWSERBASE_PROJECT_ID": "your-project-id",
"GEMINI_API_KEY": "your-gemini-key"
}
}
}
}
Features
| Feature | Description |
|---|---|
| Browser Automation | Control cloud browsers via Browserbase |
| Data Extraction | Extract structured data from any webpage |
| Web Interaction | Navigate, click, and fill forms |
| Screenshots | Full-page and element screenshots |
| Model Flexibility | Supports OpenAI, Claude, Gemini, and more |
| Vision Support | Annotated screenshots for complex DOMs |
| Session Management | Create, manage, and close browser sessions |
| High Performance | 20-40% faster with automatic caching (v3) |
Stagehand v3 Features
- Targeted extraction across iframes and shadow roots
- CSS selector support with improved element targeting
- Multi-browser support (Playwright, Puppeteer, Patchright)
- Built-in primitives:
page,locator,frameLocator,deepLocator - Experimental features with
--experimentalflag
Configuration Options
| Flag | Description |
|---|---|
--proxies |
Enable Browserbase proxies |
--advancedStealth |
Enable advanced stealth mode (Scale Plan) |
--keepAlive |
Enable keep-alive sessions |
--contextId |
Specify a Browserbase Context ID |
--browserWidth |
Viewport width (default: 1024) |
--browserHeight |
Viewport height (default: 768) |
--modelName |
AI model (default: gemini-2.0-flash) |
--experimental |
Enable experimental features |
Usage Example
Claude, navigate to example.com, take a screenshot,
and extract all product prices from the page.
Resources
Discussion
Questions & comments · 0
Sign In Sign in to leave a comment.