MCP

Automate Web Interactions with Cloud Browsers

Name: Automate Web Interactions with Cloud Browsers
Availability: OnlineOnly
Author: VibeBaza

Browserbase MCP server for LLMs to automate cloud browser tasks, extract data, and take screenshots.

Connect

Works with: browserbase, smithery, playwright, puppeteer

Updated 6 months ago

Version 1.0.0

Models

claude

Why it matters

Lets LLMs drive cloud browsers to interact with web pages, extract structured data, and perform automated actions.

Outcomes

What it gets done

Control cloud browsers via Browserbase for web automation.

Extract structured data from any webpage.

Navigate, click, and fill forms on web pages.

Take full-page and element screenshots.

Install

Add it to your toolbox

Run in your project directory:

curl -fsSL https://spark.entire.vc/get/vb-browserbase | bash

Capabilities

Tools your agent gets

navigate

Navigate to a specified URL in the browser

take_screenshot

Capture a full-page or element screenshot from the browser

extract_data

Extract structured data from a webpage

click

Click on an element in the browser

fill_form

Fill form fields with specified values

create_session

Create a new browser session

close_session

Close an active browser session

manage_session

Manage and control browser sessions
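
MCP clients invoke tools like these by sending a JSON-RPC 2.0 `tools/call` request. The sketch below builds such a request with the Python standard library only. The tool name `navigate` comes from the list above; the `url` argument name is an assumption, because this listing does not give the tools' input schemas. The schema returned by the server's `tools/list` response is the place to confirm real parameter names.

```python
import json
from itertools import count

# Request ids let the client match each response to the request that caused it.
_ids = count(1)

def tool_call_request(name: str, arguments: dict) -> dict:
    """Build a JSON-RPC 2.0 request for the MCP `tools/call` method."""
    return {
        "jsonrpc": "2.0",
        "id": next(_ids),
        "method": "tools/call",
        "params": {"name": name, "arguments": arguments},
    }

# Assumed argument name: "url" is a guess at the navigate tool's input schema.
request = tool_call_request("navigate", {"url": "https://example.com"})
print(json.dumps(request, indent=2))
```

In practice an MCP client library builds and sends these messages for you; the sketch only shows what travels over the wire.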

Overview

Browserbase MCP

What it does

The Browserbase MCP server provides cloud browser automation for LLMs. It lets AI models navigate web pages, fill forms, extract structured data, and capture full-page or element-specific screenshots. It works with several model providers (OpenAI, Claude, Gemini, and more) and can use annotated screenshots to handle complex DOMs.

How it connects

Use the Browserbase MCP server when you need LLMs to perform automated tasks on websites, extract specific information from web pages, or drive a browser from within your application. It suits tasks that need precise web navigation, data scraping, or automated form submission.

Source README

The Browserbase MCP server provides cloud browser automation capabilities using Browserbase and Stagehand. It enables LLMs to interact with web pages, take screenshots, extract information, and perform automated actions with atomic precision.

Installation

npm install -g @browserbasehq/mcp-server-browserbase

Configuration

Remote Server (Recommended)

Get a hosted URL from Smithery with LLM costs included:

{
  "mcpServers": {
    "browserbase": {
      "type": "http",
      "url": "https://your-smithery-url.com"
    }
  }
}

Local Server

{
  "mcpServers": {
    "browserbase": {
      "command": "npx",
      "args": ["@browserbasehq/mcp-server-browserbase"],
      "env": {
        "BROWSERBASE_API_KEY": "your-api-key",
        "BROWSERBASE_PROJECT_ID": "your-project-id",
        "GEMINI_API_KEY": "your-gemini-key"
      }
    }
  }
}
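
To avoid pasting secrets into a config file by hand, the same structure can be generated from environment variables. The following is a minimal sketch, not part of the server itself: the helper name `local_server_config` is hypothetical, and treating all three variables as required is an assumption based on the example above (a different model may need a different key).

```python
import json

# Variables shown in the local server example; assumed to be required.
REQUIRED_VARS = ("BROWSERBASE_API_KEY", "BROWSERBASE_PROJECT_ID", "GEMINI_API_KEY")

def local_server_config(env: dict) -> dict:
    """Return the local-server config, or raise KeyError if a variable is missing."""
    missing = [name for name in REQUIRED_VARS if not env.get(name)]
    if missing:
        raise KeyError(f"missing environment variables: {', '.join(missing)}")
    return {
        "mcpServers": {
            "browserbase": {
                "command": "npx",
                "args": ["@browserbasehq/mcp-server-browserbase"],
                "env": {name: env[name] for name in REQUIRED_VARS},
            }
        }
    }

# Placeholder values for illustration only.
demo_env = {
    "BROWSERBASE_API_KEY": "your-api-key",
    "BROWSERBASE_PROJECT_ID": "your-project-id",
    "GEMINI_API_KEY": "your-gemini-key",
}
print(json.dumps(local_server_config(demo_env), indent=2))
```

For real use, pass `os.environ` instead of `demo_env` so that a missing key fails early rather than when the server starts.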

Features

Feature	Description
Browser Automation	Control cloud browsers via Browserbase
Data Extraction	Extract structured data from any webpage
Web Interaction	Navigate, click, and fill forms
Screenshots	Full-page and element screenshots
Model Flexibility	Supports OpenAI, Claude, Gemini, and more
Vision Support	Annotated screenshots for complex DOMs
Session Management	Create, manage, and close browser sessions
High Performance	20-40% faster with automatic caching (v3)

Stagehand v3 Features

Targeted extraction across iframes and shadow roots
CSS selector support with improved element targeting
Multi-browser support (Playwright, Puppeteer, Patchright)
Built-in primitives: page, locator, frameLocator, deepLocator
Experimental features with --experimental flag

Configuration Options

Flag	Description
`--proxies`	Enable Browserbase proxies
`--advancedStealth`	Enable advanced stealth mode (Scale Plan)
`--keepAlive`	Enable keep-alive sessions
`--contextId`	Specify a Browserbase Context ID
`--browserWidth`	Viewport width (default: 1024)
`--browserHeight`	Viewport height (default: 768)
`--modelName`	AI model (default: gemini-2.0-flash)
`--experimental`	Enable experimental features
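
These flags would normally be appended to the `args` array of the local server config. The sketch below shows one way to assemble that array in Python. The helper `build_args` is hypothetical, and two things are assumptions rather than documented behavior: which flags take a value (inferred from the descriptions in the table) and that a value is passed as a separate array item after its flag. The server's `--help` output is the authority on both.

```python
# Flag names come from the table above. The boolean/value split is an
# assumption based on their descriptions.
BOOLEAN_FLAGS = {"proxies", "advancedStealth", "keepAlive", "experimental"}
VALUE_FLAGS = {"contextId", "browserWidth", "browserHeight", "modelName"}

def build_args(**options) -> list[str]:
    """Turn keyword options into the "args" array for the local server config."""
    args = ["@browserbasehq/mcp-server-browserbase"]
    for name, value in options.items():
        if name in BOOLEAN_FLAGS:
            # Boolean flags are included only when switched on.
            if value:
                args.append(f"--{name}")
        elif name in VALUE_FLAGS:
            # Assumed form: flag and value as separate array items.
            args += [f"--{name}", str(value)]
        else:
            raise ValueError(f"unknown flag: {name}")
    return args

print(build_args(proxies=True, browserWidth=1280, browserHeight=720))
```

Rejecting unknown names keeps a typo such as `browserwidth` from being silently dropped.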

Usage Example

Claude, navigate to example.com, take a screenshot,
and extract all product prices from the page.
