Skill

Automate Web Browsing with Natural Language

MultiOn Tool connects LlamaIndex agents to the MultiOn browser automation service, enabling agents to interact with the internet through Chrome.

Works with multion, openai

Spark score: 71 out of 100
Updated 3 months ago
Version 1.0.0

Why it matters

Empower your AI agent to interact with the internet through your Chrome browser. This tool allows agents to perform actions on your behalf by translating natural language commands into browser operations.

Outcomes

What it gets done

01. Execute web browsing tasks based on natural language instructions.
02. Integrate with MultiOn for seamless browser automation.
03. Enable AI agents to access and act on web content.

Install

Add it to your toolbox

Run in your project directory:

curl -fsSL https://spark.entire.vc/get/li-tool-tools-multion | bash

Capabilities

What this skill does

Drive a browser

Controls a real browser to automate web workflows.

Search the web

Searches the web and retrieves relevant sources.

Scrape

Fetches and parses content from web pages.

Overview

MultiOn Tool

What it does

MultiOn Tool integrates the MultiOn browser automation service into LlamaIndex agents. The tool includes a `browse` function that takes natural language instructions to pass to the web browser to execute.

How it connects

Use MultiOn Tool when you need your AI agent to perform web-based tasks through browser interaction. The source examples show reading Twitter feeds and checking Google Calendar events. The tool requires a MultiOn account and the MultiOn Chrome extension to be installed.

Source README

MultiOn Tool

pip install llama-index-tools-multion

This tool connects to MultiOn so that your agent can reach the internet through your Chrome web browser and act on your behalf.

To use this integration, you need a MultiOn account and the MultiOn Chrome extension installed.
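The usage example in this README passes the MultiOn key to `MultionToolSpec` as a string literal. As a minimal sketch, assuming you would rather keep the key out of source code, it could be read from an environment variable. The variable name `MULTION_API_KEY` and the helper `load_multion_api_key` are illustrative choices, not something the package defines.

```python
import os


def load_multion_api_key(env_var: str = "MULTION_API_KEY") -> str:
    """Return the key stored in env_var, or raise a clear error if it is unset.

    Both the default variable name and this helper are hypothetical;
    the llama-index-tools-multion package does not provide them.
    """
    key = os.environ.get(env_var, "").strip()
    if not key:
        raise RuntimeError(
            f"Set the {env_var} environment variable to your MultiOn API key."
        )
    return key
```

The returned value could then be passed as the `api_key` argument when constructing `MultionToolSpec`.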

Usage

This tool has a more extensive usage example documented in a Jupyter notebook here.

Here's an example usage of the MultionToolSpec.

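import asyncio

from llama_index.core.agent.workflow import FunctionAgent
from llama_index.llms.openai import OpenAI
from llama_index.tools.multion import MultionToolSpec

# Replace the placeholder with your own MultiOn API key.
multion_tool = MultionToolSpec(api_key="your-multion-key")

# Expose the spec's functions (here, browse) to the agent as tools.
agent = FunctionAgent(
    tools=multion_tool.to_tool_list(),
    llm=OpenAI(model="gpt-4.1"),
)

async def main():
    # agent.run() must be awaited; in a notebook, call `await main()` instead.
    print(await agent.run("Can you read the latest tweets from my followers"))
    print(await agent.run("What's the next thing on my google calendar?"))

asyncio.run(main())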

browse: The core function that takes natural language instructions to pass to the web browser to execute
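To make the shape of that interface concrete, here is a rough offline sketch: one function that accepts a natural language instruction as a string. `RecordingBrowseSpec` and `to_tool_map` are hypothetical stand-ins written for illustration; they are not part of the package and do not talk to MultiOn or a browser. The string return value is also an assumption made for the dry run.

```python
from typing import Callable, Dict, List


class RecordingBrowseSpec:
    """Hypothetical offline stand-in; NOT the real MultionToolSpec."""

    def __init__(self) -> None:
        # Every instruction passed to browse() is kept here for inspection.
        self.commands: List[str] = []

    def browse(self, cmd: str) -> str:
        """Record the instruction instead of sending it to a browser."""
        self.commands.append(cmd)
        return f"(dry run) would ask the browser to: {cmd}"

    def to_tool_map(self) -> Dict[str, Callable[[str], str]]:
        """Expose the function by name, roughly how an agent looks up a tool."""
        return {"browse": self.browse}


spec = RecordingBrowseSpec()
tools = spec.to_tool_map()
result = tools["browse"]("Open my calendar and read the next event")
print(result)
```

A stand-in like this may be handy for checking which instructions would be issued before connecting a real MultiOn account.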

This tool is designed to be used as a Tool in an Agent.
