Automate Web Browsing with Natural Language
MultiOn Tool connects LlamaIndex agents to the MultiOn browser automation service, enabling agents to interact with the internet through Chrome.
Why it matters
Empower your AI agent to interact with the internet through your Chrome browser. This tool allows agents to perform actions on your behalf by translating natural language commands into browser operations.
Outcomes
What it gets done
Execute web browsing tasks based on natural language instructions.
Integrate with MultiOn for seamless browser automation.
Enable AI agents to access and act on web content.
Install
Add it to your toolbox
Run in your project directory:
curl -fsSL https://spark.entire.vc/get/li-tool-tools-multion | bash Capabilities
What this skill does
Controls a real browser to automate web workflows.
Searches the web and retrieves relevant sources.
Fetches and parses content from web pages.
Overview
MultiOn Tool
What it does
MultiOn Tool integrates the MultiOn browser automation service into LlamaIndex agents. The tool includes a `browse` function that takes natural language instructions to pass to the web browser to execute.
How it connects
Use MultiOn Tool when you need your AI agent to perform web-based tasks through browser interaction. The source examples show reading Twitter feeds and checking Google Calendar events. This tool requires the MultiOn Chrome extension installed and a MultiOn account.
Source README
MultiOn Tool
pip install llama-index-tools-multion
This tool connects to MultiOn to enable your agent to easily
connect to the internet through your Chrome Web browser and act on your behalf
You will need to have the MultiOn chrome extension installed and a MultiOn account
to use this integration
Usage
This tool has more a extensive example usage documented in a Jupyter notebook here.
Here's an example usage of the MultionToolSpec.
from llama_index.tools.multion import MultionToolSpec
from llama_index.core.agent.workflow import FunctionAgent
from llama_index.llms.openai import OpenAI
multion_tool = MultionToolSpec(api_key="your-multion-key")
agent = FunctionAgent(
tools=multion_tool.to_tool_list(),
llm=OpenAI(model="gpt-4.1"),
)
print(await agent.run("Can you read the latest tweets from my followers"))
print(await agent.run("What's the next thing on my google calendar?"))
browse: The core function that takes natural language instructions to pass to the web browser to execute
This loader is designed to be used as a way to load data as a Tool in a Agent.
Discussion
Questions & comments · 0
Sign In Sign in to leave a comment.