Documentation
Your complete guide to using Nanobrowser
🌐 Nanobrowser
Nanobrowser is an open-source AI web automation tool that runs in your browser. A free alternative to OpenAI Operator with flexible LLM options and multi-agent system.
🔥 Why Nanobrowser?
Looking for a powerful AI web agent without the $200/month price tag of OpenAI Operator? Nanobrowser, as a chrome extension, delivers premium web automation capabilities while keeping you in complete control:
- 100% Free - No subscription fees or hidden costs. Just install and use your own API keys, and you only pay what you use with your own API keys.
- Privacy-Focused - Everything runs in your local browser. Your credentials stay with you, never shared with any cloud service.
- Flexible LLM Options - Connect to your preferred LLM providers with the freedom to choose different models for different agents.
- Fully Open Source - Complete transparency in how your browser is automated. No black boxes or hidden processes.
Note: We currently support OpenAI, Anthropic, Gemini, Ollama and custom OpenAI-Compatible providers, more providers will be supported.
📊 Key Features
- Multi-agent System: Specialized AI agents collaborate to accomplish complex web workflows
- Interactive Side Panel: Intuitive chat interface with real-time status updates
- Task Automation: Seamlessly automate repetitive web automation tasks across websites
- Follow-up Questions: Ask contextual follow-up questions about completed tasks
- Conversation History: Easily access and manage your AI agent interaction history
- Multiple LLM Support: Connect your preferred LLM providers and assign different models to different agents
🚀 Quick Start
1. Install from Chrome Web Store
- Visit the Nanobrowser Chrome Web Store page
- Click "Add to Chrome" button
- Confirm the installation when prompted
2. Configure Agent Models
- Click the Nanobrowser icon in your toolbar to open the sidebar
- Click the
Settings
icon (top right) - Add your LLM API keys
- Choose which model to use for different agents (Navigator, Planner, Validator)
🔧 Manually Install
1. Download
Download the latest nanobrowser.zip
file from the official Github release page.
2. Install
- Unzip
nanobrowser.zip
- Open
chrome://extensions/
in Chrome - Enable
Developer mode
(top right) - Click
Load unpacked
(top left) - Select the unzipped
nanobrowser
folder
3. Configure Agent Models
- Click the Nanobrowser icon in your toolbar to open the sidebar
- Click the
Settings
icon (top right) - Add your LLM API keys
- Choose which model to use for different agents (Navigator, Planner, Validator)
4. Upgrading
- Download the latest
nanobrowser.zip
file from the release page - Unzip and replace your existing Nanobrowser files with the new ones
- Go to
chrome://extensions/
in Chrome and click the refresh icon on the Nanobrowser card
🛠️ Build from Source
2. Clone the Repository
git clone https://github.com/nanobrowser/nanobrowser.git
cd nanobrowser
3. Install Dependencies
pnpm install
4. Build the Extension
pnpm build
5. Load the Extension
- The built extension will be in the
dist
directory - Follow the installation steps from the Manually Install section to load the extension into your browser
6. Development Mode (optional)
pnpm dev
🤖 Choosing Your Models
Nanobrowser allows you to configure different LLM models for each agent to balance performance and cost. Here are recommended configurations:
💡 See It In Action
Here are some powerful tasks you can accomplish with just a sentence:
- News Summary
"Go to TechCrunch and extract top 10 headlines from the last 24 hours"
- GitHub Research
"Look for the trending Python repositories on GitHub with most stars"
- Shopping Research
"Find a portable Bluetooth speaker on Amazon with a water-resistant design, under $50. It should have a minimum battery life of 10 hours"
🛠️ Roadmap
We're actively developing Nanobrowser with exciting features on the horizon, welcome to join us!
- Expand LLM Support: Add integration with more large language model providers for AI agents
- Security Firewall: Implement protective measures to ensure safe browsing operations
- Memory Optimization: Reduce token usage while maintaining context awareness
- Session Replay: Efficiently replay historical tasks with minimal token consumption
- Specialized Agents: Develop purpose-built agents for complex domain-specific tasks
💬 Community
Join our growing community of developers and users:
- Discord - Chat with team and community
- Twitter - Follow for updates and announcements
- GitHub Discussions - Share ideas and ask questions
Made with ❤️ by the Nanobrowser Team.
Like Nanobrowser? Give us a star 🌟