Gregor Žunič 2d27dc4f7c loggers
2024-11-05 13:03:38 +01:00
2024-11-04 10:24:09 +01:00
2024-11-05 13:03:38 +01:00
2024-11-05 13:03:38 +01:00
2024-11-05 13:03:38 +01:00
2024-11-02 18:43:35 +01:00
2024-10-31 17:00:29 +01:00
2024-11-01 17:07:05 +01:00
2024-11-01 21:39:09 +01:00
2024-11-01 21:39:09 +01:00
2024-11-05 12:50:18 +01:00
2024-11-05 13:03:38 +01:00

🌐 Browser-Use

Open-Source Web Automation with LLMs

GitHub stars License: MIT Python 3.11+

Let LLMs interact with websites naturally

Key FeaturesLive DemosQuick StartExamplesModels


🎥 Live Demos

Watch Browser-Use tackle real-world tasks:

Jobs Demo
Job Applications

Apply to 5 SF tech jobs
Images Demo
Multi-Tab Search

Find images across tabs
Flights Demo
Flight Search

Find cheapest flights

🚀 Key Features

  • 🤖 Universal LLM Support - Works with any Language Model
  • 🎯 Smart Element Detection - Automatically finds interactive elements
  • 📑 Multi-Tab Management - Seamless handling of browser tabs
  • 🔍 XPath Extraction - No more manual DevTools inspection
  • 👁️ Vision Model Support - Process visual page information
  • 🛠️ Customizable Actions - Add your own browser interactions

💻 Quick Start

# Install with uv (recommended)
uv pip install -r requirements.txt

# Generate requirements
uv pipreqs --ignore .venv --force

📝 Examples

from browser_use import AgentService, ControllerService
from langchain_anthropic import ChatAnthropic

# Initialize browser agent
agent = AgentService(
    task="Find cheapest flight from London to Kyrgyzstan",
    model=ChatAnthropic(model="claude-3-sonnet"),
    controller=ControllerService(),
    use_vision=True
)

# Let it work its magic
await agent.run()

🤖 Supported Models

GPT-4o GPT-4o Mini Claude 3.5 Sonnet

🤝 Contributing

Contributions are welcome!


Star this repo if you find it useful!
Made with ❤️ by the Browser-Use team
Description
Mirrored from GitHub
Readme MIT 125 MiB
Languages
Python 98%
Shell 1.4%
Dockerfile 0.4%
HTML 0.2%