B

Browser Use Portal

Powerful mcp for server, enables, agents, control. Includes structured workflows, validation checks, and reusable patterns for browser_automation.

MCPClipticsbrowser_automationv1.0.0MIT
0 views0 copies

Browser Use Portal

MCP server enabling AI agents to control and automate web browsers through the browser-use framework.

When to Use This MCP Server

Connect this server when you need to:

  • Automate complex browser interactions like form filling, multi-step navigation, and data extraction from dynamic web pages
  • Build and test end-to-end workflows that require a fully rendered browser environment with JavaScript execution
  • Delegate web research or scraping tasks to an AI agent that can visually interpret and interact with page content

Consider alternatives when:

  • You only need to fetch static HTML or API responses where a simple HTTP client suffices
  • Your automation targets are well-structured APIs rather than browser-rendered user interfaces

Quick Start

Configuration

name: browser-use-portal type: mcp category: browser_automation

Example Connection

claude mcp add browser-use-portal -- browser-use-mcp-server run server --port 8000 --stdio --proxy-port 9000

Available Tools

navigate:        Open a URL in the controlled browser instance
click:           Click on elements identified by selector or visual description
type-text:       Enter text into input fields and form elements
screenshot:      Capture a screenshot of the current browser viewport
extract-content: Extract structured data from the rendered page
scroll:          Scroll the page in any direction by a specified amount

Core Concepts

Browser Automation Overview

AspectDetails
ProtocolMCP over stdio with optional HTTP proxy
Browser EngineChromium-based headless or headed browser
AI IntegrationUses OpenAI vision models for element identification
Session ManagementPersistent browser sessions across multiple tool calls
Proxy SupportBuilt-in proxy on configurable port for traffic inspection
Page InteractionFull DOM access, JavaScript execution, and cookie management

Browser Control Architecture

β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”     β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”     β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚ Claude Code  │────▢│  Browser Use     │────▢│  Chromium     β”‚
β”‚  (Client)    │◀────│  MCP Server      │◀────│  Browser      β”‚
β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜     β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜     β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜
                           β”‚                        β”‚
                    β”Œβ”€β”€β”€β”€β”€β”€β”΄β”€β”€β”€β”€β”€β”€β”          β”Œβ”€β”€β”€β”€β”€β”€β”΄β”€β”€β”€β”€β”€β”€β”
                    β”‚  Proxy Port β”‚          β”‚  Target     β”‚
                    β”‚   (9000)    β”‚          β”‚  Website    β”‚
                    β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜          β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜

Configuration

ParameterTypeDefaultDescription
--portinteger8000Port for the MCP server HTTP interface
--stdioflagdisabledEnable stdio transport for Claude Code integration
--proxy-portinteger9000Port for the browser traffic inspection proxy
OPENAI_API_KEYstringrequiredAPI key for vision model used in element identification
headlessbooleantrueRun browser without visible window for CI environments

Best Practices

  1. Use Headless Mode in CI - Run with headless mode enabled in continuous integration pipelines to avoid display server dependencies. Switch to headed mode during development for visual debugging of automation steps.

  2. Set Reasonable Timeouts - Web pages load at varying speeds. Configure generous timeouts for navigation actions to prevent premature failures on slow-loading pages, especially those with heavy JavaScript frameworks.

  3. Isolate Browser Sessions - Create separate browser sessions for unrelated tasks to prevent cookie and state contamination. This ensures that login states and preferences from one workflow do not affect another.

  4. Capture Screenshots for Debugging - Take screenshots before and after critical interaction steps. When automation fails, these screenshots provide the visual context needed to understand what the AI agent actually encountered.

  5. Respect robots.txt and Rate Limits - Configure appropriate delays between requests to avoid overwhelming target servers. Automated browsing should comply with site policies and legal requirements for web scraping.

Common Issues

  1. Browser Fails to Launch - Ensure Chromium dependencies are installed on your system. On Linux, install chromium-browser and required shared libraries. On macOS, the bundled Chromium should work out of the box.

  2. Element Not Found Errors - Dynamic pages may not have finished rendering when the tool tries to interact. Add explicit waits or use the screenshot tool to verify page state before clicking or typing.

  3. OpenAI API Key Missing - The browser-use framework requires an OpenAI API key for visual element identification. Set OPENAI_API_KEY in your environment or pass it through the MCP server configuration.

Community

Reviews

Write a review

No reviews yet. Be the first to review this template!

Similar Templates