Show HN: Mcp-chromautomation – Chrome MCP that is not a puppeteer

3 months ago 16

A comprehensive Model Context Protocol (MCP) service for Chrome browser automation with enhanced capabilities, built on the browserhttp library and using the mcp-go framework.

Enhanced MCP Server Capabilities (19 Tools)

Core Browser Automation:

Enhanced Navigation: Navigate with performance tracking and advanced waiting
Advanced Interaction: Click, type, select with improved reliability
Content Extraction: Text, links, images, forms with metadata
Session Management: Complete cookie and localStorage handling
Screenshot Capture: Automated screenshot with custom naming

New Analysis Tools:

Performance Monitoring: DOM load times, network requests, resource analysis
SEO Analysis: Title, description, keywords, heading structure
Security Scanning: SSL validation, CSP analysis, vulnerability detection
Content Intelligence: Comprehensive link categorization and form analysis

Advanced Features:

JSON API Integration: Send POST requests with browser context
Local Storage Management: Read/write browser localStorage
Multi-condition Waiting: Wait for elements, text, or navigation
Session Cleanup: Complete cookie and storage clearing

Interactive Menu: Navigate through options with keyboard
Real-time Feedback: See results immediately
Session Browser: View and manage saved sessions
Request Logs: Monitor all browser activity
Settings Panel: Database statistics and configuration

Enhanced Browser Automation

Built on your proven browserhttp library
Real Chrome browser via chromedp
JavaScript rendering and form submission
Screenshot capture and storage
Persistent tab management

mcp-chromautomation/ ├── main.go # Entry point with CLI commands ├── internal/ │ ├── server/ # MCP protocol implementation │ │ └── server.go # JSON-RPC server with tools │ ├── browser/ # Browser automation layer │ │ └── manager.go # Enhanced browserhttp integration │ ├── storage/ # Local data persistence │ │ └── manager.go # SQLite database management │ └── cli/ # Terminal user interface │ └── app.go # Bubble Tea application └── go.mod # Dependencies

Clone and setup:

git clone https://github.com/gleicon/mcp-chromautomation cd mcp-chromautomation
Install dependencies:
Build the application:

go build -o mcp-chromautomation
Setup Chrome for automation (preserves your existing sessions):

# Start Chrome with debugging enabled (keeps your sessions & cookies) ./start_chrome.sh

First, ensure Chrome is running with debugging enabled:

# This preserves your existing sessions and cookies ./start_chrome.sh

Then start the MCP server:

./mcp-chromautomation server

The service will connect to your existing Chrome instance, preserving all your:

🔐 Logged-in sessions
🍪 Cookies and authentication
📝 Form data and preferences
🔖 Bookmarks and extensions

The server provides 19 enhanced MCP tools:

Core Automation:

chrome_navigate - Enhanced navigation with performance tracking
chrome_click - Reliable element clicking with validation
chrome_extract_text - Advanced text extraction with selectors
chrome_fill_form - Smart form filling with validation
chrome_screenshot - Screenshot capture with custom naming
chrome_wait_for_element - Element waiting with timeout control

Content Analysis:

chrome_extract_links - Extract and categorize all page links
chrome_extract_images - Extract images with metadata
chrome_extract_forms - Analyze form structures and fields
chrome_analyze_seo - Comprehensive SEO analysis

Performance & Security:

chrome_get_performance - Detailed performance metrics
chrome_check_security - Security vulnerability scanning

Advanced Interaction:

chrome_post_json - Send JSON data via browser
chrome_wait_advanced - Multi-condition waiting

Session Management:

chrome_get_local_storage - Access browser localStorage
chrome_set_local_storage - Manage localStorage data
chrome_clear_session - Complete session cleanup
session_save - Enhanced session saving with full state
session_load - Complete session restoration

Launch the beautiful terminal interface:

Navigate with keyboard shortcuts:

↑/↓ or j/k - Move up/down
Enter - Select item
Esc - Go back
q - Quit

Use with any MCP client by configuring the server:

{ "mcpServers": { "chromautomation": { "command": "/path/to/mcp-chromautomation", "args": ["server"] } } }

Example tool calls:

// Navigate to a website await mcp.callTool("chrome_navigate", { url: "https://example.com", screenshot: true, wait_for: "#main-content" }); // Click an element await mcp.callTool("chrome_click", { selector: ".submit-button", screenshot: true }); // Extract text content const result = await mcp.callTool("chrome_extract_text", { selector: ".article-content p" });

JavaScript Integration & Security Model

The service uses legitimate browser automation through Chrome DevTools Protocol (CDP), not malicious code injection.

How JavaScript Execution Works

1. Chrome DevTools Protocol (CDP)

// Uses Chrome's official debugging protocol import "github.com/chromedp/chromedp"

2. Safe JavaScript Execution Types

DOM Queries (Read-only):

// Extract links using standard DOM API Array.from(document.querySelectorAll('a[href]')).map(a => a.href)

Performance Metrics (Browser APIs):

// Get performance data using standard Performance API (() => { const perf = performance.getEntriesByType('navigation')[0]; return { domContentLoaded: perf.domContentLoadedEventEnd - perf.domContentLoadedEventStart, loadComplete: perf.loadEventEnd - perf.loadEventStart }; })()

Session Management (Standard Web APIs):

// Cookie management via document.cookie document.cookie = 'name=value; path=/'; // localStorage access localStorage.getItem('key'); localStorage.setItem('key', 'value');

What it IS:

Legitimate browser automation using Chrome's official debugging protocol
Execution equivalent to manual browser DevTools console usage
Read-only operations for most functions (link extraction, text parsing)
Standard DOM/Web API usage only

What it's NOT:

Not XSS injection into target websites
Not malicious code execution
Not breaking website security policies
Not bypassing same-origin policies maliciously

Flow Overview:

1. Connect to existing Chrome via CDP 2. Navigate to page normally (like a user) 3. Wait for page to load completely 4. Execute JavaScript in browser context (like F12 console) 5. Extract data using standard DOM APIs 6. Return results safely

The service provides detailed performance monitoring:

// Real browser performance metrics const metrics = await mcp.callTool("chrome_get_performance", {}); // Returns: DOM load times, network requests, resource sizes

Advanced content extraction and analysis:

// Extract and categorize all links const links = await mcp.callTool("chrome_extract_links", {}); // Returns: internal, external, product, category links with counts // SEO analysis const seo = await mcp.callTool("chrome_analyze_seo", {}); // Returns: title, description, keywords, heading structure

Built-in security analysis tools:

// Comprehensive security check const security = await mcp.callTool("chrome_check_security", {}); // Returns: SSL status, CSP analysis, vulnerability scan results

Complete Website Analysis

// Comprehensive analysis workflow const client = new MCPChromeClient(); // Navigate with performance tracking const nav = await client.callTool('chrome_navigate', { url: 'https://example.com', track_performance: true, screenshot: true }); // Extract all content const links = await client.callTool('chrome_extract_links', {}); const seo = await client.callTool('chrome_analyze_seo', {}); const performance = await client.callTool('chrome_get_performance', {}); const security = await client.callTool('chrome_check_security', {}); console.log(`Found ${links.count} links`); console.log(`SEO score: ${seo.seo.title ? 'Has title' : 'Missing title'}`); console.log(`Load time: ${performance.performance.load_complete}ms`); console.log(`SSL valid: ${security.security.ssl.valid}`);

// Example: Analyze Magazine Luiza const result = await client.callTool('chrome_navigate', { url: 'https://magazineluiza.com.br' }); const links = await client.callTool('chrome_extract_links', {}); const linkData = JSON.parse(links.content[0].text); console.log('Link Analysis:'); console.log(`Total links: ${linkData.count}`); console.log(`Product links: ${linkData.links.filter(l => l.includes('/produto/') || l.includes('/p/') ).length}`); console.log(`Internal links: ${linkData.links.filter(l => l.includes('magazineluiza.com.br') ).length}`);

Form Analysis and Interaction

// Extract and analyze forms const forms = await client.callTool('chrome_extract_forms', {}); const formData = JSON.parse(forms.content[0].text); console.log(`Found ${formData.count} forms`); // Fill a form intelligently await client.callTool('chrome_fill_form', { fields: { '#email': '[email protected]', '#password': 'secure_password', '#remember': 'true' }, submit: true, validate_before_submit: true });

// Monitor page performance over time const sites = ['https://example.com', 'https://google.com']; const results = []; for (const site of sites) { await client.callTool('chrome_navigate', { url: site, track_performance: true }); const perf = await client.callTool('chrome_get_performance', {}); const perfData = JSON.parse(perf.content[0].text); results.push({ site, loadTime: perfData.performance.load_complete, domReady: perfData.performance.dom_content_loaded, requests: perfData.performance.network_requests }); } console.table(results);

The service stores data locally in ~/.mcp-chromautomation/:

Sessions: Browser state including cookies and URLs
Request Logs: Complete HTTP request/response history
Screenshots: Captured page screenshots with metadata
Performance Data: Page load metrics and analysis
Settings: User preferences and configuration

browserhttp - Your excellent browser automation library
mcp-go - Robust MCP protocol implementation
chromedp - Chrome DevTools Protocol
cobra - CLI framework
sqlite - Pure Go SQLite

bubbletea - TUI framework
bubbles - TUI components
lipgloss - Terminal styling

Fork the repository
Create a feature branch
Make your changes
Add tests
Submit a pull request

This project is licensed under the MIT License - see the LICENSE file for details.

Built on browserhttp library
Inspired by the Model Context Protocol specification
UI powered by the amazing Charm libraries

Read Entire Article

Show HN: Mcp-chromautomation – Chrome MCP that is not a puppeteer

Enhanced MCP Server Capabilities (19 Tools)

Enhanced Browser Automation

JavaScript Integration & Security Model

How JavaScript Execution Works

Complete Website Analysis

Form Analysis and Interaction

Related

Show HN: Shelly Systems – Process Modeling as Code

Show HN: Zingle – an AI code reviewer for data teams (SQL/db...

Eval Protocol: RL for agents in any language, container, or ...