Show HN: REPL is the memory layer for multi-agent AI apps – Sherlog‑MCP


Sherlog MCP is a Model Context Protocol (MCP) server that provides a persistent IPython workspace for data analysis, log processing, and multi-agent collaboration.

Sherlog MCP Server transforms Claude Desktop into a stateful data analysis powerhouse by providing:

Persistent IPython Shell: A living workspace where all operations persist across tool calls
DataFrame-Centric Architecture: Every operation returns DataFrames, creating a unified data model
Shared Blackboard: Perfect for multi-agent workflows where data needs to be passed between operations
MCP Proxy: Seamlessly integrates any external MCP server, executing all operations within the same IPython context

Think of it as giving Claude a persistent Python notebook that never forgets, where every piece of data is immediately available for the next operation.

🐍 Persistent IPython Workspace

Stateful Execution: Variables, imports, and results persist across all tool calls
Automatic Memory Management: Cleanup runs after 50 operations (configurable via MCP_AUTO_RESET_THRESHOLD) to prevent bloat
Session Preservation: Workspace saves on shutdown and restores on startup

Unified Data Model: All tools return pandas/polars DataFrames
Seamless Integration: Results from any tool become inputs for others
Smart Conversions: Automatically converts various formats to DataFrames

Dynamic Tool Integration: Connect any MCP server and use its tools within the IPython context
Unified Namespace: External tools' results become DataFrames in the shared workspace
Zero Configuration: Just list external MCPs in the EXTERNAL_MCPS_JSON environment variable

Log Analysis: Powered by Salesforce's LogAI - anomaly detection, clustering, parsing
Data Sources: S3, local files, GitHub, Grafana, and more
Processing Tools: Feature extraction, vectorization, preprocessing pipelines

Docker Desktop
Claude Desktop

Add to Claude Desktop configuration:

macOS: ~/Library/Application Support/Claude/claude_desktop_config.json
Windows: %APPDATA%\Claude\claude_desktop_config.json

Restart Claude Desktop

Configuration with Environment Variables

For a fully configured setup with external integrations:

{
  "mcpServers": {
    "sherlog": {
      "command": "docker",
      "args": [
        "run", "--rm", "-i",
        "--volume=/var/run/docker.sock:/var/run/docker.sock",
        "--mount=type=bind,src=/Users/username/,dst=/Users/username/,ro",
        "-e", "MCP_TRANSPORT=stdio",
        "-e", "GITHUB_TOKEN=your_github_token",
        "-e", "AWS_ACCESS_KEY_ID=your_aws_key",
        "-e", "AWS_SECRET_ACCESS_KEY=your_aws_secret",
        "-e", "EXTERNAL_MCPS_JSON={\"filesystem\":{\"command\":\"npx\",\"args\":[\"-y\",\"@modelcontextprotocol/server-filesystem\",\"/Users/username/data\"]}}",
        "ghcr.io/navneet-mkr/sherlog-mcp:latest"
      ]
    }
  }
}

Important Notes:

Replace /Users/username/ with your actual home directory path
The mount source and destination paths must be identical so that host file paths resolve to the same files inside the container
Add read-only (,ro) to mounts for security unless write access is needed
Docker must be running before starting Claude Desktop
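
The EXTERNAL_MCPS_JSON value in the configuration above is JSON nested inside a JSON string, which is easy to get wrong when escaping by hand. The following is a minimal sketch, not part of Sherlog itself, that generates the file content with Python's json module. The helper name build_sherlog_config and its parameters are hypothetical; the image name, flags, and environment variable names are taken from the configuration above.

```python
import json

SHERLOG_IMAGE = "ghcr.io/navneet-mkr/sherlog-mcp:latest"


def build_sherlog_config(home_dir, external_mcps, extra_env=None):
    """Return a claude_desktop_config.json structure (hypothetical helper)."""
    args = [
        "run", "--rm", "-i",
        # Same path on both sides of the mount, read-only.
        f"--mount=type=bind,src={home_dir},dst={home_dir},ro",
        "-e", "MCP_TRANSPORT=stdio",
    ]
    for name, value in (extra_env or {}).items():
        args += ["-e", f"{name}={value}"]
    # Serialize the inner JSON compactly; the outer json.dumps call below
    # takes care of escaping its quotes.
    inner = json.dumps(external_mcps, separators=(",", ":"))
    args += ["-e", "EXTERNAL_MCPS_JSON=" + inner]
    args.append(SHERLOG_IMAGE)
    return {"mcpServers": {"sherlog": {"command": "docker", "args": args}}}


external = {
    "filesystem": {
        "command": "npx",
        "args": ["-y", "@modelcontextprotocol/server-filesystem", "/Users/username/data"],
    }
}
config = build_sherlog_config(
    "/Users/username/", external, {"GITHUB_TOKEN": "your_github_token"}
)
config_text = json.dumps(config, indent=2)
print(config_text)
```

The printed text can be pasted into claude_desktop_config.json. Secrets passed through extra_env end up in that file in plain text, exactly as they would when written by hand.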

The Docker configuration includes two types of mounts:

Docker Socket (for Docker tools):

--volume=/var/run/docker.sock:/var/run/docker.sock

Allows the MCP server to manage Docker containers (if using Docker tools)

File System Access:

--mount=type=bind,src=/Users/username/,dst=/Users/username/,ro

Grants read-only access to your files. Adjust the path to limit access:
- For specific project: src=/Users/username/projects,dst=/Users/username/projects
- For data folder only: src=/Users/username/data,dst=/Users/username/data
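
When the source and destination of a bind mount differ, a file is visible inside the container under a different path than on the host. The sketch below illustrates that mapping for the src=...,dst=... form used in this document. The function names are hypothetical, and only the src, dst, and ro fields shown above are handled.

```python
from pathlib import PurePosixPath


def parse_bind_mount(flag):
    """Split '--mount=type=bind,src=...,dst=...,ro' into (src, dst, read_only)."""
    spec = flag.split("=", 1)[1]  # drop the leading '--mount='
    fields = {}
    for part in spec.split(","):
        key, _, value = part.partition("=")
        fields[key] = value  # a bare 'ro' becomes {'ro': ''}
    return fields["src"], fields["dst"], "ro" in fields


def container_path(host_path, flag):
    """Translate a host path into the path seen inside the container."""
    src, dst, _ = parse_bind_mount(flag)
    # relative_to raises ValueError if host_path is outside the mounted tree.
    relative = PurePosixPath(host_path).relative_to(PurePosixPath(src))
    return str(PurePosixPath(dst) / relative)


same = "--mount=type=bind,src=/Users/username/,dst=/Users/username/,ro"
different = "--mount=type=bind,src=/Users/username/data,dst=/data,ro"

print(container_path("/Users/username/data/app.log", same))
print(container_path("/Users/username/data/app.log", different))
```

With identical source and destination the path is unchanged, which is why host paths mentioned in a conversation keep working inside the container.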

Minimal Setup (no external integrations):

{
  "mcpServers": {
    "sherlog": {
      "command": "docker",
      "args": [
        "run", "--rm", "-i",
        "-e", "MCP_TRANSPORT=stdio",
        "ghcr.io/navneet-mkr/sherlog-mcp:latest"
      ]
    }
  }
}

With File Access (for log analysis):

{
  "mcpServers": {
    "sherlog": {
      "command": "docker",
      "args": [
        "run", "--rm", "-i",
        "--mount=type=bind,src=/path/to/logs,dst=/data/logs,ro",
        "-e", "MCP_TRANSPORT=stdio",
        "ghcr.io/navneet-mkr/sherlog-mcp:latest"
      ]
    }
  }
}

# Session Management
export MCP_AUTO_RESET_THRESHOLD=50    # Operations before auto-cleanup
export MCP_AUTO_RESET_ENABLED=true    # Enable automatic memory management
export MCP_MAX_OUTPUT_SIZE=50000      # Max output size per buffer (default: 50KB)

# Logging
export LOG_LEVEL=INFO
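
To make the meaning of these variables concrete, here is a small sketch of how a process could read them with fallbacks. It is an illustration only and does not reproduce Sherlog's own settings code. The function name load_session_settings is hypothetical, and the fallback values simply mirror the values shown above.

```python
import os


def load_session_settings(env=None):
    """Read the session variables from a mapping (defaults to os.environ)."""
    env = os.environ if env is None else env
    enabled = env.get("MCP_AUTO_RESET_ENABLED", "true").strip().lower()
    return {
        # Number of operations before auto-cleanup runs.
        "auto_reset_threshold": int(env.get("MCP_AUTO_RESET_THRESHOLD", "50")),
        # Accept a few common spellings of "true" (an assumption of this sketch).
        "auto_reset_enabled": enabled in {"1", "true", "yes"},
        # Maximum output size per buffer.
        "max_output_size": int(env.get("MCP_MAX_OUTPUT_SIZE", "50000")),
        "log_level": env.get("LOG_LEVEL", "INFO").upper(),
    }


settings = load_session_settings({"MCP_AUTO_RESET_THRESHOLD": "10", "LOG_LEVEL": "debug"})
print(settings)
```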

Configure API keys for built-in integrations:

# AWS
export AWS_ACCESS_KEY_ID=your_key
export AWS_SECRET_ACCESS_KEY=your_secret
export AWS_REGION=us-east-1

# GitHub
export GITHUB_TOKEN=your_token

# Grafana
export GRAFANA_URL=https://your-instance.grafana.net
export GRAFANA_API_KEY=your_key

Connect any MCP server to execute within the IPython workspace:

export EXTERNAL_MCPS_JSON='{
  "filesystem": {
    "command": "npx",
    "args": ["-y", "@modelcontextprotocol/server-filesystem", "/tmp"]
  },
  "postgres": {
    "command": "npx",
    "args": ["-y", "@modelcontextprotocol/server-postgres"],
    "env": {
      "DATABASE_URL": "$DATABASE_URL"
    }
  }
}'
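
Because the value above sits inside single quotes, the shell passes "$DATABASE_URL" through literally, so any expansion has to happen in the program that reads the variable. The sketch below shows one way such a value could be parsed and its placeholders filled in. It is an assumption for illustration: the source does not say how Sherlog resolves placeholders, and parse_external_mcps is a hypothetical name.

```python
import json
from string import Template


def parse_external_mcps(raw, env):
    """Parse an EXTERNAL_MCPS_JSON string and fill $VAR placeholders from env."""
    servers = json.loads(raw)
    if not isinstance(servers, dict):
        raise ValueError("EXTERNAL_MCPS_JSON must be a JSON object keyed by server name")
    parsed = {}
    for name, spec in servers.items():
        if "command" not in spec:
            raise ValueError(f"server {name!r} is missing 'command'")
        parsed[name] = {
            "command": spec["command"],
            "args": list(spec.get("args", [])),
            # safe_substitute leaves unknown placeholders untouched.
            "env": {
                key: Template(value).safe_substitute(env)
                for key, value in spec.get("env", {}).items()
            },
        }
    return parsed


raw = """{
  "filesystem": {"command": "npx",
                 "args": ["-y", "@modelcontextprotocol/server-filesystem", "/tmp"]},
  "postgres": {"command": "npx",
               "args": ["-y", "@modelcontextprotocol/server-postgres"],
               "env": {"DATABASE_URL": "$DATABASE_URL"}}
}"""
servers = parse_external_mcps(raw, {"DATABASE_URL": "postgresql://localhost/logs"})
print(servers["postgres"]["env"])
```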

Every tool execution happens within a persistent IPython shell. This means:

Variables Persist: Create df in one tool call, use it in the next
Imports Stay: Import once, use everywhere
State Accumulates: Build complex analyses step by step
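
The persistence described above can be illustrated without IPython at all. In this sketch a plain dictionary stands in for the shell's namespace and Python's built-in exec stands in for a tool call. The class name PersistentWorkspace is hypothetical, and the real server uses an IPython shell rather than exec.

```python
class PersistentWorkspace:
    """Toy stand-in for a persistent shell: one namespace shared by every call."""

    def __init__(self):
        self.namespace = {}

    def run(self, code):
        # Every call executes against the same dictionary, so names survive.
        exec(code, self.namespace)


ws = PersistentWorkspace()
ws.run("import statistics")                        # call 1: the import stays
ws.run("values = [2, 4, 6]")                       # call 2: the variable persists
ws.run("mean_value = statistics.mean(values)")     # call 3: builds on both
print(ws.namespace["mean_value"])
```

If each call used a fresh dictionary instead, the third call would fail with a NameError, which is the behavior a stateless tool server has by default.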

DataFrame as Universal Currency

All tools follow a simple pattern:

Execute operation
Store result as DataFrame in IPython namespace
Return reference for next operation

This creates a powerful chain of operations where each step builds on the last.
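
The three-step pattern can be sketched as a decorator. This is an illustration under assumptions, not Sherlog's implementation: the decorator name workspace_tool and the shape of the returned reference are hypothetical, a module-level dictionary stands in for the IPython namespace, and a list of dictionaries stands in for a DataFrame so the example needs no third-party packages.

```python
import functools

workspace = {}  # stands in for the IPython user namespace


def workspace_tool(result_name):
    """Wrap a tool so its result is stored by name and only a reference is returned."""
    def decorator(func):
        @functools.wraps(func)
        def wrapper(*args, **kwargs):
            result = func(*args, **kwargs)        # 1. execute the operation
            workspace[result_name] = result       # 2. store the result by name
            return {"variable": result_name, "rows": len(result)}  # 3. return a reference
        return wrapper
    return decorator


@workspace_tool("error_lines")
def find_errors(lines):
    return [
        {"line_no": number, "text": text}
        for number, text in enumerate(lines, start=1)
        if "ERROR" in text
    ]


reference = find_errors(["INFO boot", "ERROR disk full", "ERROR timeout"])
print(reference)
print(workspace["error_lines"])
```

Returning a small reference instead of the full result keeps large tables out of the conversation while leaving them available to later steps.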

In multi-agent scenarios, the IPython workspace acts as a shared blackboard:

Agent A loads data → stores as raw_data
Agent B processes it → creates processed_data
Agent C analyzes results → uses both previous DataFrames
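
A minimal sketch of that hand-off follows, with three functions playing the agents and a dictionary playing the shared workspace. The names raw_data and processed_data come from the description above; the agent functions and the sample log lines are made up for illustration.

```python
from collections import Counter

blackboard = {}  # shared workspace visible to every agent


def agent_a_load(raw_lines):
    """Agent A: load data and publish it as raw_data."""
    blackboard["raw_data"] = [line.strip() for line in raw_lines if line.strip()]


def agent_b_process():
    """Agent B: read raw_data and publish processed_data."""
    processed = []
    for line in blackboard["raw_data"]:
        level, _, message = line.partition(" ")
        processed.append({"level": level, "message": message})
    blackboard["processed_data"] = processed


def agent_c_analyze():
    """Agent C: use both earlier results."""
    counts = Counter(row["level"] for row in blackboard["processed_data"])
    return {"total_lines": len(blackboard["raw_data"]), "by_level": dict(counts)}


agent_a_load(["INFO started\n", "ERROR disk full\n", "\n", "INFO stopped\n"])
agent_b_process()
summary = agent_c_analyze()
print(summary)
```

The agents never pass data to each other directly; they only agree on variable names, which is what makes the workspace act as a blackboard.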

Sherlog MCP provides a comprehensive set of native tools optimized for the IPython workspace, with the ability to extend functionality through external MCP servers.

Our native tools are designed to work seamlessly with the DataFrame-centric architecture:

execute_python_code: Run arbitrary Python code in the workspace
list_shell_variables: See all available DataFrames and variables
session_memory_status: Monitor memory usage and auto-reset status
reset_session_now: Manually trigger a session cleanup

Local Files: load_file_log_data, read_file, write_file
AWS S3: s3_list_files, s3_download_file, s3_upload_file
GitHub: github_fetch_issues, github_fetch_pull_requests, github_fetch_commits
Grafana: grafana_query_prometheus, grafana_query_loki

Log Analysis (Powered by LogAI)

detect_anomalies: Time-series and semantic anomaly detection
cluster_logs: Group similar log entries using various algorithms
extract_features: Generate ML features from log text
parse_logs: Extract structured data from unstructured logs
vectorize_logs: Convert logs to numerical representations

Docker: docker_list_containers, docker_logs, docker_exec
Kubernetes: k8s_get_pods, k8s_get_logs, k8s_describe_resource
Code Analysis: analyze_code_structure, search_codebase

While Sherlog MCP includes many tools natively, you can connect any MCP server to extend functionality. External tools are automatically integrated into the IPython workspace:

External tools are prefixed: external_[server]_[tool]
Results automatically convert to DataFrames
Full access to the same IPython namespace
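
The naming rule can be written down in a few lines. This sketch only illustrates the external_[server]_[tool] convention stated above; the function names are hypothetical, and the tool names in the example are placeholders rather than a listing of what any particular server exposes.

```python
def external_tool_name(server, tool):
    """Apply the external_[server]_[tool] naming convention."""
    return f"external_{server}_{tool}"


def register_external_tools(servers):
    """Map prefixed names back to (server, tool) pairs.

    servers: mapping of server name -> list of tool names.
    """
    registry = {}
    for server, tools in servers.items():
        for tool in tools:
            registry[external_tool_name(server, tool)] = (server, tool)
    return registry


registry = register_external_tools({
    "postgres": ["query"],                      # placeholder tool names
    "filesystem": ["read_file", "list_directory"],
})
print(sorted(registry))
```

Prefixing with the server name keeps two servers that both expose, say, a read_file tool from colliding with each other or with the built-in tools.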

Filesystem: Advanced file operations beyond our built-in tools
PostgreSQL/MySQL: Direct database queries
Weather: Real-time weather data
Slack: Send messages and read channels
Google Sheets: Spreadsheet operations

"-e", "EXTERNAL_MCPS_JSON={\"postgres\":{\"command\":\"npx\",\"args\":[\"-y\",\"@modelcontextprotocol/server-postgres\"],\"env\":{\"DATABASE_URL\":\"$DATABASE_URL\"}}}"

Why Native Tools Are Better

Native tools in Sherlog MCP offer advantages over external MCPs:

DataFrame Integration: Results are automatically structured as DataFrames
Session Awareness: Tools can access and modify the IPython namespace
Optimized Performance: No subprocess overhead
Unified Error Handling: Consistent error messages and recovery
Cross-Tool State: Results from one tool are immediately available to others

To see all available tools in your session:

Native tools: Check the list above
External tools: Use list_external_tools() to see connected MCP servers
In Claude: Ask "What tools do you have available?"

Claude Desktop
    ↓
Sherlog MCP Server (stdio)
    ↓
IPython Shell (persistent workspace)
    ├── Built-in Tools (return DataFrames)
    ├── External MCP Tools (via proxy)
    └── User Code (execute_python_code)

Session Memory Management

The server automatically manages memory to prevent bloat:

Monitors execution count
After 50 operations with DataFrames, triggers smart cleanup
Preserves imports and recent DataFrames
Configurable via the MCP_AUTO_RESET_THRESHOLD and MCP_AUTO_RESET_ENABLED environment variables
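
The cleanup policy listed above could look roughly like the following. This is a sketch under assumptions and not the server's actual logic: the class name SessionMemoryManager and the keep_recent parameter are hypothetical, "recent" is approximated by dictionary insertion order, and any non-module value is treated as data.

```python
import json
import types


class SessionMemoryManager:
    """Count operations and prune old variables once a threshold is reached."""

    def __init__(self, namespace, threshold=50, keep_recent=3):
        self.namespace = namespace
        self.threshold = threshold
        self.keep_recent = keep_recent
        self.operations = 0

    def record_operation(self):
        """Return True if this operation triggered a cleanup."""
        self.operations += 1
        if self.operations >= self.threshold:
            self.cleanup()
            return True
        return False

    def cleanup(self):
        # Imported modules are preserved; everything else counts as data.
        data_names = [
            name for name, value in self.namespace.items()
            if not name.startswith("_") and not isinstance(value, types.ModuleType)
        ]
        # Dicts keep insertion order, used here as a rough proxy for recency.
        keep = set(data_names[-self.keep_recent:]) if self.keep_recent else set()
        for name in data_names:
            if name not in keep:
                del self.namespace[name]
        self.operations = 0


namespace = {"json": json, "df_1": [1], "df_2": [2], "df_3": [3], "df_4": [4]}
manager = SessionMemoryManager(namespace, threshold=3, keep_recent=2)
resets = [manager.record_operation() for _ in range(3)]
print(resets, sorted(namespace))
```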

Working with External MCPs

External MCP tools integrate seamlessly:

Results automatically convert to DataFrames
Stored in IPython namespace with tool name
Available for subsequent operations

Example flow:

PostgreSQL MCP queries database → result stored as DataFrame
LogAI tools analyze the data → create new DataFrames
Custom Python code combines results → final analysis

If you want to build and run locally:

Clone the repository:

git clone https://github.com/navneet-mkr/sherlog-mcp.git
cd sherlog-mcp

Build the Docker image:

Use your local image in Claude Desktop by replacing the image name:

"ghcr.io/navneet-mkr/sherlog-mcp:latest"

with:

"ghcr.io/navneet-mkr/sherlog-mcp:your-version"

docker run --rm -it \
  -e LOG_LEVEL=DEBUG \
  -e MCP_TRANSPORT=stdio \
  ghcr.io/navneet-mkr/sherlog-mcp:latest

Apache License 2.0 - see LICENSE file for details.

Built on:

LogAI by Salesforce
Model Context Protocol by Anthropic
IPython for the persistent shell
