We present Enterprise Deep Research (EDR), a multi-agent system that integrate:
- Master Planning Agent for adaptive query decomposition.
- Four specialized search agents (General, Academic, GitHub, LinkedIn).
- Extensible MCP-based tool ecosystem supporting NL2SQL, file analysis, and enterprise workflows.
- Visualization Agent for data-driven insights.
- Reflection mechanism that detects knowledge gaps and updates research direction with optional human-in-the-loop steering guidance.
- Real-time steering commands for continuous research refinement.
Note
These components enable automated report generation, real-time streaming, and seamless enterprise deployment, as validated on internal datasets.
We present two video demos of using EDR in web application and in Slack workspace.
Note
Multi-provider LLM support • Slack agent • Real-time streaming • Document analysis • Citation management • Parallel processing • Specialized benchmarking • Human-in-the-loop steering
Requirements: Python 3.11+ • Node.js 20.9.0+
Required Variables:
- TAVILY_API_KEY - Tavily search API key
- One LLM provider key:
- OPENAI_API_KEY - OpenAI API key
- ANTHROPIC_API_KEY - Anthropic API key
- GROQ_API_KEY - Groq API key
- GOOGLE_CLOUD_PROJECT - Google Cloud project ID
- SAMBNOVA_API_KEY - SambaNova API key
Optional Settings:
- LLM_PROVIDER - Default provider (default: openai)
- LLM_MODEL - Model name (provider-specific defaults)
- MAX_WEB_RESEARCH_LOOPS - Max iterations (default: 10)
| OpenAI | o4-mini | o4-mini, o4-mini-high, o3-mini, o3-mini-reasoning, gpt-4o |
| Anthropic | claude-sonnet-4 | claude-sonnet-4, claude-sonnet-4-thinking, claude-3-7-sonnet, claude-3-7-sonnet-thinking |
| gemini-2.5-pro | gemini-2.5-pro, gemini-1.5-pro-latest, gemini-1.5-flash-latest | |
| Groq | deepseek-r1-distill-llama-70b | deepseek-r1-distill-llama-70b, llama-3.3-70b-versatile, llama3-70b-8192 |
| SambaNova | DeepSeek-V3-0324 | DeepSeek-V3-0324 |
Full Stack (Recommended) - Single Command:
The application will serve both the backend API and pre-built frontend at http://localhost:8000
Backend API Documentation: http://localhost:8000/docs
Navigate to http://localhost:8000 for interactive research with real-time progress tracking.
- DeepResearchBench: Comprehensive research evaluation
- ResearchQA: Question-answering with citation verification
- DeepConsult: Consulting-style analysis tasks
The EDR-200 dataset contains 201 complete agentic research trajectories generated by Enterprise Deep Research—99 queries from DeepResearch Bench and 102 queries from DeepConsult. Unlike prior benchmarks that only capture final outputs, these trajectories expose the full reasoning process across search, reflection, and synthesis steps, enabling fine-grained analysis of agentic planning and decision-making dynamics.
Refer to our detailed benchmarking guide.
Licensed under Apache 2.0.
Acknowledgments: Built on LangGraph, Tavily, React, Tailwind CSS, and FastAPI.
.png)




