langchain-ai/open_deep_researchView GitHub Homepage for Latest Official Releases

An open-source deep research assistant that uses AI agents to automatically conduct in-depth research and generate comprehensive reports.

MITPythonopen_deep_researchlangchain-ai 7.2k Last Updated: August 06, 2025

Open Deep Research Project Details

Project Overview

Open Deep Research, developed by LangChain AI, is an experimental, fully open-source deep research assistant capable of automating in-depth research and generating comprehensive reports on any topic. The core goal of this project is to provide a simple, configurable open-source solution that competes with deep research products from companies like OpenAI, Anthropic, Perplexity, and Google.

Core Features

🎯 Dual Architecture Design

The project offers two distinct implementations, each with its own advantages:

Workflow Architecture
- Structured plan-execute workflow
- Human-in-the-loop interaction, allowing user feedback and approval of the report plan
- Section-by-section creation, with reflection between search iterations
- Supports all search tool providers
Multi-Agent Architecture
- Supervisor-researcher architecture
- Multiple independent agents working in parallel
- Significantly reduces report generation time
- Specialized tool design

🔧 Highly Configurable

Open Deep Research provides extensive configuration options to customize the research process and model behavior. All configurations can be set via the Web UI, environment variables, or direct configuration modification:

Max Structured Output Retries (Default: 3): Maximum number of retries for structured output calls
Allow Clarification (Default: true): Whether to allow the researcher to ask clarifying questions before starting research
Max Concurrent Research Units (Default: 5): Maximum number of research units to run concurrently using sub-agents
Search API (Default: Tavily): Supports multiple search API choices
Max Researcher Iterations (Default: 3): Number of times the research supervisor reflects on the research and poses follow-up questions

🤖 Multi-Model Support

Open Deep Research uses multiple specialized models for different research tasks:

Summary Model (Default: openai:gpt-4.1-nano): Summarizes research findings from search APIs
Research Model (Default: openai:gpt-4.1): Conducts research and analysis
Compression Model (Default: openai:gpt-4.1-mini): Compresses research findings from sub-agents
Final Report Model (Default: openai:gpt-4.1): Writes the final comprehensive report

All models are configured using the init_chat_model() API, supporting providers such as OpenAI, Anthropic, Google Vertex AI, and more.

🔍 Rich Search Tool Integration

The project supports various search APIs:

Tavily API - General web search
Perplexity API - General web search
Exa API - Powerful neural search
ArXiv - Academic papers in physics, mathematics, computer science, etc.
PubMed - Biomedical literature
Linkup API - General web search
DuckDuckGo API - General web search
Google Search API - Custom search engine

Technical Architecture

Built on LangGraph

Open Deep Research is built on LangGraph, a framework for building resilient language agent graphs. This architecture enables the project to:

Flexibly apply different research strategies
Use intermediate results to guide exploration
Handle open-ended research tasks

Three-Step Research Process

Open Deep Research uses agents to conduct research as part of a three-step process:

Scoping
- User Clarification: Gathers all user context required for the research
- Brief Generation: Converts detailed chat interactions into a comprehensive, focused research brief
Research
- Conducts research using a supervisor agent
- Delegates research tasks to an appropriate number of sub-agents
- Determines if the research brief can be broken down into independent sub-topics
Report Generation
- Writes the report in a single pass
- Avoids coordination difficulties of multiple agents writing in parallel

Installation and Usage

Quick Start

# Clone the repository
git clone https://github.com/langchain-ai/open_deep_research.git
cd open_deep_research

# Set environment variables
cp .env.example .env

# Install and start the LangGraph server
uvx --refresh --from "langgraph-cli[inmem]" --with-editable . --python 3.11 langgraph dev --allow-blocking

Install as a Python Package

pip install open-deep-research

Usage

Via Studio UI:
- 🚀 API: http://127.0.0.1:2024
- 🎨 Studio UI: https://smith.langchain.com/studio/?baseUrl=http://127.0.0.1:2024
- 📚 API Docs: http://127.0.0.1:2024/docs
Jupyter Notebook:
- View examples: src/open_deep_research/graph.ipynb
- Multi-agent example: src/open_deep_research/multi_agent.ipynb

Platform Integration

Open Agent Platform (OAP)

The Open Agent Platform (OAP) is a user interface where non-technical users can build and configure their own agents. OAP is ideal for allowing users to configure the deep researcher, using different MCP tools and search APIs best suited for their needs.

Performance and Evaluation

Benchmarking Results

Based on the results, Open Deep Research consistently and significantly improves answer quality on base models, demonstrated across Llama, DeepSeek, and Qwen flagship models, showcasing the quality of the workflow.

Testing Capabilities

# Test with default Anthropic models
python tests/run_test.py --all

# Test with OpenAI o3 models
python tests/run_test.py --all \
  --supervisor-model "openai:o3" \
  --researcher-model "openai:o3" \
  --planner-provider "openai" \
  --planner-model "o3" \
  --writer-provider "openai" \
  --writer-model "o3" \
  --eval-model "openai:o3" \
  --search-api "tavily"

Deployment Options

Local Deployment: Run locally via the LangGraph server
Cloud Deployment: Easily deployable to the LangGraph Platform

Project Ecosystem

Related Projects

local-deep-researcher: Fully local web research and report writing assistant
LangGraph: Framework for building resilient language agent graphs
LangChain: For building context-aware reasoning applications

Community and Open Source

Since the release of OpenAI, the open-source community has made remarkable efforts in developing deep research alternatives. This project aims to provide the community with an accessible open-source deep research toolkit.

Technical Requirements

Model Compatibility

Structured Output Support: All models must support structured output
Tool Calling Support: Agent models need to support tool calling
Search API Compatibility: Research and compression models must support the selected search API

Tested Models

Claude 3.7
o3 and o3-mini
gpt-4.1
llama-3.3-70b-versatile (via Groq)

Advantages and Features

Design Principles

A key design principle of Open Deep Research is flexibility, enabling the exploration of different research strategies based on the request. Agents are well-suited for research because they can flexibly apply different strategies and use intermediate results to guide exploration.

Core Advantages

Open Source Transparency: Fully open source, allowing community review and improvement
Highly Configurable: Supports custom models, search tools, and report structures
Multi-Architecture Support: Offers both Workflow and Multi-Agent implementations
Broad Compatibility: Supports multiple AI model providers and search APIs
Human-in-the-Loop: Supports user feedback and report plan approval
Specialized Design: Uses specialized models for different tasks

Future Development

The project is continuously evolving, with plans to incorporate more features and improvements:

Expand multi-agent architecture support for more search tools
Improve report generation quality
Enhance user interaction experience
Optimize performance and resource utilization

Conclusion

Open Deep Research represents a significant advancement in open-source AI research tools. It not only provides functionalities that compete with commercial products but also maintains the transparency and customizability inherent in open-source projects. Whether for researchers, developers, or enterprise users, this powerful research assistant can be customized and deployed to meet their specific needs.