Expert Reasoning System

A high-performance reasoning system that evaluates complex questions using multiple expert LLM agents with different strengths and specializations.

Overview

This system implements a multi-expert approach to problem solving, leveraging the strengths of different large language models (LLMs) with specialized tools and strategic routing. The system:

Analyzes and classifies questions based on type and complexity
Routes questions to appropriate processing pipelines
Generates multiple expert responses using reasoning-optimized prompts
Executes code for computational problems when needed
Searches the web for knowledge-intensive questions
Evaluates all responses and selects the most accurate answer
Provides detailed analysis and metrics on system performance

Features

Multiple Expert Models: Leverages GPT-4o, Claude 3.7 Sonnet, Gemini 2.5 Pro, and other state-of-the-art models
Advanced Reasoning: Uses specialized reasoning techniques like chain-of-thought and high reasoning effort
Code Generation & Execution: Automatically generates and executes Python code for computational problems
Web Search Integration: Uses Gemini's search capabilities for knowledge-intensive questions
Strategic Router: Intelligently routes questions to the optimal processing pipeline
Comprehensive Evaluation: Verifies responses against known answers and selects the best one
Detailed Analytics: Provides performance metrics by question type, category, and expert

Architecture

The system follows a clean, modular architecture:

Agent Layer: Handles interactions with various LLM providers
Tools Layer: Provides specialized capabilities like search and code execution
Memory Layer: Manages storage and analysis of results
Routing Layer: Determines the optimal strategy for each question

Installation

Clone the repository
Install dependencies
Configure API keys Create a .env file in the root directory with your API keys:

Usage

Run the system on a dataset of questions:

python main.py --file test_hle.xlsx --sample 100

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
agent		agent
build/agent		build/agent
memory		memory
results/detailed		results/detailed
tools		tools
.env		.env
ARCHITECTURE.md		ARCHITECTURE.md
LICENSE		LICENSE
README.md		README.md
__init__.py		__init__.py
expert_reasoning.log		expert_reasoning.log
main.py		main.py
requirements.txt		requirements.txt
test_hle.xlsx		test_hle.xlsx

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Expert Reasoning System

Overview

Features

Architecture

Installation

Usage

About

Uh oh!

Releases

Packages

Languages

License

daonet/HPA-HLE

Folders and files

Latest commit

History

Repository files navigation

Expert Reasoning System

Overview

Features

Architecture

Installation

Usage

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages