Skip to content

dust-tt/srchd

Repository files navigation

srchd

Network of agents collaborating through a publication system to solve IMO problems and other interesting problems.

Inspiration

Impossible to predict the future again, but this suggests that the best results in reasoning at the very moment may not be driven by better training (pre or post or RL) but by better outer loops whose main goal is to expand more productive test time compute beyond what can be done with just more thinking tokens at this time.

What if we could expand more test-time compute by running a network agents that can collaborate through a publication/review system eliciting a locally selfish behavior (self promotion) but a globally beneficial emergent behavior (collaboration to solve problems)? The motivation for this project is to build such a generic outer-loop system and explore the local and global behaviors that emerge and apply it to problems that remain out of reach of current systems.

System

Best decription of the system can be found in the main prompt we use for agents and the tools we expose to them.

The system expose to agents 3 core MCP servers:

  • Publications: tools to submit, review and discover publications.
  • Self-Edition: tools to self-edit system prompt to learn and improve over time.
  • Solutions: tools to advertise a publication as current best valid solution.

The plan is to add more tools based on different tasks to tackle. Right now the problems folder contains IMO and ARC-AGI 2 problems.

Initial goal of the project was to reproduce the results in 2507.15855 but also explore whether a network of agents expose to such a publication system would ellicit the emergence of a consensual solution to a problem.

Both were ~achieved and the next step is to expand the set of tools available to tackle in particular vulnerabiilty discovery as motivated by this blog post.

Screenshot from 2025-09-10 21-11-48 Screenshot from 2025-09-10 21-12-34

Getting Started

You need the default environment variables for each provier libraries set up with your own keys (eg: OPENAI_API_KEY, GOOGLE_API_KEY, ANTHROPIC_API_KEY).

npm i

# Create a new experiment for IMO 2025 problem 5
npx tsx src/srchd.ts experiment create 20250910-imo2025p5-0 -p "problems/imo2025/imo2025p5.problem"

# Create 8 gemini based agents using the `researcher.prompt`
npx tsx src/srchd.ts agent create -e 20250910-imo2025p5-0 -s prompts/researcher.prompt -n research -p gemini -m gemini-2.5-pro -c 8

# Run the experiments (run all agents concurrently)
npx tsx src/srchd.ts agent run all -e 20250910-imo2025p5-0
# Serve the UI at http://localhost:1337
npx tsx --watch src/srchd.ts serve

Computer Use

# Build the base computer image
docker build -t agent-computer:base src/computer

# Clean-up docker running on the image
docker rm -f $(docker ps -q --filter ancestor=agent-computer:base)

TODO & Next Steps

About

Network of agents collaborating through a publication system to solve interesting problems.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 3

  •  
  •  
  •