[NeurIPS 2025] 🌐 WebThinker: Empowering Large Reasoning Models with Deep Research Capability
-
Updated
Sep 20, 2025 - Python
[NeurIPS 2025] 🌐 WebThinker: Empowering Large Reasoning Models with Deep Research Capability
MiroMind Research Agent: Fully Open-Source Deep Research Agent with Reproducible State-of-the-Art Performance on FutureX, GAIA, HLE, BrowserComp and xBench.
MiroThinker is open-source agentic models trained for deep research and complex tool use scenarios.
An Go library of synchronization primitives to help make use of hardware transactional memory (HTM)
An easy-to-use evaluation tool for running Humanity's Last Exam on (locally) hosted Ollama instances.
Tiny experiments that evaluate how different social-cue prompts influence large language model (LLM) decision-making on HLE dataset multiple-choice questions.
This repository explores how social cues impact decision-making in large language models using a structured pipeline. Join us in analyzing various prompts and their effects on responses to multiple-choice questions! 🐙✨
Add a description, image, and links to the hle topic page so that developers can more easily learn about it.
To associate your repository with the hle topic, visit your repo's landing page and select "manage topics."