Skip to content
Change the repository type filter

All

    Repositories list

    • repo

      Public
      Python
      0000Updated Dec 17, 2025Dec 17, 2025
    • IASC

      Public
      LLMs for Constructed Languages
      HTML
      33610Updated Dec 16, 2025Dec 16, 2025
    • ShinkaEvolve: Towards Open-Ended and Sample-Efficient Program Evolution
      Python
      12474165Updated Dec 14, 2025Dec 14, 2025
    • Kamon

      Public
      Data and code for understanding and generation of Kamon.
      Python
      43100Updated Dec 14, 2025Dec 14, 2025
    • Sudoku-Bench

      Public
      An AI benchmark for creative, human-like problem solving using Sudoku variants
      JavaScript
      1414310Updated Dec 13, 2025Dec 13, 2025
    • ALE-Bench

      Public
      The official repository of ALE-Bench
      Python
      1613920Updated Dec 13, 2025Dec 13, 2025
    • treequest

      Public
      A Tree Search Library with Flexible API for LLM Inference-Time Scaling
      Python
      6550310Updated Dec 9, 2025Dec 9, 2025
    • Continuous Thought Machines, because thought takes time and reasoning is a process.
      Python
      2481.6k03Updated Dec 9, 2025Dec 9, 2025
    • Browser-based chat UI for TinySwallow-1.5B that runs without API calls.
      CSS
      812800Updated Dec 1, 2025Dec 1, 2025
    • Python
      97130Updated Nov 22, 2025Nov 22, 2025
    • Neuroevolution Community
      2600Updated Nov 17, 2025Nov 17, 2025
    • MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering
      Python
      189000Updated Nov 12, 2025Nov 12, 2025
    • Python
      55101Updated Nov 6, 2025Nov 6, 2025
    • Python
      0400Updated Oct 31, 2025Oct 31, 2025
    • The AI Scientist-v2: Workshop-Level Automated Scientific Discovery via Agentic Tree Search
      Python
      3621.9k386Updated Oct 24, 2025Oct 24, 2025
    • asal

      Public
      Automating the Search for Artificial Life with Foundation Models!
      Jupyter Notebook
      5244710Updated Oct 23, 2025Oct 23, 2025
    • shachi

      Public
      Reimagining Agent-based Modeling with Large Language Model Agents via Shachi
      Python
      01100Updated Oct 10, 2025Oct 10, 2025
    • TAID

      Public
      Official implementation of "TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models"
      Python
      911930Updated Oct 6, 2025Oct 6, 2025
    • The code repository of the paper: Competition and Attraction Improve Model Fusion
      Jupyter Notebook
      3316810Updated Aug 25, 2025Aug 25, 2025
    • BALROG

      Public
      Benchmarking Agentic LLM and VLM Reasoning On Games
      Python
      41100Updated Aug 19, 2025Aug 19, 2025
    • Evaluating the performance of LLMs on Japanese challenging financial tasks.
      Python
      32700Updated Jul 28, 2025Jul 28, 2025
    • Reasoning-based Evaluation and Ranking of Translations.
      Python
      41810Updated Jul 18, 2025Jul 18, 2025
    • Python
      1810510Updated Jun 30, 2025Jun 30, 2025
    • RLT

      Public
      Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.
      Python
      5335430Updated Jun 23, 2025Jun 23, 2025
    • edinet2dataset is a tool to construct financial dataset using EDINET.
      Python
      72600Updated Jun 11, 2025Jun 11, 2025
    • Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input
      Python
      6492720Updated Jun 8, 2025Jun 8, 2025
    • L2D

      Public
      Large language models to diffusion finetuning code
      Python
      32100Updated Jun 2, 2025Jun 2, 2025
    • The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬
      Jupyter Notebook
      1.7k12k8921Updated Apr 26, 2025Apr 26, 2025
    • Python
      2027600Updated Apr 18, 2025Apr 18, 2025
    • CycleQD

      Public
      CycleQD is a framework for parameter space model merging.
      Python
      64500Updated Feb 1, 2025Feb 1, 2025