Skip to content
Change the repository type filter

All

    Repositories list

    • 🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
      Python
      32k001Updated Sep 25, 2025Sep 25, 2025
    • Python
      0000Updated Sep 19, 2025Sep 19, 2025
    • Python
      0000Updated Sep 5, 2025Sep 5, 2025
    • Simple examples of how to run public benchmarks with Runloop
      Python
      1000Updated Aug 29, 2025Aug 29, 2025
    • codex

      Public
      Lightweight coding agent that runs in your terminal
      Rust
      6.9k000Updated Jul 20, 2025Jul 20, 2025
    • An open-source AI agent that brings the power of Gemini directly into your terminal.
      TypeScript
      10k000Updated Jul 20, 2025Jul 20, 2025
    • Python
      0200Updated Jul 13, 2025Jul 13, 2025
    • SWE-agent

      Public
      SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]
      Python
      1.9k000Updated Jul 13, 2025Jul 13, 2025
    • litellm

      Public
      Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
      Python
      5.1k000Updated Jul 2, 2025Jul 2, 2025
    • Python
      0000Updated Jul 1, 2025Jul 1, 2025
    • diff-viz

      Public
      JavaScript
      0000Updated Jun 23, 2025Jun 23, 2025
    • OpenHands

      Public
      🙌 OpenHands: Code Less, Make More
      Python
      8.1k000Updated Jun 4, 2025Jun 4, 2025
    • Shell
      0001Updated May 14, 2025May 14, 2025
    • lcb_utils

      Public
      Python
      0000Updated May 8, 2025May 8, 2025
    • Python
      0000Updated Feb 24, 2025Feb 24, 2025
    • rl-fun

      Public
      0000Updated Feb 23, 2025Feb 23, 2025
    • SWE-bench

      Public
      [ICLR 2024] SWE-bench: Can Language Models Resolve Real-world Github Issues?
      Python
      720000Updated Feb 22, 2025Feb 22, 2025
    • SWE-ReX

      Public
      Sandboxed code execution for AI agents, locally or on the cloud.
      Python
      92000Updated Feb 20, 2025Feb 20, 2025
    • debugger

      Public
      Python
      0000Updated Feb 7, 2025Feb 7, 2025
    • Python
      0000Updated Jan 31, 2025Jan 31, 2025
    • Python
      1000Updated Dec 4, 2024Dec 4, 2024
    • 0010Updated Dec 3, 2024Dec 3, 2024
    • puzzles

      Public
      Python
      0000Updated Nov 11, 2024Nov 11, 2024
    • search

      Public
      Rust
      0000Updated Nov 10, 2024Nov 10, 2024
    • bcb-modal

      Public
      run bigcodebench on modal
      Python
      0000Updated Oct 17, 2024Oct 17, 2024
    • LDB: A Large Language Model Debugger via Verifying Runtime Execution Step by Step
      Python
      54000Updated Oct 17, 2024Oct 17, 2024
    • EAI4Math

      Public
      Python
      2000Updated Oct 12, 2024Oct 12, 2024
    • Shell
      0000Updated Oct 8, 2024Oct 8, 2024
    • Python
      0000Updated Oct 2, 2024Oct 2, 2024
    • TeX
      0000Updated Sep 3, 2024Sep 3, 2024