Stars
Build your own AI SRE agents. The open source toolkit for the AI era ✨
The agent that grows with you
⌥ AI Coding agent for the terminal — hash-anchored edits, optimized tool harness, LSP, Python, browser, subagents, and more
AI agent toolkit: coding agent CLI, unified LLM API, TUI & web UI libraries, Slack bot, vLLM pods
🎃 A fast, out-of-the-box terminal built for AI coding.
A 15TB Collection of Physics Simulation Datasets
Adaptive Test-time Learning and Autonomous Specialization
🥷 Engineering habits you already know, turned into skills Claude can run.
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑🔬
Agent harness to publish your history from Claude Code et al. as Huggingface datasets.
Claude Code plugin that generates individualized knowledge systems from conversation. You describe how you think and work, have a conversation and get a complete second brain as markdown files you …
Master programming by recreating your favorite technologies from scratch.
Comprehensive open-source library of AI research and engineering skills for any AI model. Package the skills and your claude code/codex/gemini agent will be an AI research agent with full horsepowe…
Research code artifacts for Code World Model (CWM) including inference tools, reproducibility, and documentation.
The official repo of the paper "StressTest: Can YOUR Speech LM Handle the Stress?"
YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open
SlamKit is an open source tool kit for efficient training of SpeechLMs. It was used for "Slamming: Training a Speech Language Model on One GPU in a Day"
🔊 Text-Prompted Generative Audio Model
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
WavJourney: Compositional Audio Creation with LLMs
Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.
The official code for the SALMon🍣 benchmark (ICASSP 2025 - Oral)
[ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language modeling
Official PyTorch implementation of LaMI: Augmenting Large Language Models via Late Multi-Image Fusion (ACL 2026)
The official implementation of "A Language Modeling Approach to Diacritic-Free Hebrew TTS"