Lists (1)
Sort Name ascending (A-Z)
Stars
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streaming speech generation, free-form voice design, and vivid voice…
nanoRLHF: from-scratch journey into how LLMs and RLHF really work.
The AI Scientist-v2: Workshop-Level Automated Scientific Discovery via Agentic Tree Search
ToolOrchestra is an end-to-end RL training framework for orchestrating tools and agentic workflows.
AIDE: AI-Driven Exploration in the Space of Code. The machine Learning engineering agent that automates AI R&D.
slime is an LLM post-training framework for RL Scaling.
Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.
Deep Agents is an agent harness built on langchain and langgraph. Deep Agents are equipped with a planning tool, a filesystem backend, and the ability to spawn subagents - making them well-equipped…
verl: Volcano Engine Reinforcement Learning for LLMs
Large-scale language modeling tutorials with PyTorch
Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.
[ACL 2025] DICE-BENCH: Evaluating the Tool-Use Capabilities of Large Language Models in Multi-Round, Multi-Party Dialogues
Tongyi Deep Research, the Leading Open-source Deep Research Agent
Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
Demo of a customer service use case implemented with the OpenAI Agents SDK
Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.
Official Repository of Absolute Zero Reasoner
Implementation of Sesame's Conversational Speech Model for Hugging Face Transformers
LUCY: Linguistic Understanding and Control Yielding Early Stage of Her
A live stream development of RL tunning for LLM agents