Stars
A curated list of practical Codex skills for automating workflows across the Codex CLI and API.
The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.
[CIKM 2025] Constraint Back-translation Improves Complex Instruction Following of Large Language Models
[NIPS 2025 DB Spotlight] AGENTIF: Benchmarking Instruction Following of Large Language Models in Agentic Scenarios
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Official inference framework for 1-bit LLMs
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models
A curated list of awesome papers on dataset distillation and related applications.
A high-throughput and memory-efficient inference and serving engine for LLMs
[EMNLP 2025] LoSiA: Efficient High-Rank Fine-Tuning via Subnet Localization and Optimization (Oral)
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
MiniCPM5-1B: A SOTA 1B on-device LLM, small yet powerful.
[ACL 2024 Demo] Official GitHub repo for UltraEval: An open source framework for evaluating foundation models.