Princeton University
Princeton, NJ
Stars
Flash-Muon: An Efficient Implementation of Muon Optimizer
FULL Augment Code, Claude Code, Cluely, CodeBuddy, Comet, Cursor, Devin AI, Junie, Kiro, Leap.new, Lovable, Manus Agent Tools, NotionAI, Orchids.app, Perplexity, Poke, Qoder, Replit, Same.dev, Trae…
DSPy: The framework for programming—not prompting—language models
My learning notes/codes for ML SYS.
A live-streamed development of RL tuning for LLM agents
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Efficient, Flexible, and Highly Fault-Tolerant Model Service Management Based on SGLang
This project aims to collect the latest "call for reviewers" links from various top CS/ML/AI conferences/journals
A curated list for Efficient Large Language Models
Monitoring recent cross-research on LLM & RL on arXiv for control. If there are good papers, PRs are welcome.
High accuracy RAG for answering questions from scientific documents with citations
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
A collection of AI for Science paper walkthroughs (continuously updated); papers, datasets, and tutorials available for download at hyper.ai
A reading list for large models safety, security, and privacy (including Awesome LLM Security, Safety, etc.).
Spec-Bench: A Comprehensive Benchmark and Unified Evaluation Platform for Speculative Decoding (ACL 2024 Findings)
Official repository of the paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
TinyChatEngine: On-Device LLM Inference Library
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
SuperCLUE: A Comprehensive Benchmark for General-Purpose Chinese Foundation Models
Recent Transformer-based CV and related works.