Starred repositories
🎨 Local-first, open-source Claude Design alternative. 🖥️ Native desktop app. ⚡ 259+ Skills · ✨ 142+ Design Systems 🖼️ Web · desktop · mobile prototypes · slides · images · videos · HyperFrames 📦 Sa…
A curated list of autonomous improvement loops, research agents, and autoresearch-style systems inspired by Karpathy's autoresearch.
SWE-bench: Can Language Models Resolve Real-world GitHub Issues?
A curated list of awesome Claude Skills, resources, and tools for customizing Claude AI workflows
TokenSpeed is a speed-of-light LLM inference engine.
[ACL25] Code for paper "ALPS: Attention Localization and Pruning Strategy for Efficient Alignment of Large Language Models"
Rethinking the Trust Region in LLM Reinforcement Learning
SimpleMem: Efficient Lifelong Memory for LLM Agents — Text & Multimodal
Reinforcement Learning via Self-Distillation (SDPO)
Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…
[Machine Learning] Code for paper "W2S: Weak-to-Strong Prompt Correction for Large Language Models"
System Level Intelligent Router for Mixture-of-Models at Cloud, Data Center and Edge
A streamlined and customizable framework for efficient large model (LLM, VLM, AIGC) evaluation and performance benchmarking.
Official repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code"
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
f.k.a. Awesome ChatGPT Prompts. Share, discover, and collect prompts from the community. Free and open source — self-host for your organization with complete privacy.
A curated list for Efficient Large Language Models
🏡 GitHub Pages template for personal academic homepage
[NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other models
[TMLR 2025] Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models
An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models
Code for ACL 2025 Main paper "Data Whisperer: Efficient Data Selection for Task-Specific LLM Fine-Tuning via Few-Shot In-Context Learning".
Sky-T1: Train your own O1 preview model within $450