Lists (7)
Sort Name ascending (A-Z)
Stars
Code Repository of Evaluating Quantized Large Language Models
推荐算法实战(Recommend algorithm)
Classic papers and resources on recommendation
A Lighting Pytorch Framework for Recommendation Models, Easy-to-use and Easy-to-extend.
Autonomous Agents (LLMs) research papers. Updated Daily.
vLLM Documentation in Chinese Simplified / vLLM 中文文档
Concept Induction: Analyzing Unstructured Text with High-Level Concepts Using LLooM (CHI 2024 paper). LLooM automatically surfaces high-level concepts to analyze unstructured text.
An Open Phone Agent Model & Framework. Unlocking the AI Phone for Everyone
A smart, powerful, and beautiful excalidraw drawing tool.Draw Professional Charts with Natural Language
Project for ECE143, Data Visualization
[TMLR] A curated list of language modeling researches for code (and other software engineering activities), plus related datasets.
Dynamic Cheatsheet: Test-Time Learning with Adaptive Memory
⚡A CLI tool for code structural search, lint and rewriting. Written in Rust
LLM model quantization (compression) toolkit with HW acceleration support for Nvidia, AMD, Intel GPU and Intel/AMD/Apple CPU via HF, vLLM, and SGLang.
[EMNLP 2024 & AAAI 2026] A powerful toolkit for compressing large models including LLMs, VLMs, and video generative models.
Awesome list for LLM quantization
[ACL 2025] Graph-guided agentic framework for code localization https://arxiv.org/abs/2503.09089
Welcome to the official repository of SINQ! A novel, fast and high-quality quantization method designed to make any Large Language Model smaller while preserving accuracy.
Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels
MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.
FlashInfer: Kernel Library for LLM Serving