
Get started with building Fullstack Agents using Gemini 2.5 and LangGraph

Jupyter Notebook 18,130 3,075 Updated Apr 16, 2026

Algorithm competition template library by 灵茶山艾府 💭💡🎈

Go 8,359 799 Updated Apr 27, 2026

Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding"

Python 951 119 Updated Apr 14, 2026
Python 359 20 Updated Jul 29, 2025

Kernels, of the mega variety :)

Python 715 56 Updated Apr 28, 2026

An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, subagents and message gateway, it handles different levels of…

Python 64,134 8,403 Updated Apr 28, 2026

Official implementation of the ICLR 2025 paper: Rethinking Bradley-Terry Models in Preference-based Reward Modeling: Foundations, Theory, and Alternatives

Python 73 5 Updated Apr 2, 2025

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 5,221 714 Updated Apr 28, 2026

Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains papers, codes, datasets, evaluations, and analyses.

260 10 Updated Mar 7, 2026

Code and example data for the paper: Rule Based Rewards for Language Model Safety

Jupyter Notebook 208 22 Updated Jul 19, 2024

L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning

Jupyter Notebook 265 30 Updated May 14, 2025

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 4,602 409 Updated Nov 13, 2025

A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks …

Python 2,580 370 Updated Apr 28, 2026

A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations

Python 17,087 1,277 Updated Apr 27, 2026

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 7,126 947 Updated Apr 24, 2026

Democratizing Reinforcement Learning for LLMs

Python 5,458 547 Updated Apr 28, 2026

s1: Simple test-time scaling

Python 6,647 761 Updated Jun 25, 2025

Streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL

Python 2,671 221 Updated Apr 27, 2026
Python 210 27 Updated May 5, 2025

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

Jupyter Notebook 1,976 307 Updated Aug 9, 2025

📰 Must-read papers and blogs on Speculative Decoding ⚡️

1,204 75 Updated Apr 18, 2026

A highly optimized LLM inference acceleration engine for Llama and its variants.

C++ 904 102 Updated Mar 18, 2026

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 3,167 493 Updated Apr 28, 2026

High-speed Large Language Model Serving for Local Deployment

C++ 9,393 566 Updated Jan 24, 2026

📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥

2,013 84 Updated Apr 15, 2026
Jupyter Notebook 132 15 Updated Nov 11, 2024

✨✨[NeurIPS 2025] VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

Python 2,508 181 Updated Mar 28, 2025

Official Implementation of EAGLE-1 (ICML'24), EAGLE-2 (EMNLP'24), and EAGLE-3 (NeurIPS'25).

Python 2,304 273 Updated Feb 20, 2026

Instructions on how to use the Realtime API on Microcontrollers and Embedded Platforms

1,581 203 Updated Mar 25, 2025

Aidan Bench attempts to measure <big_model_smell> in LLMs.

Python 318 14 Updated Jun 26, 2025