Stars
Get started with building Fullstack Agents using Gemini 2.5 and LangGraph
Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding"
An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skills, subagents, and a message gateway, it handles different levels of…
Official implementation of the ICLR 2025 paper: Rethinking Bradley-Terry Models in Preference-based Reward Modeling: Foundations, Theory, and Alternatives
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains papers, codes, datasets, evaluations, and analyses.
Code and example data for the paper: Rule Based Rewards for Language Model Safety
L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning
Search-R1: An efficient, scalable RL training framework for LLMs that interleave reasoning with search-engine calls, built on veRL
A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks …
A Flexible Framework for Experimenting with Heterogeneous LLM Inference/Fine-tuning Optimizations
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
Streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
📰 Must-read papers and blogs on Speculative Decoding ⚡️
A highly optimized LLM inference acceleration engine for Llama and its variants.
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
High-speed Large Language Model Serving for Local Deployment
📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥
✨✨[NeurIPS 2025] VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction
Official Implementation of EAGLE-1 (ICML'24), EAGLE-2 (EMNLP'24), and EAGLE-3 (NeurIPS'25).
Instructions on how to use the Realtime API on Microcontrollers and Embedded Platforms
Aidan Bench attempts to measure <big_model_smell> in LLMs.