Odysseusq

Shengling Qin Odysseusq

B.E., Dept. of EE, Tsinghua Univ.

1 follower · 0 following

Dept. of EEE, HKU
Hong Kong

Achievements

Highlights

Stars

Odysseusq / nano-megatron

Nano Megatron

Python 1 Updated Feb 6, 2026

October2001 / Awesome-KV-Cache-Compression

📰 Must-read papers on KV Cache Compression (constantly updating 🤗).

714 26 Updated Apr 15, 2026

Odysseusq / VLCache

Official Repo for paper "VLCache: Computing 2% Vision Tokens and Reusing 98% for Vision-Language Inference"

Python 14 1 Updated Mar 28, 2026

thu-nics / R2R

[NeurIPS'25] The official code implementation for paper "R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing"

Python 90 13 Updated Apr 7, 2026

GeeeekExplorer / nano-vllm

Nano vLLM

Python 14,022 2,212 Updated Apr 26, 2026

vllm-project / compressed-tensors

A safetensors extension to efficiently store sparse quantized tensors on disk

Python 292 93 Updated Jun 12, 2026

IST-DASLab / gptq

Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".

Python 2,320 201 Updated Mar 27, 2024

MLSys-Learner-Resources / Awesome-MLSys-Blogger

The repository has collected a batch of noteworthy MLSys bloggers (Algorithms/Systems)

HTML 341 9 Updated Jan 5, 2025

deepseek-ai / EPLB

Expert Parallelism Load Balancer

Python 1,388 203 Updated Mar 24, 2025

bytedance / HLLM

HLLM: Enhancing Sequential Recommendations via Hierarchical Large Language Models for Item and User Modeling

Python 623 82 Updated Aug 26, 2025

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 26,311 2,439 Updated Apr 2, 2026

mit-han-lab / llm-awq

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Python 3,561 317 Updated Jul 17, 2025

hiyouga / LlamaFactory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 72,156 8,827 Updated Jun 13, 2026

LlamaChinese / Llama-Chinese

Llama中文社区，实时汇总最新Llama学习资料，构建最好的中文Llama大模型开源生态，完全开源可商用

Python 14,715 1,301 Updated Apr 6, 2025

zai-org / ChatGLM2-6B

ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型

Python 15,568 1,805 Updated Jun 27, 2024

DLLXW / baby-llama2-chinese

用于从头预训练+SFT一个小参数量的中文LLaMa2的仓库；24G单卡即可运行得到一个具备简单中文问答能力的chat-llama2.

Python 2,921 356 Updated May 21, 2024

google-research / google-research

Google Research

Jupyter Notebook 38,130 8,429 Updated Jun 12, 2026

Chumsy0725 / logit-adj-pytorch

PyTorch implementation of the paper: Long-tail Learning via Logit Adjustment

Python 117 11 Updated Sep 7, 2021

alibaba / EasyNLP

EasyNLP: A Comprehensive and Easy-to-use NLP Toolkit

Python 2,178 257 Updated Nov 27, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Shengling Qin Odysseusq

Achievements

Achievements

Highlights

Block or report Odysseusq

Stars

Odysseusq / nano-megatron

October2001 / Awesome-KV-Cache-Compression

Odysseusq / VLCache

thu-nics / R2R

GeeeekExplorer / nano-vllm

vllm-project / compressed-tensors

IST-DASLab / gptq

MLSys-Learner-Resources / Awesome-MLSys-Blogger

deepseek-ai / EPLB

bytedance / HLLM

huggingface / open-r1

mit-han-lab / llm-awq

hiyouga / LlamaFactory

LlamaChinese / Llama-Chinese

zai-org / ChatGLM2-6B

DLLXW / baby-llama2-chinese

google-research / google-research

Chumsy0725 / logit-adj-pytorch

alibaba / EasyNLP