songmzhang

Songming Zhang songmzhang

PhD student at Beijing Jiaotong University

15 followers · 2 following

Beijing Jiaotong University
Beijing
06:39 (UTC +08:00)
https://songmzhang.github.io/

Achievements

Stars

chrisliu298 / awesome-on-policy-distillation

A curated collection of papers, technical reports, frameworks, and tools for on-policy distillation (OPD) of large language models

353 7 Updated Jun 16, 2026

sgl-project / sglang

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 29,082 6,557 Updated Jun 16, 2026

songmzhang / KDFlow

A user-friendly & efficient knowledge distillation framework for LLMs, supporting off-policy, on-policy (OPD), cross-tokenizer, multimodal, and on-policy self-distillation.

Python 199 15 Updated Jun 5, 2026

XZhang00 / LU-LAFNs

Python 5 Updated Sep 11, 2025

XZhang00 / QSTR

Code for EMNLP2023 paper "A Quality-based Syntactic Template Retriever for Syntactically-controlled Paraphrase Generation".

Python 4 Updated Mar 20, 2024

XZhang00 / CM-Align

Code for EMNLP-2025 (Findings) paper “CM-Align: Consistency-based Multilingual Alignment for Large Language Models”.

Python 3 Updated Sep 11, 2025

XZhang00 / LayerMoE

Shell 6 3 Updated Sep 10, 2025

XZhang00 / M-Thinker

Code for "Think Natively: Unlocking Multilingual Reasoning with Consistency-Enhanced Reinforcement Learning".

Python 27 Updated Nov 11, 2025

linkedin / Liger-Kernel

Efficient Triton Kernels for LLM Training

Python 6,441 541 Updated Jun 16, 2026

THUDM / slime

slime is an LLM post-training framework for RL Scaling.

Python 6,153 897 Updated Jun 16, 2026

modelscope / ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-V4, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Gemma4, Llava, …

Python 14,530 1,481 Updated Jun 16, 2026

NVIDIA / Megatron-LM

Ongoing research training transformer models at scale

Python 16,724 4,088 Updated Jun 16, 2026

songmzhang / AlignDistil

Code for ACL 2025 Paper "AlignDistil: Token-Level Language Model Alignment as Adaptive Policy Distillation"

Python 3 Updated Aug 26, 2025

verl-project / verl

verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

Python 22,004 4,085 Updated Jun 16, 2026

FlagOpen / FlagEmbedding

Retrieval and Retrieval-augmented LLMs

Python 11,832 889 Updated Apr 22, 2026

songmzhang / DSKDv2

The official implementation of the paper "A Dual-Space Framework for General Knowledge Distillation of Large Language Models".

Python 15 1 Updated Jan 4, 2026

lmarena / arena-hard-auto

Arena-Hard-Auto: An automatic LLM benchmark.

Python 1,036 153 Updated Jun 21, 2025

FunnySaltyFish / Better-Ruozhiba

【逐条处理完成】人为审核+修改每一条的弱智吧精选问题QA数据集

257 11 Updated Feb 21, 2026

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)

Python 9,648 969 Updated Jun 9, 2026

unslothai / unsloth

Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.

Python 66,655 5,978 Updated Jun 16, 2026

open-compass / opencompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Python 7,093 790 Updated Jun 15, 2026

EleutherAI / lm-evaluation-harness

A framework for few-shot evaluation of language models.

Python 12,979 3,345 Updated Jun 2, 2026

Nicolas-BZRD / llm-recipes

Python 33 7 Updated Mar 13, 2024

songmzhang / DSKD

Repo for the EMNLP'24 Paper "Dual-Space Knowledge Distillation for Large Language Models". A general white-box KD framework for both same-tokenizer and cross-tokenizer LLM distillation.

Python 63 12 Updated Mar 21, 2026

DefangChen / Knowledge-Distillation-Paper

This resposity maintains a collection of important papers on knowledge distillation (awesome-knowledge-distillation)).

85 17 Updated Mar 19, 2025

meta-pytorch / torchtune

PyTorch native post-training library

Python 5,773 729 Updated Jun 16, 2026

HuangOwen / Awesome-LLM-Compression

Awesome LLM compression research papers and tools.

1,846 128 Updated Feb 23, 2026

Tebmer / Awesome-Knowledge-Distillation-of-LLMs

This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicitation and Distillation Algorithms, and explore the Skill & V…

1,291 72 Updated Mar 9, 2025

jongwooko / distillm

Official PyTorch implementation of DistiLLM: Towards Streamlined Distillation for Large Language Models (ICML 2024)

Python 267 29 Updated Mar 13, 2025

xszyou / Fay

fay是一个帮助数字人（2.5d、3d、移动、pc、网页）或大语言模型（openai兼容、deepseek）连通业务系统的agent框架。

Python 12,873 2,288 Updated May 29, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Songming Zhang songmzhang

Achievements

Achievements

Block or report songmzhang

Stars

chrisliu298 / awesome-on-policy-distillation

sgl-project / sglang

songmzhang / KDFlow

XZhang00 / LU-LAFNs

XZhang00 / QSTR

XZhang00 / CM-Align

XZhang00 / LayerMoE

XZhang00 / M-Thinker

linkedin / Liger-Kernel

THUDM / slime

modelscope / ms-swift

NVIDIA / Megatron-LM

songmzhang / AlignDistil

verl-project / verl

FlagOpen / FlagEmbedding

songmzhang / DSKDv2

lmarena / arena-hard-auto

FunnySaltyFish / Better-Ruozhiba

OpenRLHF / OpenRLHF

unslothai / unsloth

open-compass / opencompass

EleutherAI / lm-evaluation-harness

Nicolas-BZRD / llm-recipes

songmzhang / DSKD

DefangChen / Knowledge-Distillation-Paper

meta-pytorch / torchtune

HuangOwen / Awesome-LLM-Compression

Tebmer / Awesome-Knowledge-Distillation-of-LLMs

jongwooko / distillm

xszyou / Fay