The Chinese University of Hong Kong, Shenzhen, China
https://markwwen.github.io
Stars
❤️ Emotional First Aid Dataset: a corpus of psychological counseling Q&A and chatbot dialogue
Domain-specific language designed to streamline the development of high-performance GPU/CPU/accelerator kernels
The official repo for the paper "Direct Multi-token Decoding"
A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks …
🚀 Efficient implementations of state-of-the-art linear attention models
[EMNLP 2025 Main] SpecVLM: Enhancing Speculative Decoding of Video LLMs via Verifier-Guided Token Pruning
MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining
HArmonizedSS / HASS
Forked from SafeAILab/EAGLE. Official implementation of "Learning Harmonized Representations for Speculative Sampling" (HASS)
Codes for our paper "Speculative Decoding: Exploiting Speculative Execution for Accelerating Seq2seq Generation" (EMNLP 2023 Findings)
Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.
Summary of awesome work on optimizing LLM inference
[NeurIPS 2025] Scaling Speculative Decoding with Lookahead Reasoning
[ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding
Mimosa-Lin / SpecForge
Forked from sgl-project/SpecForge. Train speculative decoding models effortlessly and port them smoothly to SGLang serving.
📰 Must-read papers and blogs on Speculative Decoding ⚡️
Official Implementation of EAGLE-1 (ICML'24), EAGLE-2 (EMNLP'24), and EAGLE-3 (NeurIPS'25).
PyTorch code for the Energy-Based Transformers paper: generalizable reasoning and scalable learning
Official Schlably Repository by the Institute for TMDT
PyTorch implementation of paper "Response Length Perception and Sequence Scheduling: An LLM-Empowered LLM Inference Pipeline".
Efficient Interactive LLM Serving with Proxy Model-based Sequence Length Prediction | A tiny BERT model can tell you the verbosity of an LLM (with low latency overhead!)