Skip to content
View xwqtju's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report xwqtju

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
AGS Script 1 Updated Jun 10, 2026

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 7,391 1,053 Updated Jun 4, 2026

FlashInfer: Kernel Library for LLM Serving

Python 5,820 1,061 Updated Jun 18, 2026

Claude Code 中文全面上手指南。基于 luongnv89/claude-howto 本土化重写,面向中国小白用户,保留命令与配置兼容性,并附学习路径与本地化校验护栏。

Python 1,925 292 Updated Jun 18, 2026

DeepSeek-V4 Lecture

Python 19 1 Updated Jun 5, 2026

分享AI Infra知识&代码练习:PyTorch/vLLM/SGLang框架入门⚡️、性能加速🚀、大模型基础🧠、AI软硬件🔧等

Jupyter Notebook 2,635 235 Updated May 30, 2026

Ascend PyTorch adapter (torch_npu). Mirror of https://gitcode.com/Ascend/pytorch

Python 537 76 Updated Jun 18, 2026

Fast and memory-efficient exact attention

Python 24,181 2,840 Updated Jun 18, 2026

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

Python 13,908 2,479 Updated Jun 18, 2026

Repo for Qwen Image Finetune

Jupyter Notebook 1 Updated Dec 12, 2025

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-V4, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Gemma4, Llava, …

Python 14,557 1,483 Updated Jun 18, 2026

Train transformer language models with reinforcement learning.

Python 18,667 2,792 Updated Jun 18, 2026

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 21,282 2,337 Updated Jun 18, 2026

Enjoy the magic of Diffusion models!

Python 12,595 1,232 Updated Jun 18, 2026

Repo for Qwen Image Finetune

Jupyter Notebook 240 26 Updated Jun 8, 2026

[NeurIPS 24 Spotlight] MaskLLM: Learnable Semi-structured Sparsity for Large Language Models

Python 188 14 Updated Jan 1, 2025

Officiel code for PATCH: Learnable Tile-level Hybrid Sparsity for LLMs

Python 6 Updated Jun 15, 2026

[ICML2025] SpargeAttention: A training-free sparse attention that accelerates any model inference.

Cuda 1,005 95 Updated Feb 25, 2026

🌟100+ 原创 LLM / RL 原理图📚,《大模型算法》作者巨献!💥(100+ LLM/RL Algorithm Maps )

Python 4,538 429 Updated Jun 14, 2026

[ICML 2025] Official PyTorch implementation of "FlatQuant: Flatness Matters for LLM Quantization"

Python 218 32 Updated Nov 25, 2025

Fast and memory-efficient exact attention

Python 33 1 Updated Dec 2, 2024

LLM Finetuning with peft

Jupyter Notebook 2,929 764 Updated Aug 1, 2025

Awesome list for LLM quantization

Python 428 26 Updated Apr 20, 2026

Awesome LLM compression research papers and tools.

1,846 128 Updated Feb 23, 2026

📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉

Python 5,339 406 Updated Apr 20, 2026

Summarize existing representative LLMs text datasets.

1,473 150 Updated Mar 11, 2026

A curated list of awesome open-source libraries for production LLM

523 74 Updated Dec 31, 2024

A curated list for Efficient Large Language Models

Python 2,019 165 Updated Jun 17, 2025

Windows 和 Office 激活工具 MAS (Microsoft-Activation-Scripts) 的汉化版

Batchfile 1,103 106 Updated Jun 2, 2026
Next