

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 69,761 13,279 Updated Feb 7, 2026

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 67,021 8,145 Updated Feb 4, 2026

PDF textbooks for all primary, middle, and high school grades and for university.

Roff 64,905 14,478 Updated Oct 18, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 41,561 4,708 Updated Feb 7, 2026

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 26,511 1,875 Updated Jan 9, 2026

Fast and memory-efficient exact attention

Python 22,137 2,356 Updated Feb 8, 2026

verl: Volcano Engine Reinforcement Learning for LLMs

Python 19,051 3,205 Updated Feb 6, 2026

A book for Learning the Foundations of LLMs

15,636 1,476 Updated Dec 12, 2025

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …

Python 12,584 1,195 Updated Feb 7, 2026

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.

Python 6,644 736 Updated Feb 6, 2026

MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. Trains medical large language models, implementing incremental pre-training (PT), supervised fine-tuning (SFT), RLHF, DPO, ORPO, and GRPO.

Python 4,728 687 Updated Jan 27, 2026

Sky-T1: Train your own O1 preview model within $450

Python 3,370 345 Updated Jul 12, 2025

Code for the paper "Evaluating Large Language Models Trained on Code"

Python 3,124 435 Updated Jan 17, 2025

A streamlined and customizable framework for efficient large model (LLM, VLM, AIGC) evaluation and performance benchmarking.

Python 2,380 275 Updated Feb 7, 2026
Jupyter Notebook 1,295 160 Updated Jan 4, 2026
Python 1,088 51 Updated Jan 10, 2026

Chinese legal LLaMA (LLaMA for the Chinese legal domain)

Python 979 130 Updated Aug 28, 2024

Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (LLM).

Python 520 54 Updated Feb 6, 2026

Official Repository of "Learning to Reason under Off-Policy Guidance"

Python 413 51 Updated Oct 4, 2025

PyContinual (An Easy and Extendible Framework for Continual Learning)

Python 324 69 Updated Jan 29, 2024

Official repo of Toucan: Synthesizing 1.5M Tool-Agentic Data from Real-World MCP Environments

Python 224 11 Updated Dec 16, 2025
Jupyter Notebook 79 6 Updated Jul 24, 2025

[ICLR 2025] Released code for paper "Spurious Forgetting in Continual Learning of Language Models"

Jupyter Notebook 59 4 Updated May 9, 2025

Continual learning for Transformers: trains on multiple tasks sequentially while preserving knowledge from earlier tasks using Elastic Weight Consolidation.

Python 17 Updated Aug 8, 2025

Source code for a LoRA-based continual relation extraction method.

Python 14 2 Updated Sep 25, 2023