Official repo of Toucan: Synthesizing 1.5M Tool-Agentic Data from Real-World MCP Environments

Python 237 13 Updated Dec 16, 2025

Chinese Legal LLaMA (LLaMA for the Chinese legal domain)

Python 990 131 Updated Aug 28, 2024

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 27,101 1,972 Updated Jan 9, 2026
Jupyter Notebook 1,364 163 Updated Mar 24, 2026

Fast and memory-efficient exact attention

Python 23,350 2,614 Updated Apr 14, 2026

Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (LLM).

Python 605 63 Updated Apr 9, 2026

[ICLR 2025] Released code for paper "Spurious Forgetting in Continual Learning of Language Models"

Jupyter Notebook 60 5 Updated May 9, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 42,098 4,793 Updated Apr 13, 2026

A book for Learning the Foundations of LLMs

16,037 1,529 Updated Dec 12, 2025

Source code for a LoRA-based continual relation extraction method.

Python 14 2 Updated Sep 25, 2023

Continual Learning for Transformers that allows training on multiple tasks sequentially while preserving knowledge from earlier tasks using Elastic Weight Consolidation.

Python 17 Updated Aug 8, 2025
Python 1,128 53 Updated Jan 10, 2026

PyContinual (An Easy and Extendible Framework for Continual Learning)

Python 324 69 Updated Jan 29, 2024

Code for the paper "Evaluating Large Language Models Trained on Code"

Python 3,199 440 Updated Jan 17, 2025

Official Repository of "Learning to Reason under Off-Policy Guidance"

Python 437 58 Updated Mar 20, 2026

MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. Trains medical large language models, with support for incremental pretraining (PT), supervised fine-tuning (SFT), RLHF, DPO, ORPO, and GRPO.

Python 5,229 735 Updated Apr 14, 2026
Jupyter Notebook 98 6 Updated Jul 24, 2025

All PDF textbooks for primary school, middle school, high school, and university.

Roff 69,280 15,463 Updated Oct 18, 2025

A streamlined and customizable framework for efficient large model (LLM, VLM, AIGC) evaluation and performance benchmarking.

Python 2,663 312 Updated Apr 14, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 76,559 15,566 Updated Apr 14, 2026

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 70,090 8,569 Updated Apr 12, 2026

Sky-T1: Train your own O1 preview model within $450

Python 3,373 343 Updated Jul 12, 2025

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5, DeepSeek-R1, GLM-5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, Phi4, ...)…

Python 13,704 1,348 Updated Apr 14, 2026

verl: Volcano Engine Reinforcement Learning for LLMs

Python 20,674 3,651 Updated Apr 14, 2026

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.

Python 6,872 760 Updated Apr 14, 2026