Stars
Official repo of Toucan: Synthesizing 1.5M Tool-Agentic Data from Real-World MCP Environments
Chinese Legal LLaMA (LLaMA for the Chinese legal domain)
Qwen3 is the large language model series developed by the Qwen team at Alibaba Cloud.
Fast and memory-efficient exact attention
Trinity-RFT is a general-purpose, flexible, and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (LLMs).
[ICLR 2025] Released code for paper "Spurious Forgetting in Continual Learning of Language Models"
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
A book for Learning the Foundations of LLMs
Source code for a LoRA-based continual relation extraction method.
Continual Learning for Transformers that allows training on multiple tasks sequentially while preserving knowledge from earlier tasks using Elastic Weight Consolidation.
PyContinual (An Easy and Extendible Framework for Continual Learning)
Code for the paper "Evaluating Large Language Models Trained on Code"
Official Repository of "Learning to Reason under Off-Policy Guidance"
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. Train a medical LLM, implementing continued pretraining (PT), supervised fine-tuning (SFT), RLHF, DPO, ORPO, and GRPO.
A streamlined and customizable framework for efficient large model (LLM, VLM, AIGC) evaluation and performance benchmarking.
A high-throughput and memory-efficient inference and serving engine for LLMs
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Sky-T1: Train your own O1 preview model within $450
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5, DeepSeek-R1, GLM-5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, Phi4, ...)…
verl: Volcano Engine Reinforcement Learning for LLMs
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMA2, Qwen, GLM, Claude, etc.) over 100+ datasets.