cyst219

cyst219

Stars

deepspeedai / DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 41,037 4,669 Updated Dec 18, 2025

ZJU-LLMs / Foundations-of-LLMs

A book for Learning the Foundations of LLMs

14,679 1,345 Updated Dec 12, 2025

Raincleared-Song / ConPET

Source code for a LoRA-based continual relation extraction method.

Python 14 2 Updated Sep 25, 2023

chandraprvkvsh / Continual-Learning-for-Transformers

Continual Learning for Transformers that allows training on multiple tasks sequentially while preserving knowledge from earlier tasks using Elastic Weight Consolidation.

Python 17 Updated Aug 8, 2025

huggingface / Math-Verify

Python 1,040 49 Updated Jul 2, 2025

ZixuanKe / PyContinual

PyContinual (An Easy and Extendible Framework for Continual Learning)

Python 323 68 Updated Jan 29, 2024

openai / human-eval

Code for the paper "Evaluating Large Language Models Trained on Code"

Python 3,055 426 Updated Jan 17, 2025

ElliottYan / LUFFY

Official Repository of "Learning to Reason under Off-Policy Guidance"

Python 393 48 Updated Oct 4, 2025

shibing624 / MedicalGPT

MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型，实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO、GRPO。

Python 4,461 648 Updated Aug 30, 2025

wlll123456 / study_rlhf

Jupyter Notebook 53 3 Updated Jul 24, 2025

yeongpin / cursor-free-vip

[Support 0.49.x]（Reset Cursor AI MachineID & Bypass Higher Token Limit） Cursor Ai ，自动重置机器ID ，免费升级使用Pro功能: You've reached your trial request limit. / Too many free trial accounts used on this machi…

Python 46,134 5,535 Updated Dec 2, 2025

TapXWorld / ChinaTextbook

所有小初高、大学PDF教材。

Roff 63,040 13,975 Updated Oct 18, 2025

modelscope / evalscope

A streamlined and customizable framework for efficient large model (LLM, VLM, AIGC) evaluation and performance benchmarking.

Python 2,135 243 Updated Dec 18, 2025

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 65,769 12,066 Updated Dec 19, 2025

hiyouga / LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 64,226 7,787 Updated Dec 19, 2025

NovaSky-AI / SkyThought

Sky-T1: Train your own O1 preview model within $450

Python 3,361 341 Updated Jul 12, 2025

modelscope / ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …

Python 11,748 1,070 Updated Dec 19, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,625 2,853 Updated Dec 19, 2025

open-compass / opencompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Python 6,441 705 Updated Dec 17, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly