-
xtuner Public
Forked from InternLM/xtuner
XTuner is a toolkit for efficiently fine-tuning LLM
-
transformers Public
Forked from huggingface/transformers
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
-
verl Public
Forked from volcengine/verl
verl: Volcano Engine Reinforcement Learning for LLMs
Python Apache License 2.0 Updated Dec 3, 2025
-
sglang Public
Forked from sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
Python Apache License 2.0 Updated Oct 30, 2025
-
Intern-S1 Public
Forked from InternLM/Intern-S1
A Scientific Multimodal Foundation Model
Apache License 2.0 Updated Sep 30, 2025
-
slime Public
Forked from THUDM/slime
slime is an LLM post-training framework aiming at scaling RL.
Python Apache License 2.0 Updated Sep 26, 2025
-
LLaMA-Factory Public
Forked from hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Python Apache License 2.0 Updated Aug 20, 2025
-
ms-swift Public
Forked from modelscope/ms-swift
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, InternVL3, Ovis2.5, L…
Python Apache License 2.0 Updated Aug 19, 2025
-
MHA2MLA Public
Forked from JT-Ushio/MHA2MLA
Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-based LLMs
Python Apache License 2.0 Updated May 17, 2025
-
torchtitan Public
Forked from pytorch/torchtitan
A native PyTorch Library for large model training
Python BSD 3-Clause "New" or "Revised" License Updated Apr 1, 2025
-
DeepSpeedExamples Public
Forked from deepspeedai/DeepSpeedExamples
Example models using DeepSpeed
-
DeepSpeed Public
Forked from deepspeedai/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
-
Liger-Kernel Public
Forked from linkedin/Liger-Kernel
Efficient Triton Kernels for LLM Training
Python BSD 2-Clause "Simplified" License Updated Dec 26, 2024
-
Janus Public
Forked from deepseek-ai/Janus
Janus-Series: Unified Multimodal Understanding and Generation Models
Python MIT License Updated Dec 13, 2024
-
long-context-attention Public
Forked from feifeibear/long-context-attention
USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference
Python Apache License 2.0 Updated Oct 31, 2024
-
VLMEvalKit Public
Forked from open-compass/VLMEvalKit
Open-source evaluation toolkit of large vision-language models (LVLMs), support ~100 VLMs, 40+ benchmarks
-
trl Public
Forked from huggingface/trl
Train transformer language models with reinforcement learning.
Python Apache License 2.0 Updated Sep 26, 2024
-
InternVL Public
Forked from OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance.
Python MIT License Updated Sep 10, 2024
-
ring-flash-attention Public
Forked from zhuzilin/ring-flash-attention
Ring attention implementation with flash attention
Python Updated Sep 6, 2024
-
torchgpipe Public
Forked from kakaobrain/torchgpipe
A GPipe implementation in PyTorch
Python BSD 3-Clause "New" or "Revised" License Updated Aug 27, 2024
-
llama3 Public
Forked from meta-llama/llama3
The official Meta Llama 3 GitHub site
-
lvlm-interpret Public
Forked from josephtey/lvlm-interpret
Python Apache License 2.0 Updated Jun 12, 2024
-
lmms-eval Public
Forked from EvolvingLMMs-Lab/lmms-eval
Accelerating the development of large multimodal models (LMMs) with lmms-eval
Python Updated Mar 22, 2024
-
mmdetection Public
Forked from open-mmlab/mmdetection
OpenMMLab Detection Toolbox and Benchmark
-
llm-course Public
Forked from mlabonne/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
-
LISA Public
Forked from dvlab-research/LISA
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
Python Apache License 2.0 Updated Feb 1, 2024
-
LLaVA Public
Forked from haotian-liu/LLaVA
Visual Instruction Tuning: Large Language-and-Vision Assistant built towards multimodal GPT-4 level capabilities.
Python Apache License 2.0 Updated Feb 1, 2024