-
AReaL Public
Forked from inclusionAI/AReaLDistributed RL System for LLM Reasoning
Python Apache License 2.0 UpdatedSep 15, 2025 -
RLinf Public
Forked from RLinf/RLinfRLinf is a flexible and scalable open-source infrastructure designed for post-training foundation models (LLMs, VLMs, VLAs) via reinforcement learning.
Python Apache License 2.0 UpdatedSep 4, 2025 -
siiRL Public
Forked from sii-research/siiRLsiiRL: Shanghai Innovation Institute RL Framework for Advanced LLMs and Multi-Agent Systems
Python Apache License 2.0 UpdatedSep 3, 2025 -
Long-RL Public
Forked from NVlabs/Long-RLLong-RL: Scaling RL to Long Sequences
Python Apache License 2.0 UpdatedJul 22, 2025 -
ms-swift Public
Forked from modelscope/ms-swiftUse PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, InternVL3, Llava, GLM4…
Python Apache License 2.0 UpdatedJul 18, 2025 -
faiss Public
Forked from facebookresearch/faissA library for efficient similarity search and clustering of dense vectors.
C++ MIT License UpdatedJul 7, 2025 -
SimpleVLA-RL Public
Forked from PRIME-RL/SimpleVLA-RLOnline RL with Simple Reward Enables Training VLA Models with Only One Trajectory
Python MIT License UpdatedJun 20, 2025 -
verl Public
Forked from volcengine/verlverl: Volcano Engine Reinforcement Learning for LLMs
Python Apache License 2.0 UpdatedMay 21, 2025 -
Pai-Megatron-Patch Public
Forked from alibaba/Pai-Megatron-PatchThe official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.
Python Apache License 2.0 UpdatedMay 12, 2025 -
Megatron-LM Public
Forked from NVIDIA/Megatron-LMOngoing research training transformer models at scale
Python Other UpdatedMay 12, 2025 -
openvla-oft Public
Forked from moojink/openvla-oftFine-Tuning Vision-Language-Action Models: Optimizing Speed and Success
Python MIT License UpdatedApr 28, 2025 -
flux Public
Forked from bytedance/fluxA fast communication-overlapping library for tensor/expert parallelism on GPUs.
C++ Apache License 2.0 UpdatedApr 11, 2025 -
LIBERO Public
Forked from Lifelong-Robot-Learning/LIBEROBenchmarking Knowledge Transfer in Lifelong Robot Learning
Jupyter Notebook MIT License UpdatedMar 15, 2025 -
DeepEP Public
Forked from deepseek-ai/DeepEPDeepEP: an efficient expert-parallel communication library
Cuda MIT License UpdatedMar 6, 2025 -
-
transformers-openvla-oft Public
Forked from moojink/transformers-openvla-oft🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Python Apache License 2.0 UpdatedFeb 25, 2025 -
TransformerEngine Public
Forked from NVIDIA/TransformerEngineA library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilizatio…
Python Apache License 2.0 UpdatedJan 16, 2025 -
dlimp_openvla Public
Forked from moojink/dlimp_openvladataloading is my passion
Python UpdatedJul 12, 2024 -
datasets Public
Forked from huggingface/datasets🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Python Apache License 2.0 UpdatedMar 11, 2022 -
ChineseDiachronicCorpus Public
Forked from yanshanjing/ChineseDiachronicCorpusChineseDiachronicCorpus,中文历时语料库,横跨六十余年,包括腾讯历时新闻2000-2016,人民日报历时语料1946-2003,参考消息历时语料1957-2002。基于历时流通语料库,可用于历时语言变化计算、语言监测、社会文化变迁研究提供基础性的语料支持。
UpdatedJan 10, 2021