-
Thoughtworks
- Singapore
- https://cemse.kaust.edu.sa/people/person/fangyuan-yu
-
mod_gpt Public
Modified GPT model pre-training for GPU poor
Jupyter Notebook MIT License UpdatedDec 19, 2025 -
PyTorch implementation of JiT https://arxiv.org/abs/2511.13720
Python MIT License UpdatedNov 18, 2025 -
Metaworld Public
Forked from Farama-Foundation/MetaworldCollections of robotics environments geared towards benchmarking multi-task and meta reinforcement learning
Python MIT License UpdatedNov 10, 2025 -
vlm-gym Public
Forked from sdan/vlm-gymRL gym for vision language models in JAX
Python Apache License 2.0 UpdatedOct 30, 2025 -
TextArena Public
Forked from LeonGuertler/TextArenaA Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning
Python MIT License UpdatedOct 29, 2025 -
minimind Public
Forked from jingyaogong/minimind🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!
Jupyter Notebook Apache License 2.0 UpdatedOct 8, 2025 -
es-fine-tuning-paper Public
Forked from VsonicV/es-fine-tuning-paperThis repo contains the source code for the paper "Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning"
Python Other UpdatedOct 7, 2025 -
TinyRecursiveModels Public
Forked from SamsungSAILMontreal/TinyRecursiveModelsA fork for TRM
Python MIT License UpdatedOct 7, 2025 -
MiniLive Public
Forked from NVlabs/LongLiveLongLive: Real-time Interactive Long Video Generation
Python Other UpdatedOct 2, 2025 -
bdh Public
Forked from pathwaycom/bdhBaby Dragon Hatchling (BDH) – Architecture and Code
Python MIT License UpdatedOct 1, 2025 -
MobileLLM-R1 Public
Forked from facebookresearch/MobileLLM-R1MobileLLM-R1
Python Other UpdatedSep 30, 2025 -
tinyworlds Public
Forked from AlmondGod/tinyworldsA minimal implementation of DeepMind's Genie world model
Python UpdatedSep 28, 2025 -
-
RL-Factory Public
Forked from Simple-Efficient/RL-FactoryTrain your Agent model via our easy and efficient framework
Python Apache License 2.0 UpdatedSep 11, 2025 -
HRM Public
Forked from sapientinc/HRMHierarchical Reasoning Model Official Release
Python Apache License 2.0 UpdatedSep 9, 2025 -
-
-
-
-
Diffusion-Explorer Public
Forked from helblazer811/Diffusion-ExplorerInteractive visualizations of the theory behind diffusion models.
Svelte UpdatedMay 17, 2025 -
ColossalAI Public
Forked from hpcaitech/ColossalAIMaking large AI models cheaper, faster and more accessible
Python Apache License 2.0 UpdatedMay 14, 2025 -
MoSA Public
Forked from piotrpiekos/MoSAUser-friendly implementation of the Mixture-of-Sparse-Attention (MoSA). MoSA selects distinct tokens for each head with expert choice routing providing a content-based sparse attention mechanism.
Python MIT License UpdatedMay 3, 2025 -
Orpheus-TTS Public
Forked from canopyai/Orpheus-TTSTowards Human-Sounding Speech
Python Apache License 2.0 UpdatedApr 16, 2025 -
-
-
unidisc Public
Forked from alexanderswerdlow/unidiscUniDisc: A discrete diffusion model for joint multimodal generation, enabling controllable and efficient text-image synthesis, editing, and inpainting.
Python UpdatedApr 2, 2025 -
Agent-S Public
Forked from simular-ai/Agent-SAgent S: an open agentic framework that uses computers like a human
Python Apache License 2.0 UpdatedApr 2, 2025 -
-
-
CharacterLM Public
vocabulary curriculum + LLM