fangyuan-ksgk

Researching on MARL

Fangyuan Yu fangyuan-ksgk

Researching on MARL

AI Researcher (LLM) @thoughtworks || PhD KAUST

51 followers · 202 following

Thoughtworks
Singapore
https://cemse.kaust.edu.sa/people/person/fangyuan-yu

Achievements

mod_gpt Public

Modified GPT model pre-training for GPU poor

Jupyter Notebook MIT License Updated Dec 19, 2025
JiT Public
Forked from LTH14/JiT

PyTorch implementation of JiT https://arxiv.org/abs/2511.13720

Python MIT License Updated Nov 18, 2025
Metaworld Public
Forked from Farama-Foundation/Metaworld

Collections of robotics environments geared towards benchmarking multi-task and meta reinforcement learning

Python MIT License Updated Nov 10, 2025
vlm-gym Public
Forked from sdan/vlm-gym

RL gym for vision language models in JAX

Python Apache License 2.0 Updated Oct 30, 2025
TextArena Public
Forked from LeonGuertler/TextArena

A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning

Python MIT License Updated Oct 29, 2025
minimind Public
Forked from jingyaogong/minimind

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT！🌏 Train a 26M-parameter GPT from scratch in just 2h!

Jupyter Notebook Apache License 2.0 Updated Oct 8, 2025
es-fine-tuning-paper Public
Forked from VsonicV/es-fine-tuning-paper

This repo contains the source code for the paper "Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning"

Python Other Updated Oct 7, 2025
TinyRecursiveModels Public
Forked from SamsungSAILMontreal/TinyRecursiveModels

A fork for TRM

Python MIT License Updated Oct 7, 2025
MiniLive Public
Forked from NVlabs/LongLive

LongLive: Real-time Interactive Long Video Generation

Python Other Updated Oct 2, 2025
bdh Public
Forked from pathwaycom/bdh

Baby Dragon Hatchling (BDH) – Architecture and Code

Python MIT License Updated Oct 1, 2025
MobileLLM-R1 Public
Forked from facebookresearch/MobileLLM-R1

MobileLLM-R1

Python Other Updated Sep 30, 2025
tinyworlds Public
Forked from AlmondGod/tinyworlds

A minimal implementation of DeepMind's Genie world model

Python Updated Sep 28, 2025
abstraction-learning Public

Experiment for abstraction learning

Python 1 Updated Sep 24, 2025
RL-Factory Public
Forked from Simple-Efficient/RL-Factory

Train your Agent model via our easy and efficient framework

Python Apache License 2.0 Updated Sep 11, 2025
HRM Public
Forked from sapientinc/HRM

Hierarchical Reasoning Model Official Release

Python Apache License 2.0 Updated Sep 9, 2025
marin Public
Forked from marin-community/marin

HTML Apache License 2.0 Updated Sep 6, 2025
minFM Public
Forked from Kai-46/minFM

HTML Other Updated Aug 10, 2025
fangyuan-ksgk Public

github page

Updated Jul 18, 2025
search_algo Public

understanding search algorithm

Updated Jun 4, 2025
Diffusion-Explorer Public
Forked from helblazer811/Diffusion-Explorer

Interactive visualizations of the theory behind diffusion models.

Svelte Updated May 17, 2025
ColossalAI Public
Forked from hpcaitech/ColossalAI

Making large AI models cheaper, faster and more accessible

Python Apache License 2.0 Updated May 14, 2025
MoSA Public
Forked from piotrpiekos/MoSA

User-friendly implementation of the Mixture-of-Sparse-Attention (MoSA). MoSA selects distinct tokens for each head with expert choice routing providing a content-based sparse attention mechanism.

Python MIT License Updated May 3, 2025
Orpheus-TTS Public
Forked from canopyai/Orpheus-TTS

Towards Human-Sounding Speech

Python Apache License 2.0 Updated Apr 16, 2025
hogwild_llm Public
Forked from eqimp/hogwild_llm

Python Apache License 2.0 Updated Apr 9, 2025
ttt-video-dit Public
Forked from test-time-training/ttt-video-dit

Python Updated Apr 8, 2025
unidisc Public
Forked from alexanderswerdlow/unidisc

UniDisc: A discrete diffusion model for joint multimodal generation, enabling controllable and efficient text-image synthesis, editing, and inpainting.

Python Updated Apr 2, 2025
Agent-S Public
Forked from simular-ai/Agent-S

Agent S: an open agentic framework that uses computers like a human

Python Apache License 2.0 Updated Apr 2, 2025
yet_another_regularizer Public

Experiment on rank regularization

MIT License Updated Mar 31, 2025
yet_another_optimizer Public

Yet another optimizer

Python MIT License Updated Mar 29, 2025
CharacterLM Public

vocabulary curriculum + LLM

Jupyter Notebook 2 Apache License 2.0 Updated Mar 26, 2025

Fangyuan Yu fangyuan-ksgk

Achievements

Achievements

mod_gpt Public

Uh oh!

JiT Public

Uh oh!

Metaworld Public

Uh oh!

vlm-gym Public

Uh oh!

TextArena Public

Uh oh!

minimind Public

Uh oh!

es-fine-tuning-paper Public

Uh oh!

TinyRecursiveModels Public

Uh oh!

MiniLive Public

Uh oh!

bdh Public

Uh oh!

MobileLLM-R1 Public

Uh oh!

tinyworlds Public

Uh oh!

abstraction-learning Public

Uh oh!

RL-Factory Public

Uh oh!

HRM Public

Uh oh!

marin Public

Uh oh!

minFM Public

Uh oh!

fangyuan-ksgk Public

Uh oh!

search_algo Public

Uh oh!

Diffusion-Explorer Public

Uh oh!

ColossalAI Public

Uh oh!

MoSA Public

Uh oh!

Orpheus-TTS Public

Uh oh!

hogwild_llm Public

Uh oh!

ttt-video-dit Public

Uh oh!

unidisc Public

Uh oh!

Agent-S Public

Uh oh!

yet_another_regularizer Public

Uh oh!

yet_another_optimizer Public

Uh oh!

CharacterLM Public

Uh oh!