Stars
A repository of reinforcement learning implementations for Unitree robots, based on MuJoCo.
Evaluating long-term memory of reinforcement learning algorithms
Simple language-driven navigation tasks for studying compositional learning
An interface library for RL post-training with environments.
We introduce BabyVision, a benchmark revealing the infancy of AI vision.
A collection of robotics environments geared toward benchmarking multi-task and meta-reinforcement learning
PyTorch implementation of JiT https://arxiv.org/abs/2511.13720
A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning
Open-source evaluation toolkit for large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks
This repo contains the source code for the paper "Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning"
Experiments in abstraction learning
Train your agent model with our easy and efficient framework
Open-source framework for the research and development of foundation models.
Hierarchical Reasoning Model Official Release
KernelBench: Can LLMs Write GPU Kernels? - Benchmark + Toolkit with Torch -> CUDA (+ more DSLs)
SkyReels-V2: Infinite-Length Film Generative Model
Interactive visualizations of the geometric intuition behind diffusion models.
User-friendly implementation of the Mixture-of-Sparse-Attention (MoSA). MoSA selects distinct tokens for each head with expert-choice routing, providing a content-based sparse attention mechanism (see the sketch after this list).
Making large AI models cheaper, faster and more accessible
Let's make video diffusion practical!
Qwen3 is the large language model series developed by the Qwen team at Alibaba Cloud.
[ICML2025] SpargeAttention: A training-free sparse attention that accelerates any model inference.
Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache
Official PyTorch implementation of One-Minute Video Generation with Test-Time Training
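The MoSA entry above describes a concrete mechanism worth unpacking: each attention head has a router that scores all tokens, and the head attends only over its own top-k selection (expert choice: the head picks tokens, rather than tokens picking heads). Below is a minimal single-head PyTorch sketch of that idea. The function names, the sigmoid gating on router scores, and the zero-fill scatter are illustrative assumptions, not the repository's actual implementation, which also handles batching, causal masking, and multiple heads.

```python
# A minimal sketch of expert-choice token routing for one sparse attention
# head, in the spirit of the MoSA description above. Names (`router`,
# `top_k`) and the gating/scatter details are assumptions for illustration.
import torch
import torch.nn as nn


def expert_choice_head(x, q_proj, k_proj, v_proj, o_proj, router, top_k):
    """x: (seq, dim). The router scores every token; the head keeps its
    top_k tokens, runs dense attention among just those tokens, and
    scatters the result back to the full sequence (zeros elsewhere)."""
    seq, dim = x.shape
    scores = router(x).squeeze(-1)            # (seq,) content-based token scores
    gate, idx = scores.topk(top_k)            # head selects its own top_k tokens
    sel = x[idx]                              # (top_k, dim) selected tokens
    q, k, v = q_proj(sel), k_proj(sel), v_proj(sel)
    attn = torch.softmax(q @ k.T / dim ** 0.5, dim=-1)  # (top_k, top_k)
    out_sel = o_proj(attn @ v) * torch.sigmoid(gate).unsqueeze(-1)
    out = x.new_zeros(seq, dim)
    out[idx] = out_sel                        # scatter back into the sequence
    return out


if __name__ == "__main__":
    dim, seq, top_k = 64, 128, 16
    x = torch.randn(seq, dim)
    make = lambda: nn.Linear(dim, dim, bias=False)
    y = expert_choice_head(x, make(), make(), make(), make(),
                           nn.Linear(dim, 1), top_k)
    print(y.shape)  # torch.Size([128, 64])
```

Because each head attends over a fixed top_k rather than the full sequence, per-head attention cost drops from O(seq²) to O(top_k²), and the selection depends on token content via the router scores, which is the content-based sparsity the description refers to.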