Skip to content
View fangyuan-ksgk's full-sized avatar
:electron:
Researching on MARL
:electron:
Researching on MARL

Block or report fangyuan-ksgk

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[CVPR 2026🔥] 🧑‍🎨 OmniLottie, an open-sourced multi-modal instructed vector animation generator that produces Lottie JSONs.

Python 601 35 Updated Mar 20, 2026

Official Codebase for "DreamDojo: A Generalist Robot World Model from Large-Scale Human Videos"

Python 682 46 Updated Mar 21, 2026

This is a repository for reinforcement learning implementation for Unitree robots, based on Mujoco.

C++ 285 58 Updated Apr 3, 2026

Evaluating long-term memory of reinforcement learning algorithms

Python 169 20 Updated Jun 23, 2023

Simple language-driven navigation tasks for studying compositional learning

204 26 Updated Nov 5, 2020

An interface library for RL post training with environments.

Python 1,516 282 Updated Apr 2, 2026

We introduce BabyVision, a benchmark revealing the infancy of AI vision.

Python 205 7 Updated Jan 13, 2026

Collections of robotics environments geared towards benchmarking multi-task and meta reinforcement learning

Python 1,779 340 Updated Jan 20, 2026

PyTorch implementation of JiT https://arxiv.org/abs/2511.13720

Python 2,229 154 Updated Dec 8, 2025
Python 998 85 Updated Jan 25, 2026

A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning

Python 369 87 Updated Mar 28, 2026

Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks

Python 3,993 666 Updated Mar 27, 2026

RL gym for vision language models written in JAX

Python 145 14 Updated Oct 30, 2025

This repo contains the source code for the paper "Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning"

Python 341 36 Updated Feb 18, 2026

MobileLLM-R1

Python 78 12 Updated Sep 30, 2025

Experiment for abstraction learning

Python 1 Updated Sep 24, 2025

Train your Agent model via our easy and efficient framework

Python 1,725 163 Updated Dec 5, 2025

Open-source framework for the research and development of foundation models.

Python 829 102 Updated Apr 3, 2026

Hierarchical Reasoning Model Official Release

Python 12,373 1,806 Updated Mar 31, 2026

KernelBench: Can LLMs Write GPU Kernels? - Benchmark + Toolkit with Torch -> CUDA (+ more DSLs)

Jupyter Notebook 899 150 Updated Mar 24, 2026

SkyReels-V2: Infinite-length Film Generative model

Python 6,692 1,398 Updated Jan 29, 2026
HTML 173 9 Updated Oct 27, 2025

Interactive visualizations of the geometric intuition behind diffusion models.

JavaScript 1,084 51 Updated Jan 31, 2026

User-friendly implementation of the Mixture-of-Sparse-Attention (MoSA). MoSA selects distinct tokens for each head with expert choice routing providing a content-based sparse attention mechanism.

Python 28 4 Updated May 3, 2025

Making large AI models cheaper, faster and more accessible

Python 41,371 4,521 Updated Mar 30, 2026

Lets make video diffusion practical!

Python 16,715 1,651 Updated Oct 16, 2025

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 27,031 1,963 Updated Jan 9, 2026

[ICML2025] SpargeAttention: A training-free sparse attention that accelerates any model inference.

Cuda 971 90 Updated Feb 25, 2026

Towards Human-Sounding Speech

Python 6,051 517 Updated Dec 5, 2025
Next