Researching MARL


Collections of robotics environments geared towards benchmarking multi-task and meta reinforcement learning

Python 1,675 326 Updated Nov 10, 2025

PyTorch implementation of JiT https://arxiv.org/abs/2511.13720

Python 1,815 107 Updated Dec 8, 2025
Python 749 64 Updated Dec 9, 2025

A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning

Python 330 77 Updated Oct 29, 2025

Open-source evaluation toolkit for large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks

Python 3,555 593 Updated Dec 17, 2025

RL gym for vision language models written in JAX

Python 135 13 Updated Oct 30, 2025

This repo contains the source code for the paper "Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning"

Python 277 27 Updated Nov 24, 2025

MobileLLM-R1

Python 69 13 Updated Sep 30, 2025

Experiment for abstraction learning

Python 1 Updated Sep 24, 2025

Train your Agent model via our easy and efficient framework

Python 1,664 156 Updated Dec 5, 2025

Open-source framework for the research and development of foundation models.

HTML 669 67 Updated Dec 19, 2025

Hierarchical Reasoning Model Official Release

Python 12,158 1,779 Updated Sep 9, 2025

KernelBench: Can LLMs Write GPU Kernels? - Benchmark + Toolkit with Torch -> CUDA (+ more DSLs)

Jupyter Notebook 718 101 Updated Dec 19, 2025

SkyReels-V2: Infinite-length Film Generative model

Python 5,207 846 Updated Aug 11, 2025
HTML 167 9 Updated Oct 27, 2025

Interactive visualizations of the geometric intuition behind diffusion models.

Svelte 917 44 Updated Dec 18, 2025

User-friendly implementation of the Mixture-of-Sparse-Attention (MoSA). MoSA selects distinct tokens for each head with expert choice routing providing a content-based sparse attention mechanism.

Python 28 4 Updated May 3, 2025

Making large AI models cheaper, faster and more accessible

Python 41,297 4,545 Updated Dec 8, 2025

Let's make video diffusion practical!

Python 16,357 1,592 Updated Oct 16, 2025

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 25,814 1,811 Updated Oct 13, 2025

[ICML2025] SpargeAttention: A training-free sparse attention that accelerates any model inference.

Cuda 838 71 Updated Dec 17, 2025

Towards Human-Sounding Speech

Python 5,818 501 Updated Dec 5, 2025
Python 131 12 Updated Dec 23, 2024

Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache

Python 136 8 Updated Aug 13, 2025

Official PyTorch implementation of One-Minute Video Generation with Test-Time Training

Python 2,319 192 Updated Jun 5, 2025

UniDisc: A discrete diffusion model for joint multimodal generation, enabling controllable and efficient text-image synthesis, editing, and inpainting.

Python 133 5 Updated Apr 2, 2025

Reproduction of DreamerV1 and DreamerV2 in PyTorch for the DeepMind Control Suite

Python 46 12 Updated Dec 27, 2022

Agent S: an open agentic framework that uses computers like a human

Python 8,888 995 Updated Dec 16, 2025