User-friendly implementation of Mixture-of-Sparse-Attention (MoSA). MoSA selects distinct tokens for each head using expert-choice routing, providing a content-based sparse attention mechanism.

Python 28 4 Updated May 3, 2025

hpcaitech / ColossalAI

Making large AI models cheaper, faster and more accessible

Python 41,297 4,545 Updated Dec 8, 2025

lllyasviel / FramePack

Let's make video diffusion practical!

Python 16,357 1,592 Updated Oct 16, 2025

QwenLM / Qwen3

Qwen3 is the large language model series developed by the Qwen team, Alibaba Cloud.

Python 25,814 1,811 Updated Oct 13, 2025

thu-ml / SpargeAttn

[ICML2025] SpargeAttention: A training-free sparse attention that accelerates any model inference.

Cuda 838 71 Updated Dec 17, 2025

canopyai / Orpheus-TTS

Towards Human-Sounding Speech

Python 5,818 501 Updated Dec 5, 2025

jerber / lang-jepa

Python 131 12 Updated Dec 23, 2024

eqimp / hogwild_llm

Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache

Python 136 8 Updated Aug 13, 2025

test-time-training / ttt-video-dit

Official PyTorch implementation of One-Minute Video Generation with Test-Time Training

Python 2,319 192 Updated Jun 5, 2025

alexanderswerdlow / unidisc

UniDisc: A discrete diffusion model for joint multimodal generation, enabling controllable and efficient text-image synthesis, editing, and inpainting.

Python 133 5 Updated Apr 2, 2025

adityabingi / Dreamer

Reproduction of Dreamer v1 and v2 in PyTorch for the DeepMind Control Suite

Python 46 12 Updated Dec 27, 2022

simular-ai / Agent-S

Agent S: an open agentic framework that uses computers like a human

Python 8,888 995 Updated Dec 16, 2025

lamm-mit / SciAgentsDiscovery

Python 573 100 Updated May 10, 2025

Fangyuan Yu fangyuan-ksgk

Stars

Farama-Foundation / Metaworld

LTH14 / JiT

rbalestr-lab / lejepa

LeonGuertler / TextArena

open-compass / VLMEvalKit

sdan / vlm-gym

VsonicV / es-fine-tuning-paper

facebookresearch / MobileLLM-R1

fangyuan-ksgk / abstraction-learning

Simple-Efficient / RL-Factory

marin-community / marin

sapientinc / HRM

ScalingIntelligence / KernelBench

SkyworkAI / SkyReels-V2

Kai-46 / minFM

Multiverse4FM / Multiverse

helblazer811 / Diffusion-Explorer

piotrpiekos / MoSA