Skip to content
View asukaqaq-s's full-sized avatar

Block or report asukaqaq-s

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

(NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis

Python 1,292 70 Updated Mar 5, 2025

Official implementation of "OmniForcing: Unleashing Real-time Joint Audio-Visual Generation"[arXiv:2603.11647]. OmniForcing is the first framework to distill bidirectional audio-visual diffusion mo…

Python 129 Updated Mar 29, 2026

Cosmos-Transfer1 is a world-to-world transfer model designed to bridge the perceptual divide between simulated and real-world environments.

Python 791 102 Updated Jan 6, 2026

Inferix: A Block-Diffusion based Next-Generation Inference Engine for World Simulation

Python 124 5 Updated Feb 27, 2026

A unified inference and post-training framework for accelerated video generation.

Python 3,355 311 Updated Apr 9, 2026

DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.

Python 2,009 149 Updated Dec 6, 2024

A curated list of Diffusion Model in RL resources (continually updated)

1,571 73 Updated Dec 15, 2025

RLinf: Reinforcement Learning Infrastructure for Embodied and Agentic AI

Python 3,028 392 Updated Apr 9, 2026

OpenClaw-RL: Train any agent simply by talking

Python 4,758 492 Updated Apr 8, 2026

A Curated List of Awesome Works in World Modeling, Aiming to Serve as a One-stop Resource for Researchers, Practitioners, and Enthusiasts Interested in World Modeling.

2,492 109 Updated Apr 8, 2026

Distributed DataLoader For Pytorch Based On Ray

Python 25 2 Updated Nov 5, 2021

RLLaVA is a user-friendly framework for multi-modal RL research and optimized for resource-constrained teams.

Python 58 6 Updated Mar 18, 2026

HY-World 1.5: A Systematic Framework for Interactive World Modeling with Real-Time Latency and Geometric Consistency

Python 1,389 122 Updated Mar 24, 2026

Matrix-Game 3.0: Real-Time and Streaming Interactive World Model with Long-Horizon Memory

Python 2,078 225 Updated Mar 30, 2026

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 6,315 856 Updated Mar 22, 2026

Curated systems, benchmarks, and papers etc. on memory for LLMs/MLLMs --- long-term context, retrieval, and reasoning.

331 17 Updated Apr 8, 2026

Light Image Video Generation Inference Framework

Python 2,149 183 Updated Apr 9, 2026

A framework for efficient model inference with omni-modality models

Python 4,238 721 Updated Apr 9, 2026

collection of diffusion model papers categorized by their subareas

2,186 100 Updated Mar 16, 2026

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 2,996 474 Updated Apr 9, 2026

verl: Volcano Engine Reinforcement Learning for LLMs

Python 20,532 3,610 Updated Apr 9, 2026

Paper reading and discussion notes, covering AI frameworks, distributed systems, cluster management, etc.

58 1 Updated Mar 4, 2026

Injecting Adrenaline into LLM Serving: Boosting Resource Utilization and Throughput via Attention Disaggregation

Python 40 Updated Mar 30, 2026

[ICLR 2025] The official pytorch implement of "Dynamic-LLaVA: Efficient Multimodal Large Language Models via Dynamic Vision-language Context Sparsification".

Python 71 2 Updated Sep 18, 2025

PyTorch implementations of `BatchSampler` that under/over sample according to a chosen parameter alpha, in order to create a balanced training distribution.

Python 86 10 Updated Oct 25, 2019

AIInfra(AI 基础设施)指AI系统从底层芯片等硬件,到上层软件栈支持AI大模型训练和推理。

Jupyter Notebook 6,654 874 Updated Dec 22, 2025

High-performance data engine for AI and multimodal workloads. Process images, audio, video, and structured data at any scale

Rust 5,403 439 Updated Apr 9, 2026

A tensor-aware point-to-point communication primitive for machine learning

C++ 286 80 Updated Dec 17, 2025

Build multimodal data processing pipelines with Azure AI Services + LLMs

Jupyter Notebook 143 65 Updated Apr 15, 2025
Next