A unified inference and post-training framework for accelerated video generation.

Python 3,326 303 Updated Mar 29, 2026

DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.

Python 1,996 146 Updated Dec 6, 2024

A curated list of Diffusion Model in RL resources (continually updated)

1,559 73 Updated Dec 15, 2025

RLinf: Reinforcement Learning Infrastructure for Embodied and Agentic AI

Python 2,923 373 Updated Mar 28, 2026

OpenClaw-RL: Train any agent simply by talking

Python 4,369 433 Updated Mar 28, 2026

A Curated List of Awesome Works in World Modeling, Aiming to Serve as a One-stop Resource for Researchers, Practitioners, and Enthusiasts Interested in World Modeling.

2,381 102 Updated Mar 25, 2026

Distributed DataLoader for PyTorch based on Ray

Python 25 2 Updated Nov 5, 2021

RLLaVA is a user-friendly framework for multi-modal RL research and optimized for resource-constrained teams.

Python 58 6 Updated Mar 18, 2026

HY-World 1.5: A Systematic Framework for Interactive World Modeling with Real-Time Latency and Geometric Consistency

Python 1,348 118 Updated Mar 24, 2026

Matrix-Game 3.0: Real-Time and Streaming Interactive World Model with Long-Horizon Memory

Python 1,948 207 Updated Mar 28, 2026

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 6,293 846 Updated Mar 22, 2026

Curated systems, benchmarks, and papers etc. on memory for LLMs/MLLMs --- long-term context, retrieval, and reasoning.

313 13 Updated Mar 26, 2026

Light Image Video Generation Inference Framework

Python 2,111 172 Updated Mar 27, 2026

A framework for efficient model inference with omni-modality models

Python 3,928 637 Updated Mar 29, 2026

collection of diffusion model papers categorized by their subareas

2,173 99 Updated Mar 16, 2026

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 2,945 455 Updated Mar 27, 2026

verl: Volcano Engine Reinforcement Learning for LLMs

Python 20,284 3,529 Updated Mar 28, 2026

Paper reading and discussion notes, covering AI frameworks, distributed systems, cluster management, etc.

57 1 Updated Mar 4, 2026

Injecting Adrenaline into LLM Serving: Boosting Resource Utilization and Throughput via Attention Disaggregation

Python 40 Updated Mar 23, 2026

[ICLR 2025] The official PyTorch implementation of "Dynamic-LLaVA: Efficient Multimodal Large Language Models via Dynamic Vision-language Context Sparsification".

Python 72 1 Updated Sep 18, 2025

PyTorch implementations of `BatchSampler` that under/over sample according to a chosen parameter alpha, in order to create a balanced training distribution.

Python 86 10 Updated Oct 25, 2019

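AIInfra (AI infrastructure) refers to the AI system stack, from underlying hardware such as chips up to the software stack above it, that supports training and inference of large AI models.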

Jupyter Notebook 6,545 862 Updated Dec 22, 2025

High-performance data engine for AI and multimodal workloads. Process images, audio, video, and structured data at any scale

Rust 5,353 428 Updated Mar 29, 2026

A tensor-aware point-to-point communication primitive for machine learning

C++ 285 80 Updated Dec 17, 2025

Build multimodal data processing pipelines with Azure AI Services + LLMs

Jupyter Notebook 142 65 Updated Apr 15, 2025

A lightweight design for computation-communication overlap.

Python 225 15 Updated Jan 20, 2026

A prefill & decode disaggregated LLM serving framework with shared GPU memory and fine-grained compute isolation.

Python 124 15 Updated Dec 25, 2025

A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.

C++ 5,654 660 Updated Mar 24, 2026

The Triton backend that allows running GPU-accelerated data pre-processing pipelines implemented in DALI's python API.

C++ 141 35 Updated Mar 24, 2026