Skip to content
View SlotherCui's full-sized avatar

Block or report SlotherCui

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Generative Refinement Networks for Visual Synthesis

Python 50 Updated Apr 23, 2026

A plug-and-play compiler that delivers free-lunch optimizations for both inference and training.

Python 299 23 Updated Apr 27, 2026

PyTorch code and models for VJEPA2 self-supervised learning from video.

Python 3,741 449 Updated Mar 23, 2026

Official Python inference and LoRA trainer package for the LTX-2 audio–video generative model.

Python 6,146 972 Updated Apr 23, 2026

Ongoing research training transformer models at scale

Python 16,178 3,883 Updated Apr 28, 2026

code & model for arxiv paper "Autoregressive Image Generation with Masked Bit Modeling"

Python 49 2 Updated Apr 8, 2026

BitDance & UniWeTok: Open-source autoregressive model with binary visual tokens. A research project for building powerful multimodal autoregressive model.

Python 471 29 Updated Apr 20, 2026

UEval: A Benchmark for Unified Multimodal Generation

Python 18 Updated Apr 20, 2026

ViPE: Video Pose Engine for Geometric 3D Perception

Python 1,882 147 Updated Apr 15, 2026

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 3,110 275 Updated Apr 28, 2026

HY-World 1.5: A Systematic Framework for Interactive World Modeling with Real-Time Latency and Geometric Consistency

Python 1,476 132 Updated Apr 15, 2026

NEO Series: Native Vision-Language Models from First Principles

Python 722 27 Updated Apr 26, 2026

This repo provides the official code for : 1) TransBTS: Multimodal Brain Tumor Segmentation Using Transformer (https://arxiv.org/abs/2103.04430) , accepted by MICCAI2021. 2) TransBTSV2: Towards Bet…

Python 443 93 Updated Mar 11, 2024

Democratizing Reinforcement Learning for LLMs

Python 5,457 547 Updated Apr 28, 2026

Native Multimodal Models are World Learners

Python 1,504 62 Updated Dec 30, 2025

Grounding Image Matching in 3D with MASt3R

Python 2,878 256 Updated Jun 30, 2025

[ICLR 2026] On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification.

Python 558 22 Updated Jan 4, 2026

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

Python 4,078 572 Updated Apr 28, 2026

Open-source unified multimodal model

Python 5,871 520 Updated Oct 27, 2025

MAGI-1: Autoregressive Video Generation at Scale

Python 3,685 238 Updated Jun 17, 2025

[ICLR'24 & IJCV‘25] Matcher: Segment Anything with One Shot Using All-Purpose Feature Matching

Python 557 44 Updated Dec 3, 2025

[ICLR 2025] Diffusion Feedback Helps CLIP See Better

Python 301 14 Updated Jan 23, 2025
Python 4,646 459 Updated Apr 15, 2026

An Open-source RL System from ByteDance Seed and Tsinghua AIR

Python 1,794 84 Updated May 11, 2025

OpenVLA: An open-source vision-language-action model for robotic manipulation.

Python 6,021 712 Updated Mar 23, 2025

Raster to Vector Graphics Converter

Rust 5,916 391 Updated Mar 23, 2026

[ICML 2025] Official PyTorch Implementation of "History-Guided Video Diffusion"

Python 659 32 Updated Jul 1, 2025
Next