Stars
A plug-and-play compiler that delivers free-lunch optimizations for both inference and training.
PyTorch code and models for VJEPA2 self-supervised learning from video.
Official Python inference and LoRA trainer package for the LTX-2 audio–video generative model.
Ongoing research on training transformer models at scale
Code and model for the arXiv paper "Autoregressive Image Generation with Masked Bit Modeling"
BitDance & UniWeTok: an open-source autoregressive model with binary visual tokens. A research project for building powerful multimodal autoregressive models.
UEval: A Benchmark for Unified Multimodal Generation
ViPE: Video Pose Engine for Geometric 3D Perception
An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models
HY-World 1.5: A Systematic Framework for Interactive World Modeling with Real-Time Latency and Geometric Consistency
NEO Series: Native Vision-Language Models from First Principles
This repo provides the official code for: 1) TransBTS: Multimodal Brain Tumor Segmentation Using Transformer (https://arxiv.org/abs/2103.04430), accepted at MICCAI 2021; 2) TransBTSV2: Towards Bet…
Native Multimodal Models are World Learners
[ICLR 2026] On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification.
One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks
MAGI-1: Autoregressive Video Generation at Scale
[ICLR'24 & IJCV'25] Matcher: Segment Anything with One Shot Using All-Purpose Feature Matching
[ICLR 2025] Diffusion Feedback Helps CLIP See Better
An Open-source RL System from ByteDance Seed and Tsinghua AIR
openvla / openvla
Forked from TRI-ML/prismatic-vlms. OpenVLA: An open-source vision-language-action model for robotic manipulation.
[ICML 2025] Official PyTorch Implementation of "History-Guided Video Diffusion"