Skip to content
View PkuRainBow's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report PkuRainBow

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results
Python 24 Updated Jul 16, 2025

[NeurIPS' 2025] JarvisArt: Liberating Human Artistic Creativity via an Intelligent Photo Retouching Agent

Python 755 33 Updated Apr 4, 2026
883 47 Updated Aug 30, 2025

aider is AI pair programming in your terminal

Python 43,193 4,184 Updated Apr 9, 2026

Official implementation of Inductive Moment Matching

Python 582 15 Updated Jul 11, 2025

Muon is an optimizer for hidden layers in neural networks

Python 2,477 114 Updated Jan 19, 2026

A general framework for inference-time scaling and steering of diffusion models with arbitrary rewards.

Jupyter Notebook 221 21 Updated Jun 26, 2025

Single-pass Adaptive Image Tokenization for Minimum Program Search | What's the Kolmogorov Complexity of an Image?

Jupyter Notebook 42 2 Updated Jul 26, 2025

Kimi K2 is the large language model series developed by Moonshot AI team

10,620 810 Updated Jan 21, 2026

Skywork-R1V is an advanced multimodal AI model series developed by Skywork AI, specializing in vision-language reasoning.

Python 3,169 279 Updated Dec 15, 2025

SkyReels-V2: Infinite-length Film Generative model

Python 6,747 1,418 Updated Jan 29, 2026

PyTorch Code for Energy-Based Transformers paper -- generalizable reasoning and scalable learning

Python 621 87 Updated Mar 1, 2026

The world's first open-source multimodal creative assistant This is a substitute for Canva and Manus that prioritizes privacy and is usable locally.

TypeScript 6,094 586 Updated Mar 2, 2026

Official implementation for "SVGFusion: Scalable Text-to-SVG Generation via Vector Space Diffusion" https://arxiv.org/abs/2412.10437

69 1 Updated Dec 13, 2024

Official DINO-X Model Context Protocol (MCP) server that empowers LLMs with real-world visual perception through image object detection, localization, and captioning APIs.

TypeScript 118 10 Updated Oct 28, 2025

[ICLR-2026] Rex-Thinker: Grounded Object Refering via Chain-of-Thought Reasoning

Python 147 7 Updated Jun 30, 2025

OmniGen2: Exploration to Advanced Multimodal Generation. https://arxiv.org/abs/2506.18871

Jupyter Notebook 4,043 23 Updated Mar 20, 2026

[ICLR'26] Rethinking High-Quality Aesthetic Poster Generation in a Unified Framework

Python 534 34 Updated Jan 27, 2026

FlexTok: Resampling Images into 1D Token Sequences of Flexible Length

Jupyter Notebook 310 13 Updated Jun 2, 2025
Python 24 1 Updated Jun 18, 2025

Roblox Foundation Model for 3D Intelligence

Jupyter Notebook 934 88 Updated Jul 22, 2025

[NeurIPS 2025 Oral] Official Code for Exploring Diffusion Transformer Designs via Grafting

Jupyter Notebook 72 2 Updated Jan 9, 2026

This is official Pytorch implementation of "Rethinking the necessity of image fusion in high-level vision tasks: A practical infrared and visible image fusion network based on progressive semantic …

Python 211 13 Updated Apr 7, 2026

Cosmos-Predict2 is a collection of general-purpose world foundation models for Physical AI that can be fine-tuned into customized world models for downstream applications.

Python 760 102 Updated Oct 29, 2025

MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model.

Python 3,136 282 Updated Jul 7, 2025

MMMG: A Massive, Multidisciplinary, Multi-Tier Generation Benchmark for Text-to-Image Reasoning [NeurIPS 2025 Poster]

Python 23 Updated Dec 10, 2025

PrismLayers: Open Data for High-Quality Multi-Layer Transparent Image Generative Models

Jupyter Notebook 35 1 Updated Jan 14, 2026

Mobile-Agent: The Powerful GUI Agent Family

Python 8,447 852 Updated Mar 31, 2026

[ECCV2024 Oral] Official implementation of the paper "Relation DETR: Exploring Explicit Position Relation Prior for Object Detection"

Python 256 18 Updated Nov 24, 2024
Next