Skip to content
View PkuRainBow's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report PkuRainBow

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results
Python 24 Updated Jul 16, 2025

[NeurIPS' 2025] JarvisArt: Liberating Human Artistic Creativity via an Intelligent Photo Retouching Agent

Python 746 28 Updated Dec 17, 2025
864 51 Updated Aug 30, 2025

aider is AI pair programming in your terminal

Python 39,090 3,754 Updated Dec 18, 2025

Official implementation of Inductive Moment Matching

Python 567 13 Updated Jul 11, 2025

Muon is an optimizer for hidden layers in neural networks

Python 2,116 99 Updated Nov 23, 2025

A general framework for inference-time scaling and steering of diffusion models with arbitrary rewards.

Jupyter Notebook 200 13 Updated Jun 26, 2025

Single-pass Adaptive Image Tokenization for Minimum Program Search | What's the Kolmogorov Complexity of an Image?

Jupyter Notebook 42 2 Updated Jul 26, 2025

Kimi K2 is the large language model series developed by Moonshot AI team

9,739 705 Updated Nov 7, 2025

Skywork-R1V is an advanced multimodal AI model series developed by Skywork AI, specializing in vision-language reasoning.

Python 3,133 274 Updated Dec 15, 2025

SkyReels-V2: Infinite-length Film Generative model

Python 5,215 855 Updated Aug 11, 2025

PyTorch Code for Energy-Based Transformers paper -- generalizable reasoning and scalable learning

Python 567 77 Updated Nov 12, 2025

The world's first open-source multimodal creative assistant This is a substitute for Canva and Manus that prioritizes privacy and is usable locally.

TypeScript 5,528 497 Updated Nov 10, 2025

Official implementation for "SVGFusion: Scalable Text-to-SVG Generation via Vector Space Diffusion" https://arxiv.org/abs/2412.10437

68 1 Updated Dec 13, 2024

Official DINO-X Model Context Protocol (MCP) server that empowers LLMs with real-world visual perception through image object detection, localization, and captioning APIs.

TypeScript 108 10 Updated Oct 28, 2025

Rex-Thinker: Grounded Object Refering via Chain-of-Thought Reasoning

Python 131 7 Updated Jun 30, 2025

OmniGen2: Exploration to Advanced Multimodal Generation.

Jupyter Notebook 3,972 12 Updated Dec 2, 2025

Rethinking High-Quality Aesthetic Poster Generation in a Unified Framework

Python 516 34 Updated Sep 23, 2025

FlexTok: Resampling Images into 1D Token Sequences of Flexible Length

Jupyter Notebook 277 14 Updated Jun 2, 2025
Python 24 1 Updated Jun 18, 2025

Roblox Foundation Model for 3D Intelligence

Jupyter Notebook 869 80 Updated Jul 22, 2025

[NeurIPS 2025 Oral] Exploring Diffusion Transformer Designs via Grafting

Jupyter Notebook 67 2 Updated Jun 18, 2025

This is official Pytorch implementation of "Rethinking the necessity of image fusion in high-level vision tasks: A practical infrared and visible image fusion network based on progressive semantic …

Python 205 10 Updated Apr 28, 2025

Cosmos-Predict2 is a collection of general-purpose world foundation models for Physical AI that can be fine-tuned into customized world models for downstream applications.

Python 687 92 Updated Oct 29, 2025

MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model.

Python 3,002 264 Updated Jul 7, 2025

MMMG: A Massive, Multidisciplinary, Multi-Tier Generation Benchmark for Text-to-Image Reasoning [NeurIPS 2025 Poster]

Python 22 Updated Dec 10, 2025

PrismLayers: Open Data for High-Quality Multi-Layer Transparent Image Generative Models

Jupyter Notebook 22 1 Updated Aug 11, 2025

Mobile-Agent: The Powerful GUI Agent Family

Python 6,767 687 Updated Dec 2, 2025

[ECCV2024 Oral] Official implementation of the paper "Relation DETR: Exploring Explicit Position Relation Prior for Object Detection"

Python 251 19 Updated Nov 24, 2024
Next