Skip to content
View wzk1015's full-sized avatar
😎
😎

Organizations

@OpenGVLab

Block or report wzk1015

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

OmniGen2: Exploration to Advanced Multimodal Generation. https://arxiv.org/abs/2506.18871

Jupyter Notebook 4,045 23 Updated Mar 20, 2026

Official Repo for paper "Scientific Graphics Program Synthesis via Dual Self-Consistency Reinforcement Learning"

Python 2 Updated Apr 2, 2026

JoyAI-Image is the unified multimodal foundation model for image understanding, text-to-image generation, and instruction-guided image editing.

Python 1,655 89 Updated Apr 12, 2026

注意建立这个repo只是因为网页自身就是全部源码,原作者并未声明license所以本repo也不包含license,一切行为请自行斟酌,不要给原作者添麻烦。 原作者:B站@蛆肉儿串儿

HTML 376 308 Updated Apr 14, 2026

SF Mono Font

843 103 Updated Jun 7, 2018

CORAL is a robust, lightweight infrastructure for multi-agent autonomous self-evolution, built for autoresearch.

Python 460 60 Updated Apr 14, 2026

Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual information for complex reasoning, planning, and generation.

1,417 42 Updated Mar 9, 2026

ScaleEdit-12M is the largest open-source image editing dataset to date, spanning 23 task families across diverse real and synthetic domains.

10 Updated Apr 3, 2026

The repo is finally unlocked. enjoy the party! The fastest repo in history to surpass 100K stars ⭐. Join Discord: https://discord.gg/5TUQKqFWd Built in Rust using oh-my-codex.

Rust 183,898 107,991 Updated Apr 13, 2026

Bridging the gap between image generation and real-world design: a benchmark for structured, multi-constraint commercial visual content generation.

Python 14 2 Updated Apr 3, 2026

Convert documentation websites, GitHub repositories, and PDFs into Claude AI skills with automatic conflict detection

Python 12,770 1,319 Updated Apr 12, 2026

FULL Augment Code, Claude Code, Cluely, CodeBuddy, Comet, Cursor, Devin AI, Junie, Kiro, Leap.new, Lovable, Manus, NotionAI, Orchids.app, Perplexity, Poke, Qoder, Replit, Same.dev, Trae, Traycer AI…

135,144 33,969 Updated Mar 28, 2026

GRADE: Grounded Reasoning Assessment for Discipline-informed Editing

Python 25 1 Updated Mar 15, 2026

We provide TextEdit, a high-quality, multi-scenario text editing benchmark for generation models.

Python 19 Updated Mar 16, 2026
Python 1 Updated Mar 4, 2026

The first unified, efficient, and extensible evaluation toolkit for evaluating image generation and editing models across multiple benchmarks.

Jupyter Notebook 41 4 Updated Apr 12, 2026

InternVL-U is a 4B-parameter unified multimodal model (UMM) that brings multimodal understanding, reasoning, image generation, image editing into a single framework.

Python 265 14 Updated Mar 21, 2026

This is a repository for awesome any2any work collection.

21 1 Updated Mar 13, 2026

Memento-Skills: Let Agents Design Agents

Python 1,204 122 Updated Mar 31, 2026

RISE-Video: Can Video Generators Decode Implicit World Rules?

Python 27 Updated Mar 26, 2026

PaperBanana: Automating Academic Illustration For AI Scientists

Python 5,843 416 Updated Mar 24, 2026

Build, evaluate, and integrate long-term memory for self-evolving agents.

Python 3,811 401 Updated Apr 14, 2026
Python 210 123 Updated Mar 19, 2026

🏆 Add dynamically generated GitHub Stat Trophies on your readme

TypeScript 6,476 1,670 Updated Apr 13, 2026
Python 177 1 Updated Jan 19, 2026

ViLoMem: Agentic Learner with Grow-and-Refine Multimodal Semantic Memory

Python 59 7 Updated Nov 27, 2025

Official Code of Memento: Fine-tuning LLM Agents without Fine-tuning LLMs

Python 2,396 282 Updated Oct 5, 2025
Next