Skip to content
View mingqian-233's full-sized avatar

Block or report mingqian-233

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Encoding and parsing tools.

JavaScript 1,027 85 Updated Jan 28, 2026

[ICML 2026] ZwZ model family: SOTA fine-grained perception performace; ZoomBench: a new challenging perception benchmark

Python 158 2 Updated May 4, 2026
Python 17 Updated Jun 2, 2026

PhotoFlow: Agentic 3D Virtual Photography Missions

HTML 36 Updated May 27, 2026

SkillOpt is a text-space optimizer that trains reusable natural-language skills for frozen LLM agents through trajectory-driven edits, validation-gated updates, and deployable best_skill.md artifacts.

Python 6,842 660 Updated Jun 14, 2026

SpaceDG: Benchmarking Spatial Intelligence under Visual Degradation

Python 30 Updated May 28, 2026

所有小初高、大学PDF教材。

Roff 74,205 16,600 Updated Oct 18, 2025

Open-Source Frontier Voice AI

Python 49,348 5,491 Updated May 6, 2026

Manga/Comics Translation Helper Tool

Python 1,523 163 Updated Nov 22, 2022

Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding

Python 365 3 Updated May 24, 2026

An agent-managed museum exhibit, built in Rust with Gajae-Code / LazyCodex — developed and maintained with no human intervention.

Rust 193,803 109,966 Updated Jun 8, 2026

Free football data from StatsBomb

3,282 933 Updated May 26, 2026

Bridging the gap between image generation and real-world design: a benchmark for structured, multi-constraint commercial visual content generation.

Python 18 2 Updated Apr 24, 2026

The official code of FineRMoE.

Python 20 Updated Mar 17, 2026

Code repo for "EvoTok: A Unified Image Tokenizer via Residual Latent Evolution for Visual Understanding and Generation"

Python 21 Updated Mar 30, 2026

The official repo of CrossEarth-SAR, a sar-centric and billion-scale geospatial foundation model for cross-domain semantic segmentation

Python 46 Updated Mar 18, 2026

Trust Your Critic: Robust Reward Modeling and Reinforcement Learning for Faithful Image Editing and Generation

Python 39 1 Updated Mar 13, 2026

GRADE: Grounded Reasoning Assessment for Discipline-informed Editing

Python 25 1 Updated Apr 23, 2026

InternVL-U is a 4B-parameter unified multimodal model (UMM) that brings multimodal understanding, reasoning, image generation, image editing into a single framework.

Python 286 16 Updated Mar 21, 2026

Stepping VLMs onto the Court: Benchmarking Spatial Intelligence in Sports

Python 70 Updated Mar 15, 2026

[ICML 2026 Oral] Holi-Spatial: Evolving Video Streams into Holistic 3D Spatial Intelligence

321 6 Updated May 25, 2026

[CVPR 2026] Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens

Python 284 17 Updated Aug 2, 2025

The official repository of the first version of ACE-Brain foundation model.

78 2 Updated Mar 13, 2026

CRS-自建Claude Code镜像,一站式开源中转服务,让 Claude、OpenAI、Gemini、Droid 订阅统一接入,支持拼车共享,更高效分摊成本,原生工具无缝使用。

JavaScript 12,091 1,820 Updated Jun 14, 2026

让每一次引用都成为可解释的影响力 Turning Every Citation into Explainable Impact

Python 307 17 Updated May 24, 2026

FireRed-Image-Edit is a powerful image editing foundation model achieving open-source state-of-the-art performance with precise instruction following, high-fidelity generation, superior identity co…

Python 1,258 74 Updated Apr 3, 2026

A brand new web Visual Novel engine | 全新的网页端视觉小说引擎

TypeScript 3,838 353 Updated Jun 14, 2026

PaperBanana: Automating Academic Illustration For AI Scientists

Python 6,561 487 Updated May 11, 2026
Next