SkillOpt is a text-space optimizer that trains reusable natural-language skills for frozen LLM agents through trajectory-driven edits, validation-gated updates, and deployable best_skill.md artifacts.

Python 6,842 660 Updated Jun 14, 2026

Visionary-Laboratory / SpaceDG

SpaceDG: Benchmarking Spatial Intelligence under Visual Degradation

Python 30 Updated May 28, 2026

TapXWorld / ChinaTextbook

所有小初高、大学PDF教材。

Roff 74,205 16,600 Updated Oct 18, 2025

microsoft / VibeVoice

Open-Source Frontier Voice AI

Python 49,348 5,491 Updated May 6, 2026

KUR-creative / SickZil-Machine

Manga/Comics Translation Helper Tool

Python 1,523 163 Updated Nov 22, 2022

MME-Benchmarks / Video-MME-v2

Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding

Python 365 3 Updated May 24, 2026

ultraworkers / claw-code

An agent-managed museum exhibit, built in Rust with Gajae-Code / LazyCodex — developed and maintained with no human intervention.

Rust 193,803 109,966 Updated Jun 8, 2026

statsbomb / open-data

Free football data from StatsBomb

3,282 933 Updated May 26, 2026

microsoft / BizGenEval

Bridging the gap between image generation and real-world design: a benchmark for structured, multi-constraint commercial visual content generation.

Python 18 2 Updated Apr 24, 2026

liaoning97 / FineRMoE

The official code of FineRMoE.

Python 20 Updated Mar 17, 2026

VisionXLab / EvoTok

Code repo for "EvoTok: A Unified Image Tokenizer via Residual Latent Evolution for Visual Understanding and Generation"

Python 21 Updated Mar 30, 2026

VisionXLab / CrossEarth-SAR

The official repo of CrossEarth-SAR, a sar-centric and billion-scale geospatial foundation model for cross-domain semantic segmentation

Python 46 Updated Mar 18, 2026

VisionXLab / FIRM-Reward

Trust Your Critic: Robust Reward Modeling and Reinforcement Learning for Faithful Image Editing and Generation

Python 39 1 Updated Mar 13, 2026

VisionXLab / GRADE

GRADE: Grounded Reasoning Assessment for Discipline-informed Editing

Python 25 1 Updated Apr 23, 2026

OpenGVLab / InternVL-U

InternVL-U is a 4B-parameter unified multimodal model (UMM) that brings multimodal understanding, reasoning, image generation, image editing into a single framework.

Python 286 16 Updated Mar 21, 2026

Visionary-Laboratory / CourtSI

Stepping VLMs onto the Court: Benchmarking Spatial Intelligence in Sports

Python 70 Updated Mar 15, 2026

Visionary-Laboratory / holi-spatial

[ICML 2026 Oral] Holi-Spatial: Evolving Video Streams into Holistic 3D Spatial Intelligence

321 6 Updated May 25, 2026

UMass-Embodied-AGI / Mirage

[CVPR 2026] Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens

Python 284 17 Updated Aug 2, 2025

ACE-Brain-Team / ACE-Brain-0

The official repository of the first version of ACE-Brain foundation model.

78 2 Updated Mar 13, 2026

Wei-Shaw / claude-relay-service

CRS-自建Claude Code镜像，一站式开源中转服务，让 Claude、OpenAI、Gemini、Droid 订阅统一接入，支持拼车共享，更高效分摊成本，原生工具无缝使用。

JavaScript 12,091 1,820 Updated Jun 14, 2026

VisionXLab / CitationClaw

让每一次引用都成为可解释的影响力 Turning Every Citation into Explainable Impact

Python 307 17 Updated May 24, 2026

ResearAI / AutoFigure-Edit

Python 3,772 257 Updated Jun 11, 2026

FireRedTeam / FireRed-Image-Edit

FireRed-Image-Edit is a powerful image editing foundation model achieving open-source state-of-the-art performance with precise instruction following, high-fidelity generation, superior identity co…

Python 1,258 74 Updated Apr 3, 2026

OpenWebGAL / WebGAL

A brand new web Visual Novel engine | 全新的网页端视觉小说引擎

TypeScript 3,838 353 Updated Jun 14, 2026

dwzhu-pku / PaperBanana

PaperBanana: Automating Academic Illustration For AI Scientists

Python 6,561 487 Updated May 11, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Ziqian Fan mingqian-233

Achievements