Skip to content
View lizzy8587's full-sized avatar

Highlights

  • Pro

Block or report lizzy8587

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

UniRL is a Framework for Unified Multimodal Model Reinforcement Learning

Python 613 33 Updated Jun 15, 2026
Python 17 Updated Jun 2, 2026

SkillOpt is a text-space optimizer that trains reusable natural-language skills for frozen LLM agents through trajectory-driven edits, validation-gated updates, and deployable best_skill.md artifacts.

Python 7,003 670 Updated Jun 15, 2026

OfficeCLI is the first and best Office suite purpose-built for AI agents to read, edit, and automate Word, Excel, and PowerPoint files. Free, open-source, single binary, no Office installation requ…

C# 7,114 531 Updated Jun 15, 2026

Making daily work at MSRA easier — especially cluster training, data management, and server operations.

TeX 6 Updated Jun 14, 2026

Bridging the gap between image generation and real-world design: a benchmark for structured, multi-constraint commercial visual content generation.

Python 18 2 Updated Apr 24, 2026

SenseNova-U series: Native Unified Paradigm with NEO-unify from the First Principles

Python 3,174 274 Updated Jun 15, 2026

Build coherent and visually polished multimodal webpages with hierarchical planning, AIGC tools, and iterative reflection.

Python 12 2 Updated May 17, 2026

Community-contributed instructions, agents, skills, and configurations to help you make the most of GitHub Copilot.

Python 35,041 4,314 Updated Jun 15, 2026

The official code of FineRMoE.

Python 20 Updated Mar 17, 2026

📷 [CVPR'26] Camera-controlled text-to-video generation, now with intrinsics, distortion and orientation control!

Python 187 3 Updated May 15, 2026

GRADE: Grounded Reasoning Assessment for Discipline-informed Editing

Python 25 1 Updated Apr 23, 2026

Trust Your Critic: Robust Reward Modeling and Reinforcement Learning for Faithful Image Editing and Generation

Python 39 1 Updated Mar 13, 2026

Code repo for "EvoTok: A Unified Image Tokenizer via Residual Latent Evolution for Visual Understanding and Generation"

Python 21 Updated Mar 30, 2026

InternVL-U is a 4B-parameter unified multimodal model (UMM) that brings multimodal understanding, reasoning, image generation, image editing into a single framework.

Python 287 16 Updated Mar 21, 2026

[ICML 2026 Oral] Holi-Spatial: Evolving Video Streams into Holistic 3D Spatial Intelligence

321 6 Updated May 25, 2026

(ICML2026) Official implementation of VLANeXt.

Python 200 9 Updated May 17, 2026

A list of works on evaluation of visual generation models, including evaluation metrics, models, and systems

452 24 Updated May 8, 2026

A unified framework for easy reinforcement learning in Flow-Matching models

Python 572 47 Updated Jun 15, 2026

[NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL

Python 2,341 165 Updated May 7, 2026

[ICLR 2026] Uni-CoT: Towards Unified Chain-of-Thought Reasoning Across Text and Vision

Python 228 7 Updated May 31, 2026
Python 11,545 787 Updated Feb 9, 2026

Visionary: The World Model Carrier Built on WebGPU-Powered Gaussian Splatting Platform

Python 508 32 Updated Apr 17, 2026

Wan: Open and Advanced Large-Scale Video Generative Models

Python 16,259 2,860 Updated Mar 5, 2026

verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

Python 21,970 4,076 Updated Jun 15, 2026

official training and inference code of bitwise tokenizer

Python 72 2 Updated May 18, 2025

[CVPR 2025 Oral]Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

Python 1,573 92 Updated Apr 16, 2026

Official inference code and LongText-Bench benchmark for our paper X-Omni (https://arxiv.org/pdf/2507.22058).

Python 425 11 Updated Aug 26, 2025

[ICML 2025 Oral] An official implementation of VideoRoPE & VideoRoPE++

Python 221 5 Updated Apr 15, 2026
Next