lizzy8587

Yan Li lizzy8587

6 followers · 1 following

Shanghai Jiao Tong University
Shanghai
https://lizzy8587.github.io

Highlights

Stars

Tencent-Hunyuan / UniRL

UniRL is a Framework for Unified Multimodal Model Reinforcement Learning

Python 613 33 Updated Jun 15, 2026

SkillOpt is a text-space optimizer that trains reusable natural-language skills for frozen LLM agents through trajectory-driven edits, validation-gated updates, and deployable best_skill.md artifacts.

Python 7,003 670 Updated Jun 15, 2026

iOfficeAI / OfficeCLI

OfficeCLI is the first and best Office suite purpose-built for AI agents to read, edit, and automate Word, Excel, and PowerPoint files. Free, open-source, single binary, no Office installation requ…

C# 7,114 531 Updated Jun 15, 2026

gaoxin492 / msra-skills

Making daily work at MSRA easier — especially cluster training, data management, and server operations.

TeX 6 Updated Jun 14, 2026

microsoft / BizGenEval

Bridging the gap between image generation and real-world design: a benchmark for structured, multi-constraint commercial visual content generation.

Python 18 2 Updated Apr 24, 2026

OpenSenseNova / SenseNova-U1

SenseNova-U series: Native Unified Paradigm with NEO-unify from the First Principles

Python 3,174 274 Updated Jun 15, 2026

microsoft / MM-WebAgent

Build coherent and visually polished multimodal webpages with hierarchical planning, AIGC tools, and iterative reflection.

Python 12 2 Updated May 17, 2026

github / awesome-copilot

Community-contributed instructions, agents, skills, and configurations to help you make the most of GitHub Copilot.

Python 35,041 4,314 Updated Jun 15, 2026

liaoning97 / FineRMoE

The official code of FineRMoE.

Python 20 Updated Mar 17, 2026

chengzhag / UCPE

📷 [CVPR'26] Camera-controlled text-to-video generation, now with intrinsics, distortion and orientation control!

Python 187 3 Updated May 15, 2026

VisionXLab / GRADE

GRADE: Grounded Reasoning Assessment for Discipline-informed Editing

Python 25 1 Updated Apr 23, 2026

VisionXLab / FIRM-Reward

Trust Your Critic: Robust Reward Modeling and Reinforcement Learning for Faithful Image Editing and Generation

Python 39 1 Updated Mar 13, 2026

VisionXLab / EvoTok

Code repo for "EvoTok: A Unified Image Tokenizer via Residual Latent Evolution for Visual Understanding and Generation"

Python 21 Updated Mar 30, 2026

OpenGVLab / InternVL-U

InternVL-U is a 4B-parameter unified multimodal model (UMM) that brings multimodal understanding, reasoning, image generation, image editing into a single framework.

Python 287 16 Updated Mar 21, 2026