Skip to content
View XDUWen's full-sized avatar

Block or report XDUWen

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

An official implementation of paper "Paragraph2Graph: A Language-independent GNN-based framework for layout analysis"

Python 82 12 Updated Oct 14, 2023

SkillOpt is a text-space optimizer that trains reusable natural-language skills for frozen LLM agents through trajectory-driven edits, validation-gated updates, and deployable best_skill.md artifacts.

Python 6,741 653 Updated Jun 14, 2026

AI generates a real, editable PowerPoint from any document — native shapes & animations, speaker notes voiced as audio narration, and the option to follow your own .pptx template, not slide images …

Python 27,527 2,449 Updated Jun 14, 2026

"Paper2Slides: From Paper to Presentation in One Click"

Python 3,724 474 Updated May 20, 2026

Paper2Agent is a multi-agent AI system that automatically transforms research papers into interactive AI agents with minimal human input.

Jupyter Notebook 2,238 342 Updated Feb 10, 2026

[NeurIPS 2025] Open-source Multi-agent Poster Generation from Papers

Python 3,782 278 Updated Jun 8, 2026

"OpenSpace: Make Your Agents: Smarter, Low-Cost, Self-Evolving" -- Community: https://open-space.cloud/

Python 6,530 812 Updated Jun 4, 2026

Reference code for the Meta-Harness paper.

Python 1,068 104 Updated Apr 29, 2026

MMMG: A Massive, Multidisciplinary, Multi-Tier Generation Benchmark for Text-to-Image Reasoning [NeurIPS 2025 Poster]

Python 24 Updated Dec 10, 2025

MazeBench: Can multimodal LLMs solve visual mazes, or do they just brute-force in token space? Benchmark, 110-maze eval set, and paper (arXiv:2603.26839).

Python 4 Updated May 31, 2026
Python 188 18 Updated Nov 26, 2025

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 17,748 2,231 Updated Feb 1, 2025

ThinkGen: Generalized Thinking for Visual Generation

Python 58 Updated Dec 30, 2025

LLaDA2.0-Uni: Understanding and Generation the World.

Python 759 48 Updated May 29, 2026

Official implementation of UnifiedReward & [NeurIPS 2025] UnifiedReward-Think & UnifiedReward-Flex

Python 785 41 Updated Mar 19, 2026

https://huggingface.co/datasets/multimodal-reasoning-lab/Zebra-CoT

Python 136 7 Updated Jan 30, 2026

This is the official repository for the paper "MathCanvas: Intrinsic Visual Chain-of-Thought for Multimodal Mathematical Reasoning"

Python 77 3 Updated Apr 14, 2026

[ICLR 2026] The official repository for paper "ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning"

Jupyter Notebook 188 14 Updated May 1, 2026

[ICLR 2026] Uni-CoT: Towards Unified Chain-of-Thought Reasoning Across Text and Vision

Python 228 7 Updated May 31, 2026

InternVL-U is a 4B-parameter unified multimodal model (UMM) that brings multimodal understanding, reasoning, image generation, image editing into a single framework.

Python 286 16 Updated Mar 21, 2026

Co-Reinforcement Learning for Unified Multimodal Understanding and Generation

Python 48 5 Updated Jul 22, 2025

Open-source unified multimodal model

Python 6,012 532 Updated May 4, 2026

📖 This is a repository for organizing papers, codes and other resources related to unified multimodal models.

826 41 Updated Oct 10, 2025

Translate PDF, EPub, webpage, metadata, annotations, notes to the target language. Support 20+ translate services.

TypeScript 11,075 488 Updated Jun 10, 2026

The development and future prospects of large multimodal reasoning models.

613 22 Updated Jan 9, 2026

一套为研究生和学术研究者设计的完整AI Prompt库 📖 包含内容: ✨ 40+ 精心设计的AI Prompt ✨ 论文选题系统方法(生成、评估、论证) ✨ 论文查找快速方案(8个不同方案) ✨ 文献综述框架和工具 ✨ Excel自动评估表格 ✨ 3个完整的论证模板 🚀 核心优势: ⚡ 节省时间 50-70%(选题3-5天而不是2-3周) 🎯 科学方法(基于系统的5维度评估体系) 💡 即插…

1,220 76 Updated Feb 12, 2026

西安电子科技大学毕业论文Typst模板

Typst 16 1 Updated May 4, 2025

Xidian University TeX Suite 西安电子科技大学LaTeX套装

TeX 1,127 101 Updated May 4, 2025

X-LoRA: Mixture of LoRA Experts

Python 275 21 Updated Aug 4, 2024

An Efficient "Factory" to Build Multiple LoRA Adapters

Python 379 67 Updated Feb 13, 2025
Next