XDUWen

XDUWen

Stars

NormXU / Layout2Graph

An official implementation of paper "Paragraph2Graph: A Language-independent GNN-based framework for layout analysis"

Python 82 12 Updated Oct 14, 2023

SkillOpt is a text-space optimizer that trains reusable natural-language skills for frozen LLM agents through trajectory-driven edits, validation-gated updates, and deployable best_skill.md artifacts.

Python 6,741 653 Updated Jun 14, 2026

hugohe3 / ppt-master

AI generates a real, editable PowerPoint from any document — native shapes & animations, speaker notes voiced as audio narration, and the option to follow your own .pptx template, not slide images …

Python 27,527 2,449 Updated Jun 14, 2026

HKUDS / Paper2Slides

"Paper2Slides: From Paper to Presentation in One Click"

Python 3,724 474 Updated May 20, 2026

jmiao24 / Paper2Agent

Paper2Agent is a multi-agent AI system that automatically transforms research papers into interactive AI agents with minimal human input.

Jupyter Notebook 2,238 342 Updated Feb 10, 2026

Paper2Poster / Paper2Poster

[NeurIPS 2025] Open-source Multi-agent Poster Generation from Papers

Python 3,782 278 Updated Jun 8, 2026

HKUDS / OpenSpace

"OpenSpace: Make Your Agents: Smarter, Low-Cost, Self-Evolving" -- Community: https://open-space.cloud/

Python 6,530 812 Updated Jun 4, 2026

stanford-iris-lab / meta-harness

Reference code for the Meta-Harness paper.

Python 1,068 104 Updated Apr 29, 2026

MMMGBench / MMMG

MMMG: A Massive, Multidisciplinary, Multi-Tier Generation Benchmark for Text-to-Image Reasoning [NeurIPS 2025 Poster]

Python 24 Updated Dec 10, 2025

alrod97 / LLMs_mazes

MazeBench: Can multimodal LLMs solve visual mazes, or do they just brute-force in token space? Benchmark, 110-maze eval set, and paper (arXiv:2603.26839).

Python 4 Updated May 31, 2026

janhq / visual-thinker

Python 188 18 Updated Nov 26, 2025

deepseek-ai / Janus

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 17,748 2,231 Updated Feb 1, 2025

jiaosiyuu / ThinkGen

ThinkGen: Generalized Thinking for Visual Generation

Python 58 Updated Dec 30, 2025

inclusionAI / LLaDA2.0-Uni

LLaDA2.0-Uni: Understanding and Generation the World.

Python 759 48 Updated May 29, 2026

CodeGoat24 / UnifiedReward

Official implementation of UnifiedReward & [NeurIPS 2025] UnifiedReward-Think & UnifiedReward-Flex

Python 785 41 Updated Mar 19, 2026

multimodal-reasoning-lab / Bagel-Zebra-CoT

https://huggingface.co/datasets/multimodal-reasoning-lab/Zebra-CoT

Python 136 7 Updated Jan 30, 2026

shiwk24 / MathCanvas

This is the official repository for the paper "MathCanvas: Intrinsic Visual Chain-of-Thought for Multimodal Mathematical Reasoning"

Python 77 3 Updated Apr 14, 2026

ThinkMorph / ThinkMorph

[ICLR 2026] The official repository for paper "ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning"

Jupyter Notebook 188 14 Updated May 1, 2026

Fr0zenCrane / UniCoT

[ICLR 2026] Uni-CoT: Towards Unified Chain-of-Thought Reasoning Across Text and Vision

Python 228 7 Updated May 31, 2026

OpenGVLab / InternVL-U

InternVL-U is a 4B-parameter unified multimodal model (UMM) that brings multimodal understanding, reasoning, image generation, image editing into a single framework.

Python 286 16 Updated Mar 21, 2026

mm-vl / ULM-R1

Co-Reinforcement Learning for Unified Multimodal Understanding and Generation

Python 48 5 Updated Jul 22, 2025

ByteDance-Seed / Bagel

Open-source unified multimodal model

Python 6,012 532 Updated May 4, 2026

showlab / Awesome-Unified-Multimodal-Models

📖 This is a repository for organizing papers, codes and other resources related to unified multimodal models.

826 41 Updated Oct 10, 2025

windingwind / zotero-pdf-translate

Translate PDF, EPub, webpage, metadata, annotations, notes to the target language. Support 20+ translate services.

TypeScript 11,075 488 Updated Jun 10, 2026

HITsz-TMG / Awesome-Large-Multimodal-Reasoning-Models

The development and future prospects of large multimodal reasoning models.

613 22 Updated Jan 9, 2026

bohyy / academic-ai-prompt

一套为研究生和学术研究者设计的完整AI Prompt库 📖 包含内容： ✨ 40+ 精心设计的AI Prompt ✨ 论文选题系统方法（生成、评估、论证） ✨ 论文查找快速方案（8个不同方案） ✨ 文献综述框架和工具 ✨ Excel自动评估表格 ✨ 3个完整的论证模板 🚀 核心优势： ⚡ 节省时间 50-70%（选题3-5天而不是2-3周） 🎯 科学方法（基于系统的5维度评估体系） 💡 即插…

1,220 76 Updated Feb 12, 2026

CoderJackZhu / XDUthesis-Typst

西安电子科技大学毕业论文Typst模板

Typst 16 1 Updated May 4, 2025

note286 / xduts

Xidian University TeX Suite 西安电子科技大学LaTeX套装

TeX 1,127 101 Updated May 4, 2025

EricLBuehler / xlora

X-LoRA: Mixture of LoRA Experts

Python 275 21 Updated Aug 4, 2024

TUDB-Labs / mLoRA

An Efficient "Factory" to Build Multiple LoRA Adapters

Python 379 67 Updated Feb 13, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly