Skip to content
View wuhaer's full-sized avatar

Block or report wuhaer

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official implementation of URaG: Unified Retrieval and Generation in Multimodal LLMs for Efficient Long Document Understanding (AAAI 2026 Oral).

31 Updated Nov 14, 2025

[AAAI 2026 Oral] The official GitHub page of "PosterVerse: A Full-Workflow Framework for Commercial-Grade Poster Generation with HTML-Based Scalable Typography"

2 Updated Nov 13, 2025

[arXiv 25] Aesthetics is Cheap, Show me the Text: An Empirical Evaluation of State-of-the-Art Generative Models for OCR

243 3 Updated Aug 28, 2025

Evaluation of the Optical Character Recognition (OCR) capabilities of GPT-4V(ision)

Python 125 4 Updated Nov 13, 2023

[IEEE TPAMI 2025] Privacy-Preserving Biometric Verification With Handwritten Random Digit String

Python 65 Updated Aug 3, 2025

[ACL 2025 main] The official GitHub page of "Reviving Cultural Heritage: A Novel Approach for Comprehensive Historical Document Restoration"

Python 51 4 Updated Dec 22, 2025

The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.

Python 8,073 677 Updated Dec 17, 2025

[PR 2025] The official GitHub page of "MegaHan97K: A Large-Scale Dataset for Mega-Category Chinese Character Recognition with over 97K Categories"

Python 71 5 Updated Dec 22, 2025

[CVPR NTIRE2025 ImageSRx4] BBox Team's Solution

Python 5 Updated Mar 24, 2025

[IEEE TIFS 2024] Online Writer Retrieval with Chinese Handwritten Phrases: A Synergistic Temporal-Frequency Representation Learning Approach

Python 55 1 Updated Aug 3, 2025

[IJCV 2025] Smaller But Better: Unifying Layout Generation with Smaller Large Language Models

Python 149 3 Updated Aug 3, 2025