myhloli

Xiaomeng Zhao myhloli

380 followers · 17 following

Shanghai AI Lab
Shanghai
23:54 (UTC +08:00)
myhloli.com
https://orcid.org/0009-0007-3365-3090

Achievements

x4 x4 x3 x4

Achievements

x4 x4 x3 x4

Lists (3)

Sort

🔮 Future ideas

✨ Inspiration

🚀 My stack

Stars

opendatalab / MinerU-Diffusion

A diffusion-based framework for document OCR that replaces autoregressive decoding with block-level parallel diffusion decoding.

Python 553 35 Updated Mar 31, 2026

rustdesk / rustdesk

An open-source remote desktop application designed for self-hosting, as an alternative to TeamViewer.

Rust 111,367 16,680 Updated Apr 16, 2026

kreuzberg-dev / kreuzberg

A polyglot document intelligence framework with a Rust core. Extract text, metadata, images, and structured information from PDFs, Office documents, images, and 91+ formats. Available for Rust, Pyt…

Rust 7,571 378 Updated Apr 16, 2026

myhloli / magic-calculator

2026马年春晚魔术计算器应用在线版，免安装，更好用

HTML 6 1 Updated Feb 16, 2026

Kenshin / simpread

简悦 ( SimpRead ) - 让你瞬间进入沉浸式阅读的扩展

JavaScript 8,614 556 Updated Sep 16, 2025

mozilla / readability

A standalone version of the readability lib

JavaScript 11,106 706 Updated Jan 21, 2026

RapidAI / RapidDoc

A high-performance, open-source PDF data extraction tool. 一站式开源高性能数据提取工具，将复杂 PDF 文档转换为 Markdown 和 JSON 格式，使用onnx模型。

Python 147 28 Updated Apr 15, 2026

datalab-to / pykatex

Python 2 Updated Feb 5, 2026

Pacalini / PicaComic

Forked from wgh136/PicaComic

A comic app built with Flutter, supporting multiple comic sources.

Dart 2,729 78 Updated Apr 6, 2026

zai-org / GLM-OCR

GLM-OCR: Accurate × Fast × Comprehensive

Python 5,953 547 Updated Apr 16, 2026

TencentCloudADP / youtu-parsing

Youtu-Parsing: Perception, Structuring and Recognition via High-Parallelism Decoding

Python 66 6 Updated Feb 10, 2026

openclaw / openclaw

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 358,705 72,933 Updated Apr 16, 2026

opendatalab / mineru-vl-utils

A Python package for interacting with the MinerU Vision-Language Model.

Python 113 31 Updated Apr 15, 2026

gpustack / gpustack

A GPU cluster manager that configures and orchestrates inference engines like vLLM and SGLang for high-performance AI model deployment.

Python 4,855 499 Updated Apr 16, 2026

tecnickcom / tc-lib-pdf

TCPDF - PHP PDF Library - https://tcpdf.org

PHP 1,810 244 Updated Apr 15, 2026

tecnickcom / TCPDF

Official clone of PHP library to generate PDF documents and barcodes

PHP 4,543 1,593 Updated Mar 3, 2026

MiroMindAI / MiroThinker

MiroThinker is a deep research agent optimized for complex research and prediction tasks. Our latest models, MiroThinker-1.7 and MiroThinker-H1, achieve 74.0 and 88.2 on the BrowseComp, respectively.

Python 8,123 607 Updated Apr 13, 2026

python-pillow / Pillow

Python Imaging Library (fork)

Python 13,523 2,424 Updated Apr 16, 2026

nissl-lab / npoi

a .NET library that can read/write Office formats without Microsoft Office installed. No COM+, no interop.

C# 6,163 1,482 Updated Apr 16, 2026

opendatalab / MinerU-HTML

MinerU-HTML: An SLM-powered HTML main content extractor that outputs clean HTML bodies. Perfect for Deep Research Agents, RAG applications, and training data generation.

Python 234 24 Updated Mar 27, 2026

Tencent-Hunyuan / HunyuanOCR

Python 1,597 126 Updated Apr 8, 2026

InternLM / lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 7,779 685 Updated Apr 16, 2026

KaTeX / KaTeX

Fast math typesetting for the web.

TypeScript 19,979 1,298 Updated Apr 16, 2026

mwilliamson / python-mammoth

Convert Word documents (.docx files) to HTML

Python 1,077 144 Updated Mar 13, 2026

cubist38 / mlx-openai-server

A high-performance API server that provides OpenAI-compatible endpoints for MLX models. Developed using Python and powered by the FastAPI framework, it provides an efficient, scalable, and user-fri…

Python 302 53 Updated Apr 13, 2026

python-openxml / python-docx

Create and modify Word documents with Python

Python 5,525 1,269 Updated Jun 17, 2025

Blaizzy / mlx-vlm

MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.

Python 4,376 478 Updated Apr 16, 2026

datalab-to / chandra

OCR model that handles complex tables, forms, handwriting with full layout.

Python 8,821 915 Updated Apr 9, 2026

NanoNets / docstrange

Extract and convert data from any document, images, pdfs, word doc, ppt or URL into multiple formats (Markdown, JSON, CSV, HTML) with intelligent structured data extraction and advanced OCR.

Python 1,413 126 Updated Oct 31, 2025

alibaba / Logics-Parsing

Python 1,298 107 Updated Apr 8, 2026