hwf1324

Follow

WangFeng Huang hwf1324

Follow

11 followers · 9 following

05:24 (UTC +08:00)

Achievements

Achievements

Lists (3)

Sort

Developer Tools

mdbook

🚀 My stack

Starred repositories

75 stars written in Python

EbookFoundation / free-programming-books

📚 Freely available programming books

Python 376,255 65,318 Updated Nov 4, 2025

huggingface / transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 152,180 31,061 Updated Nov 6, 2025

python / cpython

The Python programming language

Python 69,695 33,302 Updated Nov 6, 2025

PaddlePaddle / PaddleOCR

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

Python 62,833 9,258 Updated Nov 6, 2025

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 62,339 11,080 Updated Nov 6, 2025

ultralytics / ultralytics

Ultralytics YOLO 🚀

Python 48,354 9,328 Updated Nov 6, 2025

opendatalab / MinerU

Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.

Python 48,211 3,988 Updated Nov 6, 2025

gradio-app / gradio

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Python 40,425 3,116 Updated Nov 6, 2025

ocrmypdf / OCRmyPDF

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

Python 31,689 2,202 Updated Oct 27, 2025

huggingface / diffusers

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 31,506 6,476 Updated Nov 6, 2025

frappe / erpnext

Free and Open Source Enterprise Resource Planning (ERP)

Python 29,968 9,744 Updated Nov 6, 2025

datalab-to / marker

Convert PDF to markdown + JSON quickly with high accuracy

Python 29,653 1,996 Updated Nov 3, 2025

microsoft / BitNet

Official inference framework for 1-bit LLMs

Python 24,359 1,888 Updated Jun 3, 2025

Anjok07 / ultimatevocalremovergui

GUI for a Vocal Remover that uses Deep Neural Networks.

Python 22,358 1,667 Updated Mar 13, 2025

deepseek-ai / DeepSeek-OCR

Contexts Optical Compression

Python 19,713 1,390 Updated Oct 25, 2025

allenai / olmocr

Toolkit for linearizing PDFs for LLM datasets/training

Python 15,820 1,198 Updated Nov 4, 2025

microsoft / pyright

Static Type Checker for Python

Python 14,950 1,746 Updated Nov 2, 2025

index-tts / index-tts

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

Python 14,862 1,677 Updated Oct 30, 2025

LibreTranslate / LibreTranslate

Free and Open Source Machine Translation API. Self-hosted, offline capable and easy to setup.

Python 13,015 1,334 Updated Nov 6, 2025

sml2h3 / ddddocr

带带弟弟通用验证码识别OCR pypi版

Python 12,973 2,135 Updated Jun 9, 2025

pwxcoo / chinese-xinhua

📙 中华新华字典数据库。包括歇后语，成语，词语，汉字。

Python 11,379 2,642 Updated Dec 26, 2023

opendatalab / PDF-Extract-Kit

A Comprehensive Toolkit for High-Quality PDF Content Extraction

Python 8,887 670 Updated Jan 3, 2025

KoljaB / RealtimeSTT

A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.

Python 8,864 750 Updated Jul 11, 2025

GeeeekExplorer / nano-vllm

Nano vLLM

Python 8,379 1,022 Updated Nov 3, 2025

gaogaotiantian / viztracer

A debugging and profiling tool that can trace and visualize python code execution

Python 7,297 464 Updated Nov 5, 2025

levihsu / OOTDiffusion

[AAAI 2025] Official implementation of "OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on"

Python 6,455 932 Updated May 13, 2024

rednote-hilab / dots.ocr

Multilingual Document Layout Parsing in a Single Vision-Language Model

Python 5,594 562 Updated Oct 31, 2025

RapidAI / RapidOCR

📄 Awesome OCR multiple programing languages toolkits based on ONNXRuntime, OpenVINO, PaddlePaddle and PyTorch.

Python 5,217 527 Updated Oct 15, 2025

mrexodia / ida-pro-mcp

AI-powered reverse engineering assistant that bridges IDA Pro with language models through MCP.

Python 4,148 418 Updated Nov 6, 2025

xiaofengShi / CHINESE-OCR

[python3.6] 运用tf实现自然场景文字检测,keras/pytorch实现ctpn+crnn+ctc实现不定长场景文字OCR识别

Python 2,952 956 Updated Aug 13, 2019

Starred topics

command-line