Skip to content
View mainjzb's full-sized avatar

Block or report mainjzb

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

35 stars written in Python
Clear filter

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 152,141 31,052 Updated Nov 6, 2025

⏬ Dumb downloader that scrapes the web

Python 56,533 9,801 Updated Apr 27, 2025

LlamaIndex is the leading framework for building LLM-powered agents over your data.

Python 45,089 6,499 Updated Nov 5, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 40,610 4,613 Updated Nov 6, 2025

OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片,PDF文档识别,排除水印/页眉页脚,扫描/生成二维码。内置多国语言库。

Python 39,518 3,909 Updated May 31, 2025

Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…

Python 36,442 6,054 Updated Oct 30, 2025

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 31,918 6,622 Updated Sep 30, 2025

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

Python 28,323 3,493 Updated Sep 24, 2024

中文独立博客列表

Python 22,346 2,603 Updated Nov 4, 2025

Translate manga/image 一键翻译各类图片内文字 https://cotrans.touhou.ai/ (no longer working)

Python 8,792 860 Updated Oct 27, 2025

视频硬字幕提取,生成srt文件。无需申请第三方API,本地实现文本识别。基于深度学习的视频字幕提取框架,包含字幕区域检测、字幕内容提取。A GUI tool for extracting hard-coded subtitle (hardsub) from videos and generating srt files.

Python 7,998 827 Updated Aug 21, 2025

pycorrector is a toolkit for text error correction. 文本纠错,实现了Kenlm,T5,MacBERT,ChatGLM3,Qwen2.5等模型应用在纠错场景,开箱即用。

Python 6,230 1,155 Updated Sep 8, 2025

📄 Awesome OCR multiple programing languages toolkits based on ONNXRuntime, OpenVINO, PaddlePaddle and PyTorch.

Python 5,213 527 Updated Oct 15, 2025

Production First and Production Ready End-to-End Speech Recognition Toolkit

Python 4,887 1,166 Updated Nov 3, 2025

跟我一起写Makefile重制版

Python 3,608 597 Updated Oct 21, 2025

A synthetic data generator for text recognition

Python 3,588 1,014 Updated Jul 18, 2024

A dark style sheet for QtWidgets application

Python 3,031 744 Updated Jul 16, 2025

[python3.6] 运用tf实现自然场景文字检测,keras/pytorch实现ctpn+crnn+ctc实现不定长场景文字OCR识别

Python 2,952 956 Updated Aug 13, 2019

CTPN + DenseNet + CTC based end-to-end Chinese OCR implemented using tensorflow and keras

Python 2,786 1,073 Updated Oct 8, 2019

ICCV2019 - Learning to Paint With Model-based Deep Reinforcement Learning

Python 2,290 312 Updated May 7, 2025

Unsupervised Word Segmentation for Neural Machine Translation and Text Generation

Python 2,255 474 Updated Aug 7, 2024

一款基于VUE3.0的高颜值卡密发卡系统,特别适合虚拟商品、知识付费等。

Python 2,224 565 Updated Dec 13, 2023

[ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.

Python 1,876 144 Updated Dec 30, 2024

make a better chinese character recognition OCR than tesseract

Python 1,514 483 Updated Nov 12, 2017

基于Pytorch的OCR工具库,支持常用的文字检测和识别算法

Python 1,498 313 Updated Sep 2, 2024

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

Python 933 234 Updated Jun 18, 2020

基于深度学习的漫画翻译辅助工具,包含翻译、朗读、图像去字、自动嵌字功能。 目的是帮助非专业汉化人员完成更简单,快速的翻译任务。

Python 630 51 Updated Nov 22, 2022

Arch Linux CN Community repo mirrors list

Python 573 53 Updated Apr 22, 2025

Library for translating between 200 languages. Built on 🤗 transformers.

Python 494 50 Updated Sep 2, 2024
Next