70 stars written in Python

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

Python 62,751 9,247 Updated Nov 5, 2025

YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite

Python 55,933 17,301 Updated Nov 2, 2025

Grok open release

Python 50,555 8,370 Updated Aug 30, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 40,614 4,613 Updated Nov 6, 2025

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Python 40,415 3,115 Updated Nov 6, 2025

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and can retrieve information dynamically to do so.

Python 37,488 3,300 Updated Aug 17, 2024

🚀 AI voice cloning: clone a voice in 5 seconds to generate arbitrary speech in real time

Python 36,740 5,273 Updated Nov 15, 2024

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 35,671 5,058 Updated Nov 6, 2025

Image-to-Image Translation in PyTorch

Python 24,656 6,537 Updated Aug 6, 2025

SOTA Open Source TTS

Python 23,987 1,957 Updated Nov 3, 2025

The open source developer platform to build AI/LLM applications and models with confidence. Enhance your AI applications with end-to-end tracking, observability, and evaluations, all in one integra…

Python 22,820 4,958 Updated Nov 6, 2025

⚡ Automatically decrypt encryptions without knowing the key or cipher, decode encodings, and crack hashes ⚡

Python 20,143 1,310 Updated Mar 5, 2025

Contexts Optical Compression

Python 19,643 1,380 Updated Oct 25, 2025

PyTorch implementations of Generative Adversarial Networks.

Python 17,316 4,096 Updated Jun 18, 2024

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

Python 15,383 2,190 Updated Jul 24, 2024

End-to-End Object Detection with Transformers

Python 14,834 2,613 Updated Mar 12, 2024

带带弟弟: general-purpose CAPTCHA recognition OCR, PyPI version

Python 12,971 2,134 Updated Jun 9, 2025

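📙 Chinese Xinhua Dictionary database, including xiehouyu (two-part allegorical sayings), idioms, words, and Chinese characters.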

Python 11,375 2,642 Updated Dec 26, 2023

YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/

Python 10,149 2,402 Updated Jun 8, 2025

End-to-End Speech Processing Toolkit

Python 9,565 2,343 Updated Nov 5, 2025

Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information

Python 9,417 1,593 Updated Aug 9, 2024

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Python 5,977 566 Updated Feb 26, 2025

JSON Web Token implementation in Python

Python 5,512 719 Updated Nov 3, 2025

Fawkes, privacy preserving tool against facial recognition systems. More info at https://sandlab.cs.uchicago.edu/fawkes

Python 5,450 493 Updated Aug 2, 2023

A first-generation creative AI developed by 图灵的猫 (Turing's Cat), based on open-source GPT2.0 | Extensible and evolvable

Python 5,319 899 Updated Mar 31, 2024

min(DALL·E) is a fast, minimal port of DALL·E Mini to PyTorch

Python 3,491 252 Updated Apr 28, 2025

StudioGAN is a PyTorch library providing implementations of representative Generative Adversarial Networks (GANs) for conditional/unconditional image generation.

Python 3,475 345 Updated Aug 9, 2024

[CAPTCHA recognition: training] This project uses CNN/ResNet/DenseNet + GRU/LSTM + CTC/CrossEntropy for CAPTCHA recognition. It covers model training only.

Python 3,162 829 Updated Oct 24, 2022

Largest multi-label image database; ResNet-101 model; 80.73% top-1 acc on ImageNet

Python 3,077 514 Updated Apr 20, 2022

Lingvo

Python 2,852 452 Updated Oct 29, 2025