kerlomz

kerlomz

445 followers · 11 following

Achievements

Highlights

Developer Program Member

Stars

70 stars written in Python

Clear filter

PaddlePaddle / PaddleOCR

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

Python 62,751 9,247 Updated Nov 5, 2025

ultralytics / yolov5

YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite

Python 55,933 17,301 Updated Nov 2, 2025

xai-org / grok-1

Grok open release

Python 50,555 8,370 Updated Aug 30, 2024

deepspeedai / DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 40,614 4,613 Updated Nov 6, 2025

gradio-app / gradio

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Python 40,415 3,115 Updated Nov 6, 2025

LAION-AI / Open-Assistant

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

Python 37,488 3,300 Updated Aug 17, 2024

babysor / MockingBird

🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 36,740 5,273 Updated Nov 15, 2024

huggingface / pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 35,671 5,058 Updated Nov 6, 2025

junyanz / pytorch-CycleGAN-and-pix2pix

Image-to-Image Translation in PyTorch

Python 24,656 6,537 Updated Aug 6, 2025

fishaudio / fish-speech

SOTA Open Source TTS

Python 23,987 1,957 Updated Nov 3, 2025

mlflow / mlflow

The open source developer platform to build AI/LLM applications and models with confidence. Enhance your AI applications with end-to-end tracking, observability, and evaluations, all in one integra…

Python 22,820 4,958 Updated Nov 6, 2025

bee-san / Ciphey

⚡ Automatically decrypt encryptions without knowing the key or cipher, decode encodings, and crack hashes ⚡

Python 20,143 1,310 Updated Mar 5, 2025

deepseek-ai / DeepSeek-OCR

Contexts Optical Compression

Python 19,643 1,380 Updated Oct 25, 2025

eriklindernoren / PyTorch-GAN

PyTorch implementations of Generative Adversarial Networks.

Python 17,316 4,096 Updated Jun 18, 2024

microsoft / Swin-Transformer

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

Python 15,383 2,190 Updated Jul 24, 2024

facebookresearch / detr

End-to-End Object Detection with Transformers

Python 14,834 2,613 Updated Mar 12, 2024

sml2h3 / ddddocr

带带弟弟通用验证码识别OCR pypi版

Python 12,971 2,134 Updated Jun 9, 2025

pwxcoo / chinese-xinhua

📙 中华新华字典数据库。包括歇后语，成语，词语，汉字。

Python 11,375 2,642 Updated Dec 26, 2023

Megvii-BaseDetection / YOLOX

YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/

Python 10,149 2,402 Updated Jun 8, 2025

espnet / espnet

End-to-End Speech Processing Toolkit

Python 9,565 2,343 Updated Nov 5, 2025

WongKinYiu / yolov9

Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information

Python 9,417 1,593 Updated Aug 9, 2024

AILab-CVC / YOLO-World

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Python 5,977 566 Updated Feb 26, 2025

jpadilla / pyjwt

JSON Web Token implementation in Python

Python 5,512 719 Updated Nov 3, 2025

Shawn-Shan / fawkes

Fawkes, privacy preserving tool against facial recognition systems. More info at https://sandlab.cs.uchicago.edu/fawkes

Python 5,450 493 Updated Aug 2, 2023

Turing-Project / WriteGPT

由图灵的猫开发，基于开源GPT2.0的初代创作型人工智能 | 可扩展、可进化

Python 5,319 899 Updated Mar 31, 2024

kuprel / min-dalle

min(DALL·E) is a fast, minimal port of DALL·E Mini to PyTorch

Python 3,491 252 Updated Apr 28, 2025

POSTECH-CVLab / PyTorch-StudioGAN

StudioGAN is a Pytorch library providing implementations of representative Generative Adversarial Networks (GANs) for conditional/unconditional image generation.

Python 3,475 345 Updated Aug 9, 2024

kerlomz / captcha_trainer

[验证码识别-训练] This project is based on CNN/ResNet/DenseNet+GRU/LSTM+CTC/CrossEntropy to realize verification code identification. This project is only for training the model.

Python 3,162 829 Updated Oct 24, 2022

Tencent / tencent-ml-images

Largest multi-label image database; ResNet-101 model; 80.73% top-1 acc on ImageNet

Python 3,077 514 Updated Apr 20, 2022

tensorflow / lingvo

Lingvo

Python 2,852 452 Updated Oct 29, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

kerlomz

Achievements

Achievements

Highlights

Block or report kerlomz

Stars

PaddlePaddle / PaddleOCR

ultralytics / yolov5

xai-org / grok-1

deepspeedai / DeepSpeed

gradio-app / gradio

LAION-AI / Open-Assistant

babysor / MockingBird

huggingface / pytorch-image-models

junyanz / pytorch-CycleGAN-and-pix2pix

fishaudio / fish-speech

mlflow / mlflow

bee-san / Ciphey

deepseek-ai / DeepSeek-OCR

eriklindernoren / PyTorch-GAN

microsoft / Swin-Transformer

facebookresearch / detr

sml2h3 / ddddocr

pwxcoo / chinese-xinhua

Megvii-BaseDetection / YOLOX

espnet / espnet

WongKinYiu / yolov9

AILab-CVC / YOLO-World

jpadilla / pyjwt

Shawn-Shan / fawkes

Turing-Project / WriteGPT

kuprel / min-dalle

POSTECH-CVLab / PyTorch-StudioGAN

kerlomz / captcha_trainer

Tencent / tencent-ml-images

tensorflow / lingvo