Skip to content
View kerlomz's full-sized avatar

Block or report kerlomz

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
70 stars written in Python
Clear filter

CTPN + DenseNet + CTC based end-to-end Chinese OCR implemented using tensorflow and keras

Python 2,786 1,073 Updated Oct 8, 2019

复现大模型相关算法及一些学习记录

Python 2,480 339 Updated Nov 6, 2025

Code for BLT research paper

Python 2,006 180 Updated Nov 3, 2025

Physical Symbolic Optimization

Python 1,917 259 Updated Sep 10, 2025

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" on Object Detection and Instance Segmentation.

Python 1,898 388 Updated Apr 9, 2023

基于Frida的脱壳工具

Python 1,514 332 Updated Jun 11, 2025

Reproduce MTCNN using Tensorflow

Python 1,507 706 Updated Dec 16, 2019

Generate text images for training deep learning ocr model

Python 1,452 388 Updated Jan 17, 2022

PyTorch implementation of Multi-Label Image Recognition with Graph Convolutional Networks, CVPR 2019.

Python 1,451 352 Updated Dec 31, 2021

Research Framework for easy and efficient training of GANs based on Pytorch

Python 1,425 168 Updated Oct 23, 2022

🛠️ 哔哩哔哩(B站)辅助工具箱,支持Cookie/Token/Password融合持久化登录与多用户操作

Python 1,298 160 Updated Apr 28, 2021

Deep Image Matting

Python 985 262 Updated Aug 20, 2019

Scaling Up Your Kernels to 31x31: Revisiting Large Kernel Design in CNNs (CVPR 2022)

Python 928 90 Updated Apr 24, 2024

⚡ Based on Yolo's low-power, ultra-lightweight universal target detection algorithm, the parameter is only 250k, and the speed of the smart phone mobile terminal can reach ~300fps+

Python 906 203 Updated Jan 17, 2024

The implementation of various lightweight networks by using PyTorch. such as:MobileNetV2,MobileNeXt,GhostNet,ParNet,MobileViT、AdderNet,ShuffleNetV1-V2,LCNet,ConvNeXt,etc. ⭐⭐⭐⭐⭐

Python 899 163 Updated May 16, 2022

⚡ A newly designed ultra lightweight anchor free target detection algorithm, weight only 250K parameters, reduces the time consumption by 10% compared with yolo-fastest, and the post-processing is …

Python 837 148 Updated Mar 16, 2023

[验证码识别-部署] This project is based on CNN+BLSTM+CTC to realize verificationtion. This projeccode identificat is only for deployment models.

Python 680 237 Updated Nov 21, 2022

All-in-one Toolbox for Computer Vision Research.

Python 658 76 Updated Mar 10, 2023

A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning

Python 634 214 Updated Aug 30, 2021

Code for the paper "DewarpNet: Single-Image Document Unwarping With Stacked 3D and 2D Regression Networks" (ICCV '19)

Python 579 106 Updated Nov 10, 2024

Code release for Best-of-N Jailbreaking

Python 543 90 Updated Feb 5, 2025

Official implementation of paper "Query2Label: A Simple Transformer Way to Multi-Label Classification".

Python 452 69 Updated Mar 18, 2022

StyleShot: A SnapShot on Any Style. 一款可以迁移任意风格到任意内容的模型,无需针对图片微调,即能生成高质量的个性风格化图片!

Python 441 37 Updated Jun 30, 2025

Ensembling Off-the-shelf Models for GAN Training (CVPR 2022 Oral)

Python 413 32 Updated Sep 9, 2022

Official TensorFlow code for the paper "Efficient-CapsNet: Capsule Network with Self-Attention Routing".

Python 273 61 Updated Nov 25, 2021

基于MobileNetV2/EfficientNet-b0/... + LSTM + CTC的不定长图像识别训练pytorch框架

Python 203 51 Updated Aug 31, 2020

PyTorch code for "EleGANt: Exquisite and Locally Editable GAN for Makeup Transfer" (ECCV 2022)

Python 178 35 Updated Jul 23, 2022

Code and data for paper: https://arxiv.org/abs/1802.07101

Python 158 26 Updated Apr 7, 2021

pdd (拼多多) 爬虫 js 解密 anti_content 参数解密及全站抓取代码思路实现

Python 152 77 Updated Apr 2, 2019

Youtu-Embedding is an industry-leading, general-purpose text representation model developed by Tencent Youtu Lab.

Python 147 14 Updated Nov 4, 2025