-
-
VLMEvalKit Public
Forked from open-compass/VLMEvalKitOpen-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks
Python Apache License 2.0 UpdatedMay 23, 2025 -
SwinTextSpotterv2 Public
Pytorch re-implementation of Paper: SwinTextSpotter v2: Towards Better Synergy for Scene Text Spotting (IJCV 2025)
-
SwinTextSpotter Public
Pytorch re-implementation of Paper: SwinTextSpotter: Scene Text Spotting via Better Synergy between Text Detection and Text Recognition (CVPR 2022)
-
Monkey Public
Forked from Yuliang-Liu/Monkey【CVPR 2024 Highlight】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models
-
Bridging-Text-Spotting Public
(CVPR 2024) Bridging the Gap Between End-to-End and Two-Step Text Spotting.
-
VimTS Public
Forked from Yuliang-Liu/VimTS[arXiv 2024.19652] VimTS: A Unified Video and Image Text Spotter
GNU General Public License v3.0 UpdatedMay 2, 2024 -
ESTextSpotter Public
(ICCV 2023) ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer
-
Awesome-Chinese-NLP Public
Forked from crownpku/Awesome-Chinese-NLPA curated list of resources for Chinese NLP 中文自然语言处理相关资料
-
E2E-MLT Public
Forked from MichalBusta/E2E-MLTE2E-MLT - an Unconstrained End-to-End Method for Multi-Language Scene Text
C++ MIT License UpdatedMar 31, 2020 -
Awesome-Scene-Text-Recognition Public
Forked from WeihongM/Awesome-Scene-Text-RecognitionA curated list of resources dedicated to scene text localization and recognition
UpdatedApr 9, 2018