Lists (20)
Sort Name ascending (A-Z)
ADV
Adversarial attack, defence, and training in NLP and CVagent
CodeLLM
GAN
GlotLID
GraphVec
Graph, Node, Community to Vec and LearningGUI
ISSUE_TEMPLATE
LMU Thesis BSc/MSc
MY TOP
My top projects.NLP: BIG corpus
NLP: General
NLP: Hate speach
NLP: LLM
OCR
Poem
RLM
tableprivacy
translation
Used AIM Or Related to AIM
Starred repositories
Toolkit to segment text into sentences or other semantic units in a robust, efficient and adaptable way.
Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.
Collection of swadesh lists in CSV table format with possible connections to Indo European
A list of recent papers about adversarial learning
Curated list of awesome datasets for various table understanding tasks
A list of fonts organized by unicode script
Scene Text Recognition with Permuted Autoregressive Sequence Models (ECCV 2022)
Vision Language Model : tailored for tasks that involve [messy] optical character recognition (ocr), image-to-text conversion, and math problem solving with latex formatting.
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
Open source Farsi OCR, اوسیآر متنباز فارسی
an OCR tool to translate Old Persian cuneiform (Achaemenid language) by AI
This repository contains code for line detection, character detection and recognition on the cuneiform 2d images
[CVPR 2025] A Comprehensive Benchmark for Document Parsing and Evaluation
Source code for the FineTranslations dataset
A curated list of awesome resources and tools for Kurdish language and speech technologies
A Multilingual Keyboard Layout-Based Typo Generator