Lists (25)
ADV
Adversarial attack, defence, and training in NLP and CV
agent
ai waifu
audio
CodeLLM
CS
GAN
GlotLID
GraphVec
Graph, Node, Community to Vec and Learning
GUI
Iranian NLP
ISSUE_TEMPLATE
LMU Thesis BSc/MSc
MY TOP
My top projects.
NLP: BIG corpus
NLP: General
NLP: Hate speech
NLP: LLM
OCR
Poem
RLM
tableprivacy
translation
udhr
Used AIM Or Related to AIM
Starred repositories
A Grapheme to Phoneme model using LSTM implemented in pytorch
A tool for translating Persian text to IPA (International Phonetic Alphabet).
Universal Romanizer that can convert any unicode script to roman (latin) script
AI that sees your screen, listens to your conversations and tells you what to do
Repository hosting code for "Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations" (https://arxiv.org/abs/2402.17152).
Inference repo for Falcon-Perception and Falcon-OCR model, early-fusion, natively multimodal, dense Autoregressive Transformer models.
Normalize any language identifier to canonical ISO 639-3 + ISO 15924 form. Powered by LinguaMeta from Google Research
mishig25 / hf-autoresearch
Forked from karpathy/autoresearch
AI agents running research on Hugging Face infra
OCR model that handles complex tables, forms, handwriting with full layout.
AI agents running research on single-GPU nanochat training automatically
OpenLID-v3
A repository for tooling for the community to evaluate open source models.
Multimodal OCR: Parse Anything from Documents
A machine learning software for extracting information from scholarly documents
Font files available from Google Fonts, and a public issue tracker for all things Google Fonts
Qianfan-VL: Domain-Enhanced Universal Vision-Language Models
Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models (CVPR 2024 Highlight)
Tesseract Open Source OCR Engine (main repository)
extract text from any document. no muss. no fuss.
The official repository of "Document Image Machine Translation with Dynamic Multi-pre-trained Models Assembling"
Training code for TabDPT: Scaling Tabular Foundation Models on Real Data
Repository of NUDS/XML files that represent ANS-published digital type corpora