Comprehensive Scene Text Recognition Toolkit across 11 Indian Languages
-
Updated
Dec 6, 2025 - Python
Comprehensive Scene Text Recognition Toolkit across 11 Indian Languages
Tamil Handwriting Detection through Deep Learning
Dialpad team's submission to the MUCS 2021 workshop
इंग्रजी ते मराठीचा कोश. English to Marathi thesaurus.
Code to extract multilingual parallel corpus from Press Information Bureau (PIB) website.
Speeech Recognition for Indic languages.
Unsupervised Transliterator using phonetic features (particularly for Indian languages)
This bot politely suggests Redittors to use the correct denomyn "Marathi" instead of "Maharashtrian".
Large-Scale Scene Text Dataset for Indic Languages
Anuvad is open-source translation platform for Indian Languages based on Weblate
The project proposes a novel design for Disambiguation based chording keyboard for blinds. Results also state that there is an optimum number of words to be included in the corpus which would benefit the users by not increasing his cognitive toll. Empirically, we have found that for the Swarachakra Hindi corpus of 10,000 words, the prediction is…
Renders text in Indic or any other complex languages accurately in Blender's video sequence editor.
This repository contains annotated corpora developed under the Bhashini project for 4 Indian Languages for Named Entity Recognition Task, and the code for inference of the models fine-tuned on XLM-Roberta architecture.
😑This package will help you to detect any profanity in indian langauges
Lot Of Indic Tweets
indic2unicode: Converts data in proprietary Indian fonts into Unicode
Repo that analyses the frequency of words in Tamil
Reddit bot that translates Indian languages to English for convenience
Saving endangered Indian languages with open AI innovation
Add a description, image, and links to the indian-language topic page so that developers can more easily learn about it.
To associate your repository with the indian-language topic, visit your repo's landing page and select "manage topics."