Large-Scale Scene Text Dataset for Indic Languages
-
Updated
Dec 6, 2025 - Python
Large-Scale Scene Text Dataset for Indic Languages
Comprehensive Scene Text Recognition Toolkit across 11 Indian Languages
Saving endangered Indian languages with open AI innovation
Code to extract multilingual parallel corpus from Press Information Bureau (PIB) website.
This repository contains annotated corpora developed under the Bhashini project for 4 Indian Languages for Named Entity Recognition Task, and the code for inference of the models fine-tuned on XLM-Roberta architecture.
इंग्रजी ते मराठीचा कोश. English to Marathi thesaurus.
Repo that analyses the frequency of words in Tamil
Dialpad team's submission to the MUCS 2021 workshop
😑This package will help you to detect any profanity in indian langauges
This bot politely suggests Redittors to use the correct denomyn "Marathi" instead of "Maharashtrian".
Speeech Recognition for Indic languages.
Renders text in Indic or any other complex languages accurately in Blender's video sequence editor.
The project proposes a novel design for Disambiguation based chording keyboard for blinds. Results also state that there is an optimum number of words to be included in the corpus which would benefit the users by not increasing his cognitive toll. Empirically, we have found that for the Swarachakra Hindi corpus of 10,000 words, the prediction is…
Anuvad is open-source translation platform for Indian Languages based on Weblate
Lot Of Indic Tweets
Tamil Handwriting Detection through Deep Learning
Unsupervised Transliterator using phonetic features (particularly for Indian languages)
Reddit bot that translates Indian languages to English for convenience
indic2unicode: Converts data in proprietary Indian fonts into Unicode
Add a description, image, and links to the indian-language topic page so that developers can more easily learn about it.
To associate your repository with the indian-language topic, visit your repo's landing page and select "manage topics."