🦛 CHONK docs with Chonkie ✨ — The lightweight ingestion library for fast, efficient and robust RAG pipelines

Python 3,393 217 Updated Dec 22, 2025

Optimus is a flexible and scalable framework built to train language models efficiently across diverse hardware configurations, including CPU, AMD, and NVIDIA GPUs.

Python 67 4 Updated Dec 4, 2025

A library for minimum Bayes risk (MBR) decoding

Python 50 6 Updated Nov 2, 2025

Additional resources from our AACL tutorial

10 1 Updated Nov 13, 2023

The central repo for Creole-based NLU and NLG work

HTML 18 5 Updated May 2, 2025

Chitralekha - A video transcreation platform for Indic languages, supporting transcription, translation and voice-over

111 25 Updated Oct 31, 2025

Translation models for 22 scheduled languages of India

Python 387 103 Updated Oct 3, 2025

Whisper realtime streaming for long speech-to-text transcription and translation

Python 3,491 411 Updated Nov 12, 2025

Various transformers for FSDP research

Jupyter Notebook 38 5 Updated Nov 11, 2022

Accessible large language models via k-bit quantization for PyTorch.

Python 7,844 804 Updated Dec 12, 2025

PyScript is an open source platform for Python in the browser. Try PyScript: https://pyscript.com Examples: https://tinyurl.com/pyscript-examples Community: https://discord.gg/HxvBtukrg2

Python 18,651 1,487 Updated Dec 15, 2025

[EACL'23] MCoNaLa: A Benchmark for Code Generation from Multiple Natural Languages

Python 23 2 Updated Feb 13, 2023

Pre-trained, multilingual sequence-to-sequence models for Indian languages

Python 51 5 Updated Jul 20, 2022

Yet Another Neural Machine Translation Toolkit

Python 180 31 Updated Mar 7, 2025
Python 16 5 Updated Aug 23, 2022

Repository for the English-Hindi Codemixed to Monolingual English Parallel Corpus

13 3 Updated Feb 17, 2019

The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior in an extensible and framework agnostic interface.

TypeScript 3,618 371 Updated Dec 5, 2025

A parallel evaluation data set of SAP software documentation with document structure annotation

Mathematica 14 5 Updated Jul 30, 2025

An open-source Python framework for hybrid quantum-classical machine learning.

Python 2,067 643 Updated Dec 19, 2025

This project is a real-time visualization of a network recognizing digits from the user's input.

Processing 589 68 Updated Dec 30, 2019

PyTorch implementation of GAN-based text-to-speech synthesis and voice conversion (VC)

Jupyter Notebook 517 117 Updated Nov 1, 2020

Japanese--Russian--English News Commentary Parallel Data

Ruby 18 3 Updated Jul 9, 2019

Pun-GAN: Generative Adversarial Network for Pun Generation (EMNLP 2019)

Python 42 7 Updated Aug 19, 2019

Making Art with Deep Learning Workshop | ML@B

Jupyter Notebook 26 12 Updated Feb 22, 2018

Curated Collection of BCI resources

1,386 269 Updated Sep 7, 2025

A project to visualize global weather conditions

JavaScript 6,472 1,238 Updated Oct 1, 2022

Xlit-Crowd: Hindi-English Transliteration Corpus

38 39 Updated Feb 17, 2015