Build software better, together

moziarnj07-sys / doubaoime-asr

🎤 Enable voice recognition for the Doubao input method using Python; ideal for learning and research with a focus on audio processing.

Updated Apr 4, 2026
Python

HemShah2018 / deeplip

Star

🔍 Read lips in videos with an end-to-end deep learning model, enhancing accessibility and transcribing speech from mouth movements effectively.

computer-vision deep-learning tensorflow keras video-processing lstm ctc lip-reading conv3d

Updated Apr 4, 2026
Python

damn8daniel / asr-russian

Star

Russian ASR system from scratch. Conformer encoder, CTC/Attention hybrid decoder, SpecAugment, beam search with LM fusion.

deep-learning pytorch speech-recognition russian asr ctc conformer

Updated Mar 27, 2026
Python

molind / mlx-conformer

Star

NVIDIA Conformer-CTC and Conformer-Transducer (RNN-T) running natively on Apple Silicon via MLX. Loads NeMo checkpoints directly.

macos inference speech-recognition nemo transducer asr mlx ctc conformer fine-tuning rnnt apple-silicon

Updated Mar 23, 2026
Python

GioiaZheng / handwritten-ocr-system

Star

Deep learning-based handwritten OCR system using CNN-RNN-CTC with evaluation via CER/WER metrics.

ocr computer-vision deep-learning pytorch handwritten-text-recognition ctc sequence-modeling

Updated Mar 20, 2026
Python

David366AI / mini-ocr

Star

Very simple ocr based on crnn with traing and inference

ocr lstm gru vgg ctc crnn

Updated Mar 16, 2026
Python

ayutaz / cc-g2pnp

Sponsor

Star

Reimplementation of CC-G2PnP: Streaming Conformer-CTC based Japanese Grapheme-to-Phoneme and Prosody model (arXiv:2602.17157)

text-to-speech streaming japanese pytorch prosody ctc g2p conformer

Updated Apr 1, 2026
Python

githubharald / CTCDecoder

Star

Connectionist Temporal Classification (CTC) decoding algorithms: best path, beam search, lexicon search, prefix search, and token passing. Implemented in Python.

python opencl recurrent-neural-networks speech-recognition beam-search language-model handwriting-recognition ctc loss prefix-search ctc-loss token-passing best-path

Updated Jan 31, 2026
Python

siripuramvinodkumar / hack-quest

Star

CTF writeups and proof files from TCS HackQuest Season 10 cybersecurity challenges.

cryptography forensics cybersecurity writeups ctc hackquest tcs-hackquest

Updated Dec 14, 2025
Python

Thulium is a production-ready Python library for offline handwritten text recognition (HTR) supporting 52+ languages across Latin, Cyrillic, Greek, Arabic, Hebrew, Devanagari, Chinese, Japanese, Korean, and Georgian scripts.

multilingual python nlp ocr research computer-vision deep-learning pytorch transformer optical-character-recognition document-analysis htr handwriting-recognition ctc historical-documents

Updated Dec 12, 2025
Python

abdulvahapmutlu / asrle

Star

ASR-LE is an advanced ASR evaluation + observability toolkit that goes beyond WER: it shows where errors happen in time, estimates streaming p95 first-word latency, generates “worst moments” automatically, and produces reusable artifacts (report.json, timeline bins, moments, etc.)

streaming latency speech-recognition automatic-speech-recognition speech-to-text whisper audio-processing asr ctc error-analysis word-error-rate mlops streamlit faster-whisper word-error-rate-calculator

Updated Nov 28, 2025
Python

zaydabash / deeplip

Star

Deep learning lip-reading model using Conv3D + BiLSTM + CTC architecture. Transcribes speech from mouth region video clips for accessibility applications.

computer-vision deep-learning accessibility tensorflow keras video-processing lstm ctc lip-reading conv3d

Updated Nov 11, 2025
Python

AKBiradar02 / iam_handwritten_model

Star

The goal of this project is to accurately transcribe handwritten English word images into text using a deep learning model. This is achieved through the use of: The IAM Handwriting Dataset is a labeled image corpus CNN + BLSTM + CTC architecture An end-to-end training and inference pipeline

python deep-learning cnn handwriting-recognition ctc blstm