Parakeet ASR Demo

A Streamlit demo for speech recognition using NVIDIA's Parakeet TDT model.

Features

🎙️ Speech-to-text transcription using the NVIDIA Parakeet TDT model
📊 Real-time transcription with progress tracking
📝 Support for multiple audio formats (WAV, FLAC)
📈 Transcription history with export options
🎯 Optimized for both short and long audio files
💻 GPU acceleration support with fallback to CPU
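The GPU-with-CPU-fallback behaviour listed above can be sketched roughly as follows. This is a minimal illustration, not the code in app.py: the function names `pick_device` and `detect_cuda` are hypothetical, and the sketch assumes the model runs on PyTorch, which this README does not state.

```python
import importlib.util


def pick_device(cuda_available: bool) -> str:
    """Return "cuda" when a CUDA GPU is usable, otherwise fall back to "cpu"."""
    return "cuda" if cuda_available else "cpu"


def detect_cuda() -> bool:
    """Report CUDA availability, treating a missing PyTorch install as "no GPU".

    Assumption: PyTorch is the backend. This README does not confirm that.
    """
    if importlib.util.find_spec("torch") is None:
        return False
    import torch  # imported lazily so the sketch also runs without PyTorch

    return torch.cuda.is_available()


if __name__ == "__main__":
    print(f"Running on: {pick_device(detect_cuda())}")
```

Keeping the decision in `pick_device` separate from the hardware probe makes the fallback easy to test on machines without a GPU.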

Setup

Install dependencies:

pip install -r requirements.txt

Run the application:

streamlit run app.py

Requirements

Python 3.8+
NVIDIA GPU with CUDA support (strongly recommended for optimal performance)
FFmpeg (for audio processing)
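Before launching the app, it may help to confirm the Python and FFmpeg requirements above. The check below is a small standard-library sketch; `check_environment` is a hypothetical helper and is not part of this repository. It does not probe for a GPU, since that depends on the installed backend.

```python
import shutil
import sys


def check_environment(min_python=(3, 8)):
    """Return a list of human-readable problems; an empty list means all checks passed."""
    problems = []
    # Compare the running interpreter against the minimum version from Requirements.
    if sys.version_info < min_python:
        problems.append(f"Python {min_python[0]}.{min_python[1]}+ is required")
    # shutil.which returns None when the executable is not on PATH.
    if shutil.which("ffmpeg") is None:
        problems.append("FFmpeg was not found on PATH")
    return problems


if __name__ == "__main__":
    for problem in check_environment():
        print(f"WARNING: {problem}")
```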

Usage

Upload an audio file or record directly in the browser
Wait for the model to process and transcribe
View and export transcription results
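As an illustration of the upload step, a file name could be checked against the formats listed under Features (WAV, FLAC) before transcription starts. This is only a sketch: `is_supported_audio` and `SUPPORTED_EXTENSIONS` are hypothetical names, and the app itself may validate uploads differently.

```python
from pathlib import Path

# Formats named in the Features list; the extension set is an assumption.
SUPPORTED_EXTENSIONS = {".wav", ".flac"}


def is_supported_audio(filename: str) -> bool:
    """Return True when the file extension matches a supported format (case-insensitive)."""
    return Path(filename).suffix.lower() in SUPPORTED_EXTENSIONS
```

An extension check is only a first filter; it does not confirm that the file contents are valid audio.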

Notes

Without a CUDA-capable NVIDIA GPU, the app falls back to CPU, which is slower
Audio files longer than 8 minutes automatically use optimized settings
Maximum recommended audio duration is 30 minutes
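The duration thresholds in these notes can be expressed as a small classifier. The sketch below uses only the standard-library `wave` module, so it covers WAV input only. The names `wav_duration_seconds` and `classify_duration` and the returned labels are hypothetical, and what the "optimized settings" actually change is not described in this README.

```python
import wave

LONG_AUDIO_SECONDS = 8 * 60        # above this, long-audio settings apply
MAX_RECOMMENDED_SECONDS = 30 * 60  # recommended upper bound from the notes


def wav_duration_seconds(path_or_file) -> float:
    """Return the duration of a WAV file given a str path or a binary file object."""
    with wave.open(path_or_file, "rb") as wav:
        return wav.getnframes() / wav.getframerate()


def classify_duration(seconds: float) -> str:
    """Map a duration onto the categories described in the notes."""
    if seconds > MAX_RECOMMENDED_SECONDS:
        return "over_recommended_limit"
    if seconds > LONG_AUDIO_SECONDS:
        return "long"
    return "standard"
```

For example, `classify_duration(wav_duration_seconds("meeting.wav"))` would return `"long"` for a 12-minute recording, where `meeting.wav` is a placeholder file name.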

SridharSampath/parakeet-asr-demo
