#

pydub

Here are 187 public repositories matching this topic...

space-contributes / arby-audio_3d

Arby Audio delivers cinematic-grade 3D sound experiences with immersive 7.1.4 spatial audio and advanced technology. Enjoy living sound that adapts, bounces, and reacts in real time, bringing games, movies, and music to life with lifelike reflections, precise positioning, and stunning binaural effects.

audio python gaming numpy spatial-audio audio-processing 3d-engine 3d-audio pydub immersive-audio immersive-sim simulative-3d

Updated Dec 18, 2025
HTML

atsu12345 / WhisperAlign-CLI

🗣️ Align audio with text seamlessly on macOS, generating accurate timestamps and subtitles in multiple formats for better accessibility.

python macos cli pytorch subtitles speech-recognition vtt speech-to-text mps whisper asr tqdm forced-alignment pydub ffmpg audio-transcription apple-silicon stable-ts

Updated Dec 18, 2025
Python

sanskrit

rhcad / sanskrit

Sanskrit 梵语（读音、天城体、转写）学习

sanskrit pydub

Updated Dec 17, 2025
HTML

808scriptz / beatstamp

Protect beats by stamping them with a repeating audio tag.

audio python music ffmpeg pydub beatmaking beatmakers producer-tools

Updated Dec 14, 2025
Python

king-tri-ton / VoiceToTextKTBot

Бот для Telegram, который распознаёт голосовые сообщения и возвращает их в виде текста

python speech-recognition telebot telegrambot voice-to-text pydub king-triton

Updated Dec 9, 2025
Python

djleamen / renamer

Utility to rename mp3 files based on speech content

python utility pyaudio ffmpeg mp3 torch util wav speech-recognition openai speech-to-text whisper google-speech-recognition pydub googlespeechapi whisper-ai

Updated Dec 5, 2025
Python

Vaga-666 / tg-voice-ai-image-bot-

Telegram bot: voice-to-text + OpenAI chat replies + image generation + TTS.

python tts speech-recognition openai telegrambot gtts pydub

Updated Dec 4, 2025
Python

delbertina / HelpfulAudioChopper

Python application to split up an audio file and name the parts

audio python audio-processing pydub

Updated Nov 30, 2025
Python

Kikaayy / Slowed-SpeedUp_and_Reverb

Discord bot that turns youtube music into a slowed or reverb version

music discord-bot youtube-dl pydub slowed-reverb-generator speed-up-reverb-generator

Updated Nov 29, 2025
Python

ginesthoii / SecureMaestro

Secure, automated practice tools for musicians — inspired by Vivaldi’s Four Seasons and engineered with AppSec principles. Blending music and security, SecureMaestro offers sandboxed tools for looping, tempo-mapping, and safe performance analysis.

python sandbox youtube-api appsec bandit pydub semgrep tempomap

Updated Nov 26, 2025
Python

voxten / media-utility-app

Application for everyday utilities

python pydub pyqt6 yt-dlp pyqt6-app

Updated Nov 9, 2025
Python

guruprasanth02 / Emotiraga---A-Carnatic-Mood-Harmonizer

An innovative web application that bridges the world of emotions and Carnatic music! This project analyzes your mood through text input and generates personalized Carnatic music compositions to harmonize and uplift your spirits.

javascript css python html flask-application nltk flask-login flask-wtf pydub flask-bcrypt vadersentiment mongodcompass

Updated Nov 9, 2025
Python

Abhiee123 / Conversational-AI-using-LLM

This AI medical assistant listens, sees, and speaks. It uses speech-to-text, vision analysis, and text-to-speech to simulate a doctor-patient consultation.

speech-recognition gradio gtts pydub

Updated Nov 5, 2025
C

Kratugautam99 / Scriptoria-Project

Scriptoria-Project is an AI-powered framework designed for intelligent document parsing, structured data extraction, and dynamic annotation. Built with modularity and performance in mind, it empowers seamless integration with NLP pipelines, making it ideal for research and production environments.

text-to-speech torch speech-to-text protobuf3 pyttsx3 beautifulsoup4 pydub uvicorn pydantic python-dotenv streamlit playwright vosk chromadb google-generativeai ai-review ai-rewrite academic-book-publication

Updated Oct 9, 2025
HTML

suchirmv-1524 / MusicVAD

A multimodal framework that analyzes both audio & facial imagery to detect emotional states via Valence, Arousal & Dominance (VAD) scores, & recommends music aligned with the user’s emotional context. The system bypasses transcription by extracting VAD signals directly from raw inputs & uses emotion-music mappings for personalized recommendations

librosa pydub chromadb audiocraft google-generative-ai speechrecognitionapi

Updated Oct 8, 2025
Jupyter Notebook

Cyriliang / WhisperAlign-CLI

Mac-friendly CLI for speech-to-text with stable-ts (stable_whisper): transcription, forced alignment, SRT/VTT/TXT, CJK-aware wrapping, MPS acceleration.

Updated Sep 17, 2025
Python

ARYANSINGH0611 / PronouncePerfect

PronouncePerfect helps kids improve pronunciation and reading with interactive speech-based activities.

python machine-learning python3 pygame gtts pyttsx3 pydub streamlit

Updated Sep 14, 2025
Python

EbrahimAR / AI-Voice-Cloner-XTTS-v2

A Streamlit web app for AI-powered voice cloning using Coqui XTTS v2. Record or upload reference voices, clone speech in multiple languages, and generate natural audio outputs.

python json text-to-speech deep-learning numpy speech-synthesis pydub voice-cloning ai-voice streamlit tts-model coqui-tts multilingual-tts xtts-v2

Updated Sep 10, 2025
Python

mishraanuraagx / Ad_Break

Local, offline pipeline that finds clean ad-break points via scene cuts + quiet audio, transcribes nearby context with Whisper, and exports JSON/SRT/timeline for ad conditioning.

opencv ffmpeg pytorch matplotlib whisper pydub pyscenedetect keybert

Updated Sep 10, 2025
Python

brunoallison / audio-merge

AWS Lambda function in Python to merge or zip MP3 files stored in Amazon S3, with authentication via x-api-key

python aws-lambda ffmpeg s3 boto3 audio-processing pydub

Updated Sep 3, 2025
Python

Improve this page

Add a description, image, and links to the pydub topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the pydub topic, visit your repo's landing page and select "manage topics."