You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Arby Audio delivers cinematic-grade 3D sound experiences with immersive 7.1.4 spatial audio and advanced technology. Enjoy living sound that adapts, bounces, and reacts in real time, bringing games, movies, and music to life with lifelike reflections, precise positioning, and stunning binaural effects.
Secure, automated practice tools for musicians — inspired by Vivaldi’s Four Seasons and engineered with AppSec principles. Blending music and security, SecureMaestro offers sandboxed tools for looping, tempo-mapping, and safe performance analysis.
An innovative web application that bridges the world of emotions and Carnatic music! This project analyzes your mood through text input and generates personalized Carnatic music compositions to harmonize and uplift your spirits.
This AI medical assistant listens, sees, and speaks. It uses speech-to-text, vision analysis, and text-to-speech to simulate a doctor-patient consultation.
Scriptoria-Project is an AI-powered framework designed for intelligent document parsing, structured data extraction, and dynamic annotation. Built with modularity and performance in mind, it empowers seamless integration with NLP pipelines, making it ideal for research and production environments.
A multimodal framework that analyzes both audio & facial imagery to detect emotional states via Valence, Arousal & Dominance (VAD) scores, & recommends music aligned with the user’s emotional context. The system bypasses transcription by extracting VAD signals directly from raw inputs & uses emotion-music mappings for personalized recommendations
A Streamlit web app for AI-powered voice cloning using Coqui XTTS v2. Record or upload reference voices, clone speech in multiple languages, and generate natural audio outputs.
Local, offline pipeline that finds clean ad-break points via scene cuts + quiet audio, transcribes nearby context with Whisper, and exports JSON/SRT/timeline for ad conditioning.