Babylon.cpp is a C and C++ library for grapheme-to-phoneme conversion and text-to-speech synthesis. Phonemization uses an ONNX Runtime port of the DeepPhonemizer model, and speech synthesis uses VITS models. Piper models are compatible after a conversion script is run.
Job seekers often face challenges in resume optimization, interview prep, and self-assessment. This AI-driven platform helps users refine resumes, practice with tailored interview questions, and receive feedback via mock interviews—boosting confidence and job readiness.
An advanced, AI-powered web application that instantly converts PDF documents into ultra-realistic spoken audiobooks using Microsoft Edge Neural TTS and a premium glassmorphism UI.
A security-hardened fork of the ElevenLabs Obsidian plugin — addressing API key exposure, path traversal, and log leakage risks identified in a full threat model audit.
A functional-programming-based AI pipeline system (still experimental) built on LangChain, applying category theory principles and monadic composition to create robust, self-improving AI agents.
Real-time voice agent pipeline — Twilio to STT to MCP agent with vector retrieval to TTS to Twilio. Latency budget enforcement, barge-in handling, session continuity, Cal.com tool integration. Few clean public references exist for this.