Stars
Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.
Suno AI's Bark model in C/C++ for fast text-to-speech generation
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
C0untFloyd / bark-gui
Forked from suno-ai/bark
🔊 Text-Prompted Generative Audio Model with Gradio
JonathanFly / bark
Forked from suno-ai/bark
🚀 BARK INFINITY GUI CMD 🎶 Powered Up Bark Text-prompted Generative Audio Model
🔊 Text-Prompted Generative Audio Model
A fast and lightweight Python-based CTC beam search decoder for speech recognition.
Keras Implementation of Flair's Contextualized Embeddings
Python tools for interacting with Wikidata
Python 3 package supporting efficient storage and querying of sets of sets using the trie data structure. Supports finding all the supersets/subsets of a given set from a collection of sets. Also in…
Automatic extraction of relevant features from time series
Pure AngularJS directive for Google Places Autocomplete
[DEPRECATED] - Cordova plugin to support Universal/Deep Links for iOS/Android.
A powerful cross-platform UI toolkit for building native-quality iOS, Android, and Progressive Web Apps with HTML, CSS, and JavaScript.
Citation parsing to quickly access publications