- Los Angeles, CA
- https://pkmital.com
- @pkmital
Highlights
- Pro
Stars
Google Research
A multi-voice TTS system trained with an emphasis on quality
Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K…
Silero Models: pre-trained text-to-speech models made embarrassingly simple
Efficient Image Captioning code in Torch, runs on GPU
Try out deep learning models online on Google Colab
A small C++ implementation of LSTM networks, focused on OCR.
Speech Recognition with the Caffe deep learning framework, migrating to
Audio and Music Analysis and Synthesis in Python
A generative network for animal vocalizations. For dimensionality reduction, sequencing, clustering, corpus-building, and generating novel 'stimulus spaces'. All with notebook examples using freely…
Soundscape Ecology Toolkit