Streaming ASR and TTS based on FastAPI + sherpa-onnx

Python 216 30 Updated Nov 2, 2025

Sesame CSM 1B Voice Cloning

Python 337 44 Updated Mar 15, 2025

A high-quality rapid TTS voice cloning model that reaches speeds of 150x realtime.

Python 4,214 544 Updated Jun 5, 2026

Voice Activity Detector (VAD): low-latency, high-performance and lightweight

C 2,165 169 Updated Feb 2, 2026

Generate audiobooks from e-books, voice cloning & 1158+ languages!

Python 19,319 1,605 Updated Jun 19, 2026

CVPR 2023-2024 Papers: Dive into advanced research presented at the leading computer vision conference. Keep up to date with the latest developments in computer vision and deep learning. Code inclu…

Python 451 28 Updated Jul 15, 2024

EMNLP 2023 Papers: Explore cutting-edge research from EMNLP 2023, the premier conference for advancing empirical methods in natural language processing. Stay updated on the latest in machine learni…

Python 111 9 Updated May 18, 2024

INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. Explore the latest advances in speech and language processin…

686 43 Updated Dec 25, 2024

The repository provides links to collections of influential and interesting research papers from top AI conferences, with open-source code to promote reproducibility and provide detailed implementa…

Python 118 5 Updated Oct 24, 2025

ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal proce…

Python 525 23 Updated May 5, 2025

ICCV 2023-2025 Papers: Discover cutting-edge research from ICCV 2023-25, the leading computer vision conference. Stay updated on the latest in computer vision and deep learning, with code included.…

Python 968 49 Updated Nov 7, 2025

Solutions for Leetcode

Python 1 Updated Feb 11, 2023

Code and slides of my YouTube series called "Audio Signal Processing for Machine Learning"

Jupyter Notebook 1,367 450 Updated Feb 8, 2026

I will keep updating this repository with content and materials for learning machine learning and statistics with Python

Jupyter Notebook 60 68 Updated Nov 22, 2020

A day-to-day plan for this challenge. Covers both theoretical and practical aspects

Jupyter Notebook 224 114 Updated Feb 6, 2023

My collection of must-read tutorial notebooks. It covers PyTorch, advanced pandas, ensemble learning, TensorFlow, genetic algorithms, Dask, and word embeddings

Jupyter Notebook 29 11 Updated May 31, 2019

My notebooks on using Python with Jupyter Notebook, PySpark, etc.

Jupyter Notebook 11 7 Updated Aug 25, 2021

Text and code for the second edition of Think Bayes, by Allen Downey.

Jupyter Notebook 2,049 1,553 Updated Jun 4, 2026

🎓 Path to a free self-taught education in Computer Science!

HTML 205,131 25,490 Updated Apr 21, 2026

Detects lips, recognizes sentences, and shows subtitles.

Jupyter Notebook 1 5 Updated Jun 11, 2022

Docker container with interesting tools to work with audio-visual data in pytorch

Dockerfile 2 Updated Feb 14, 2023

A self-supervised learning framework for audio-visual speech

Python 988 161 Updated Dec 7, 2023

Applied Deep Learning Course

3,545 743 Updated Jan 28, 2023

My notes / works on deep learning from Coursera

Jupyter Notebook 472 363 Updated May 8, 2024

Hands-On Computer Vision with TensorFlow 2, published by Packt

Jupyter Notebook 381 267 Updated Apr 29, 2023

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 66,976 6,715 Updated Jan 22, 2026

Visual Speech Recognition for Multiple Languages

Python 477 80 Updated Aug 17, 2023

A pipeline to read lips and generate speech for the read content, i.e., lip-to-speech synthesis.

Python 93 22 Updated Jul 23, 2025
C++ 1 Updated Aug 12, 2021