motapinto
💭 Coding


Go ahead and axolotl questions

Python 12,053 1,369 Updated Jun 15, 2026

The ultimate training toolkit for finetuning diffusion models

Python 10,876 1,353 Updated Jun 15, 2026

The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,903 145 Updated Jul 5, 2024

Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.

Python 66,588 5,968 Updated Jun 15, 2026

A fast, local neural text to speech system

C++ 11,102 1,025 Updated Aug 26, 2025

kaldi-asr/kaldi is the official location of the Kaldi project.

Shell 15,413 5,357 Updated Sep 22, 2025

A PyTorch-based Speech Toolkit

Python 11,619 1,699 Updated Jun 15, 2026

A Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab.

Python 2,285 220 Updated Apr 13, 2026

An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engines

Jupyter Notebook 574 76 Updated Aug 27, 2024

SSSegmentation: An Open Source Supervised Semantic Segmentation Toolbox Based on PyTorch.

Python 876 110 Updated Nov 7, 2025

Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"

Python 7,076 496 Updated Mar 18, 2025

⚡️ 10x - Up to 20x faster AI coding with multi-step Superpowers. Open-source agent with smart model routing, BYOK, fully self-hosted.

TypeScript 1,350 113 Updated Jan 5, 2026

Industrial-grade speech recognition toolkit: 170x realtime, 50+ languages, speaker diarization, emotion detection, streaming, and OpenAI-compatible API.

Python 18,038 1,850 Updated Jun 15, 2026

Very low latency speech to text, intent recognition, and text to speech, for building voice agents and interfaces

C 8,466 459 Updated Jun 2, 2026

Open source inference code for Rev's model

Python 436 27 Updated Apr 22, 2025

Whisper with Medusa heads

Python 861 53 Updated Jun 9, 2026

An Open Source text-to-speech system built by inverting Whisper.

Jupyter Notebook 4,614 272 Updated Dec 14, 2025

computer vision and sports

Python 5,062 611 Updated Nov 7, 2025

Kolmogorov Arnold Networks

Jupyter Notebook 16,306 1,563 Updated Jan 19, 2025

Noise suppression using deep filtering

Python 4,324 471 Updated Oct 17, 2024

Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a ca…

Python 1,897 320 Updated Mar 14, 2023

Robust Speech Recognition via Large-Scale Weak Supervision

Python 102,792 12,539 Updated Apr 15, 2026

A playbook for systematically maximizing the performance of deep learning models.

30,187 2,423 Updated Jun 18, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 59,682 10,291 Updated Nov 12, 2025

A collection of libraries to optimise AI model performance

Python 8,339 620 Updated Jul 22, 2024

Spatial Sparse Convolution Library

Python 2,279 418 Updated Dec 15, 2024

A collection of research materials on explainable AI/ML

Markdown 1,649 222 Updated Mar 7, 2026

LaTeX code for making neural network diagrams

TeX 24,819 3,065 Updated Aug 21, 2023

View model summaries in PyTorch!

Python 2,939 137 Updated Jun 15, 2026

PyTorch implementation of Darknet53

Python 118 25 Updated Aug 8, 2021