Skip to content
View palonso's full-sized avatar

Block or report palonso

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
69 stars written in Python
Clear filter

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 36,550 5,140 Updated Mar 23, 2026

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 22,063 2,696 Updated Jan 23, 2026

Automatic headphone equalization from frequency responses

Python 15,553 2,536 Updated Jul 20, 2025

Cleanlab's open-source library is the standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

Python 11,387 885 Updated Jan 13, 2026

NumPy & SciPy for GPU

Python 10,862 1,006 Updated Mar 23, 2026

Python library for audio and music analysis

Python 8,284 1,039 Updated Mar 24, 2026

A PyTorch implementation of EfficientNet

Python 8,219 1,539 Updated Apr 8, 2022

The most powerful local music generation model that outperforms most commercial alternatives, supporting Mac, AMD, Intel, and CUDA devices.

Python 8,207 932 Updated Mar 25, 2026

🎮 ⌨ An easy to use tool to change the behaviour of your input devices.

Python 5,483 211 Updated Mar 25, 2026

HeartMuLa Official Repo: The Most Powerful Open-Source Music Generation Model of 2026

Python 4,321 345 Updated Mar 5, 2026

Speech-to-Text-WaveNet : End-to-end sentence level English speech recognition based on DeepMind's WaveNet and tensorflow

Python 4,014 795 Updated Oct 8, 2021

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python 3,927 355 Updated Jan 4, 2024

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Python 3,826 280 Updated Feb 13, 2025

PyTorch implementation of "Supervised Contrastive Learning" (and SimCLR incidentally)

Python 3,421 554 Updated Dec 26, 2023

A collection of themes for kitty terminal 😻

Python 3,057 214 Updated Feb 16, 2024

A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.

Python 3,026 233 Updated Feb 9, 2026

Gin provides a lightweight configuration framework for Python

Python 2,148 118 Updated Jan 14, 2026

Python audio and music signal processing library

Python 1,607 269 Updated Mar 20, 2026

Pretrained EfficientNet, EfficientNet-Lite, MixNet, MobileNetV3 / V2, MNASNet A1 and B1, FBNet, Single-Path NAS

Python 1,584 216 Updated Jun 13, 2024

A Language Server Protocol implementation for Ruff.

Python 1,516 49 Updated Dec 1, 2025

[ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language modeling

Python 1,280 111 Updated Mar 2, 2025

Interact with Jupyter from NeoVim.

Python 1,227 56 Updated Jan 4, 2024

[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)

Python 1,108 188 Updated Jan 5, 2026

Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framwork.

Python 1,093 339 Updated Jun 8, 2024

A lightweight, simple-to-use, RNN wake word listener

Python 960 246 Updated Nov 25, 2023
Python 945 85 Updated Jan 25, 2026

A tool for converting ONNX files to LiteRT/TFLite/TensorFlow, PyTorch native code (nn.Module), TorchScript (.pt), state_dict (.pt), Exported Program (.pt2), and Dynamo ONNX. It also supports direct…

Python 940 97 Updated Mar 24, 2026

cuSignal - RAPIDS Signal Processing Library

Python 739 134 Updated Sep 21, 2023

Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation

Python 731 103 Updated Feb 1, 2026
Next