Skip to content
View pkmital's full-sized avatar

Highlights

  • Pro

Block or report pkmital

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
189 results for source starred repositories
Clear filter

Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels with Hunyuan3D World Model

Python 2,532 216 Updated Dec 17, 2025

From Images to High-Fidelity 3D Assets with Production-Ready PBR Material

Python 2,545 343 Updated Oct 17, 2025

Python's missing "algorave" module. Live code music with Python using MIDI, OSC and/or SuperCollider.

Python 285 39 Updated May 19, 2025

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 7,648 697 Updated Dec 10, 2025

Silero Models: pre-trained text-to-speech models made embarrassingly simple

Jupyter Notebook 5,663 358 Updated Dec 5, 2025

Muzic: Music Understanding and Generation with Artificial Intelligence

Python 4,887 496 Updated Oct 12, 2024

Instant voice cloning by MIT and MyShell. Audio foundation model.

Python 35,644 3,966 Updated Apr 19, 2025

A multi-voice TTS system trained with an emphasis on quality

Jupyter Notebook 14,742 2,049 Updated Nov 19, 2024

[WIP] VoiceSmith makes training text to speech models easy.

Python 228 33 Updated Oct 10, 2022

A repository for storing models that have been inter-converted between various frameworks. Supported frameworks are TensorFlow, PyTorch, ONNX, OpenVINO, TFJS, TFTRT, TensorFlowLite (Float32/16/INT8…

Python 4,012 623 Updated Dec 10, 2025

Official implementation of "Separate Anything You Describe"

Python 1,853 140 Updated Nov 26, 2024

A new timeline addon for openframeworks.

C++ 45 3 Updated Jun 27, 2024

loaf: lua, osc, and openFrameworks

C++ 56 5 Updated Aug 22, 2025

Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors

Jupyter Notebook 14,066 4,405 Updated Aug 19, 2024

Share, discover, and collect prompts from the community. Free and open source — self-host for your organization with complete privacy.

TypeScript 139,600 18,511 Updated Dec 17, 2025

Let us control diffusion models!

Python 33,440 2,995 Updated Feb 25, 2024

Collection of audio-focused loss functions in PyTorch

Python 829 72 Updated Jul 30, 2024

Tracking states of the arts and recent results (bibliography) on sound tasks.

32 2 Updated Jan 10, 2023

The “Quite OK Audio Format” for fast, lossy audio compression

C 899 51 Updated Dec 3, 2025

This toolbox aims to unify audio generation model evaluation for easier comparison.

Python 367 37 Updated Sep 29, 2024

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 32,103 6,612 Updated Dec 17, 2025

ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)

Python 3,738 462 Updated Oct 14, 2025

A novel diffusion-based model for synthesizing long-context, high-fidelity music efficiently.

Python 195 10 Updated Apr 27, 2023

Audio generation using diffusion models, in PyTorch.

Python 2,089 178 Updated Jun 12, 2023

A collection of pre-trained audio models, in PyTorch.

Python 114 4 Updated Jan 27, 2023

A collection of resources and papers on Diffusion Models

HTML 12,203 1,010 Updated Aug 1, 2024

"Automatic Language-Agnostic Subtitle Synchronization"

Rust 1,272 63 Updated Dec 28, 2023

Wavelet scattering transforms in Python with GPU acceleration

Python 819 139 Updated Jan 28, 2025

Trainer for audio-diffusion-pytorch

Python 130 22 Updated Jan 13, 2023

A generative network for animal vocalizations. For dimensionality reduction, sequencing, clustering, corpus-building, and generating novel 'stimulus spaces'. All with notebook examples using freely…

Jupyter Notebook 70 21 Updated Dec 27, 2022
Next