pkmital

Parag K Mital pkmital

Artist and researcher with 20+ years experience in AI and computational arts

1.2k followers · 86 following

Stars

189 results for source starred repositories

Tencent-Hunyuan / HunyuanWorld-1.0

Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels with Hunyuan3D World Model

Python 2,532 216 Updated Dec 17, 2025

Tencent-Hunyuan / Hunyuan3D-2.1

From Images to High-Fidelity 3D Assets with Production-Ready PBR Material

Python 2,545 343 Updated Oct 17, 2025

Bubobubobubobubo / sardine

Python's missing "algorave" module. Live code music with Python using MIDI, OSC and/or SuperCollider.

Python 285 39 Updated May 19, 2025

snakers4 / silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 7,648 697 Updated Dec 10, 2025

snakers4 / silero-models

Silero Models: pre-trained text-to-speech models made embarrassingly simple

Jupyter Notebook 5,663 358 Updated Dec 5, 2025

microsoft / muzic

Muzic: Music Understanding and Generation with Artificial Intelligence

Python 4,887 496 Updated Oct 12, 2024

myshell-ai / OpenVoice

Instant voice cloning by MIT and MyShell. Audio foundation model.

Python 35,644 3,966 Updated Apr 19, 2025

neonbjb / tortoise-tts

A multi-voice TTS system trained with an emphasis on quality

Jupyter Notebook 14,742 2,049 Updated Nov 19, 2024

dunky11 / voicesmith

[WIP] VoiceSmith makes training text to speech models easy.

Python 228 33 Updated Oct 10, 2022

PINTO0309 / PINTO_model_zoo

A repository for storing models that have been inter-converted between various frameworks. Supported frameworks are TensorFlow, PyTorch, ONNX, OpenVINO, TFJS, TFTRT, TensorFlowLite (Float32/16/INT8…

Python 4,012 623 Updated Dec 10, 2025

Audio-AGI / AudioSep

Official implementation of "Separate Anything You Describe"

Python 1,853 140 Updated Nov 26, 2024

roymacdonald / ofxLineaDeTiempo

A new timeline addon for openframeworks.

C++ 45 3 Updated Jun 27, 2024

danomatika / loaf

loaf: lua, osc, and openFrameworks

C++ 56 5 Updated Aug 22, 2025

WongKinYiu / yolov7

Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors

Jupyter Notebook 14,066 4,405 Updated Aug 19, 2024

f / awesome-chatgpt-prompts

Share, discover, and collect prompts from the community. Free and open source — self-host for your organization with complete privacy.

TypeScript 139,600 18,511 Updated Dec 17, 2025

lllyasviel / ControlNet

Let us control diffusion models!

Python 33,440 2,995 Updated Feb 25, 2024

csteinmetz1 / auraloss

Collection of audio-focused loss functions in PyTorch

Python 829 72 Updated Jul 30, 2024

soham97 / sound_ai_progress

Tracking states of the arts and recent results (bibliography) on sound tasks.

32 2 Updated Jan 10, 2023

phoboslab / qoa

The “Quite OK Audio Format” for fast, lossy audio compression

C 899 51 Updated Dec 3, 2025

haoheliu / audioldm_eval

This toolbox aims to unify audio generation model evaluation for easier comparison.

Python 367 37 Updated Sep 29, 2024

huggingface / diffusers

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 32,103 6,612 Updated Dec 17, 2025

stanford-futuredata / ColBERT

ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)

Python 3,738 462 Updated Oct 14, 2025

Kinyugo / msanii

A novel diffusion-based model for synthesizing long-context, high-fidelity music efficiently.

Python 195 10 Updated Apr 27, 2023

archinetai / audio-diffusion-pytorch

Audio generation using diffusion models, in PyTorch.

Python 2,089 178 Updated Jun 12, 2023

archinetai / archisound

A collection of pre-trained audio models, in PyTorch.

Python 114 4 Updated Jan 27, 2023

diff-usion / Awesome-Diffusion-Models

A collection of resources and papers on Diffusion Models

HTML 12,203 1,010 Updated Aug 1, 2024

kaegi / alass

"Automatic Language-Agnostic Subtitle Synchronization"

Rust 1,272 63 Updated Dec 28, 2023

kymatio / kymatio

Wavelet scattering transforms in Python with GPU acceleration

Python 819 139 Updated Jan 28, 2025

archinetai / audio-diffusion-pytorch-trainer

Trainer for audio-diffusion-pytorch

Python 130 22 Updated Jan 13, 2023

timsainb / AVGN

A generative network for animal vocalizations. For dimensionality reduction, sequencing, clustering, corpus-building, and generating novel 'stimulus spaces'. All with notebook examples using freely…

Jupyter Notebook 70 21 Updated Dec 27, 2022
