pkmital

Parag K Mital pkmital

Artist and researcher with 20+ years of experience in AI and computational arts

1.2k followers · 86 following

Stars

Tencent-Hunyuan / HunyuanWorld-1.0

Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels with Hunyuan3D World Model

Python 2,531 216 Updated Dec 3, 2025

Tencent-Hunyuan / Hunyuan3D-2.1

From Images to High-Fidelity 3D Assets with Production-Ready PBR Material

Python 2,545 343 Updated Oct 17, 2025

Bubobubobubobubo / sardine

Python's missing "algorave" module. Live code music with Python using MIDI, OSC and/or SuperCollider.

Python 285 39 Updated May 19, 2025

snakers4 / silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 7,646 697 Updated Dec 10, 2025

snakers4 / silero-models

Silero Models: pre-trained text-to-speech models made embarrassingly simple

Jupyter Notebook 5,662 358 Updated Dec 5, 2025

idiap / coqui-ai-TTS

Forked from coqui-ai/TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 2,046 260 Updated Dec 15, 2025

microsoft / muzic

Muzic: Music Understanding and Generation with Artificial Intelligence

Python 4,887 496 Updated Oct 12, 2024

myshell-ai / OpenVoice

Instant voice cloning by MIT and MyShell. Audio foundation model.

Python 35,641 3,966 Updated Apr 19, 2025

neonbjb / tortoise-tts

A multi-voice TTS system trained with an emphasis on quality

Jupyter Notebook 14,742 2,049 Updated Nov 19, 2024

dunky11 / voicesmith

[WIP] VoiceSmith makes training text to speech models easy.

Python 228 33 Updated Oct 10, 2022

PINTO0309 / PINTO_model_zoo

A repository for storing models that have been inter-converted between various frameworks. Supported frameworks are TensorFlow, PyTorch, ONNX, OpenVINO, TFJS, TFTRT, TensorFlowLite (Float32/16/INT8…

Python 4,011 623 Updated Dec 10, 2025

Audio-AGI / AudioSep

Official implementation of "Separate Anything You Describe"

Python 1,853 140 Updated Nov 26, 2024

roymacdonald / ofxLineaDeTiempo

A new timeline addon for openframeworks.

C++ 45 3 Updated Jun 27, 2024

danomatika / loaf

loaf: lua, osc, and openFrameworks

C++ 56 5 Updated Aug 22, 2025

WongKinYiu / yolov7

Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors

Jupyter Notebook 14,065 4,405 Updated Aug 19, 2024

f / awesome-chatgpt-prompts

Share, discover, and collect prompts from the community. Free and open source — self-host for your organization with complete privacy.

TypeScript 139,594 18,509 Updated Dec 17, 2025

lllyasviel / ControlNet

Let us control diffusion models!

Python 33,440 2,995 Updated Feb 25, 2024

FMInference / FlexLLMGen

Running large language models on a single GPU for throughput-oriented scenarios.

Python 9,382 588 Updated Oct 28, 2024

csteinmetz1 / auraloss

Collection of audio-focused loss functions in PyTorch

Python 829 72 Updated Jul 30, 2024

soham97 / sound_ai_progress

Tracking the state of the art and recent results (bibliography) on sound tasks.

32 2 Updated Jan 10, 2023

phoboslab / qoa

The “Quite OK Audio Format” for fast, lossy audio compression

C 899 51 Updated Dec 3, 2025

haoheliu / audioldm_eval

This toolbox aims to unify audio generation model evaluation for easier comparison.

Python 367 37 Updated Sep 29, 2024

huggingface / diffusers

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 32,102 6,612 Updated Dec 17, 2025

stanford-futuredata / ColBERT

ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)

Python 3,738 462 Updated Oct 14, 2025

Kinyugo / msanii

A novel diffusion-based model for synthesizing long-context, high-fidelity music efficiently.

Python 195 10 Updated Apr 27, 2023

archinetai / audio-diffusion-pytorch

Audio generation using diffusion models, in PyTorch.

Python 2,089 178 Updated Jun 12, 2023

archinetai / archisound

A collection of pre-trained audio models, in PyTorch.

Python 114 4 Updated Jan 27, 2023

diff-usion / Awesome-Diffusion-Models

A collection of resources and papers on Diffusion Models

HTML 12,203 1,009 Updated Aug 1, 2024

kaegi / alass

"Automatic Language-Agnostic Subtitle Synchronization"

Rust 1,272 63 Updated Dec 28, 2023

kymatio / kymatio

Wavelet scattering transforms in Python with GPU acceleration

Python 819 139 Updated Jan 28, 2025
