Skip to content
View pkmital's full-sized avatar

Highlights

  • Pro

Block or report pkmital

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels with Hunyuan3D World Model

Python 2,531 216 Updated Dec 3, 2025

From Images to High-Fidelity 3D Assets with Production-Ready PBR Material

Python 2,545 343 Updated Oct 17, 2025

Python's missing "algorave" module. Live code music with Python using MIDI, OSC and/or SuperCollider.

Python 285 39 Updated May 19, 2025

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 7,646 697 Updated Dec 10, 2025

Silero Models: pre-trained text-to-speech models made embarrassingly simple

Jupyter Notebook 5,662 358 Updated Dec 5, 2025

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 2,046 260 Updated Dec 15, 2025

Muzic: Music Understanding and Generation with Artificial Intelligence

Python 4,887 496 Updated Oct 12, 2024

Instant voice cloning by MIT and MyShell. Audio foundation model.

Python 35,641 3,966 Updated Apr 19, 2025

A multi-voice TTS system trained with an emphasis on quality

Jupyter Notebook 14,742 2,049 Updated Nov 19, 2024

[WIP] VoiceSmith makes training text to speech models easy.

Python 228 33 Updated Oct 10, 2022

A repository for storing models that have been inter-converted between various frameworks. Supported frameworks are TensorFlow, PyTorch, ONNX, OpenVINO, TFJS, TFTRT, TensorFlowLite (Float32/16/INT8…

Python 4,011 623 Updated Dec 10, 2025

Official implementation of "Separate Anything You Describe"

Python 1,853 140 Updated Nov 26, 2024

A new timeline addon for openframeworks.

C++ 45 3 Updated Jun 27, 2024

loaf: lua, osc, and openFrameworks

C++ 56 5 Updated Aug 22, 2025

Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors

Jupyter Notebook 14,065 4,405 Updated Aug 19, 2024

Share, discover, and collect prompts from the community. Free and open source — self-host for your organization with complete privacy.

TypeScript 139,594 18,509 Updated Dec 17, 2025

Let us control diffusion models!

Python 33,440 2,995 Updated Feb 25, 2024

Running large language models on a single GPU for throughput-oriented scenarios.

Python 9,382 588 Updated Oct 28, 2024

Collection of audio-focused loss functions in PyTorch

Python 829 72 Updated Jul 30, 2024

Tracking states of the arts and recent results (bibliography) on sound tasks.

32 2 Updated Jan 10, 2023

The “Quite OK Audio Format” for fast, lossy audio compression

C 899 51 Updated Dec 3, 2025

This toolbox aims to unify audio generation model evaluation for easier comparison.

Python 367 37 Updated Sep 29, 2024

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 32,102 6,612 Updated Dec 17, 2025

ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)

Python 3,738 462 Updated Oct 14, 2025

A novel diffusion-based model for synthesizing long-context, high-fidelity music efficiently.

Python 195 10 Updated Apr 27, 2023

Audio generation using diffusion models, in PyTorch.

Python 2,089 178 Updated Jun 12, 2023

A collection of pre-trained audio models, in PyTorch.

Python 114 4 Updated Jan 27, 2023

A collection of resources and papers on Diffusion Models

HTML 12,203 1,009 Updated Aug 1, 2024

"Automatic Language-Agnostic Subtitle Synchronization"

Rust 1,272 63 Updated Dec 28, 2023

Wavelet scattering transforms in Python with GPU acceleration

Python 819 139 Updated Jan 28, 2025
Next