40 stars written in Python

Robust Speech Recognition via Large-Scale Weak Supervision

Python 96,647 11,926 Updated Dec 15, 2025

real time face swap and one-click video deepfake with only a single image

Python 80,338 11,724 Updated Mar 23, 2026

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 56,130 6,126 Updated Feb 9, 2026

SOTA Open Source TTS

Python 28,806 2,417 Updated Mar 23, 2026

State-of-the-art 2D and 3D Face Analysis Project

Python 28,194 5,958 Updated Mar 18, 2026

Wan: Open and Advanced Large-Scale Video Generative Models

Python 15,683 2,495 Updated Mar 5, 2026

Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation" (CVPR'25 Spotlight).

Python 12,058 1,134 Updated Nov 5, 2025

Spark-TTS Inference Code

Python 10,957 1,169 Updated Apr 9, 2025

Towards Human-Sounding Speech

Python 6,034 512 Updated Dec 5, 2025

A concise but complete full-attention transformer with a set of promising experimental features from various papers

Python 5,805 504 Updated Feb 12, 2026

Taming Stable Diffusion for Lip Sync!

Python 5,526 904 Updated Jun 20, 2025

SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.

Python 5,489 471 Updated May 12, 2025

Unlimited-length talking video generation that supports image-to-video and video-to-video generation

Python 5,122 849 Updated Dec 18, 2025

NanoGPT (124M) in 2 minutes

Python 5,006 685 Updated Mar 17, 2026

[ICCV 2025] Official implementations for paper: VACE: All-in-One Video Creation and Editing

Python 3,706 251 Updated Oct 17, 2025

Official implementation of "Sonic: Shifting Focus to Global Audio Perception in Portrait Animation"

Python 3,212 287 Updated Jan 8, 2026

[CVPR2025 Highlight] Video Generation Foundation Models: https://saiyan-world.github.io/goku/

Python 2,905 312 Updated Feb 19, 2025

Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.

Python 2,893 300 Updated Jan 26, 2026

[NeurIPS 2025] Let Them Talk: Audio-Driven Multi-Person Conversational Video Generation

Python 2,863 480 Updated Dec 18, 2025

[CVPR 2026] PersonaLive! : Expressive Portrait Image Animation for Live Streaming

Python 2,493 337 Updated Mar 5, 2026

[CVPR 2025] MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis

Python 2,129 246 Updated Feb 23, 2026

MLX native implementations of state-of-the-art generative image models

Python 1,931 131 Updated Mar 23, 2026

Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment

Python 1,497 96 Updated Sep 11, 2025

[TMLR] Memory-Guided Diffusion for Expressive Talking Video Generation

Python 1,075 104 Updated Aug 6, 2025

A custom node set for Video Frame Interpolation in ComfyUI.

Python 998 118 Updated Mar 22, 2026

Kandinsky 5.0: A family of diffusion models for Video & Image generation

Python 734 57 Updated Mar 6, 2026

FantasyPortrait: Enhancing Multi-Character Portrait Animation with Expression-Augmented Diffusion Transformers

Python 502 35 Updated Aug 20, 2025

[CVPR 2025 Highlight🌟] Official ComfyUI implementation of "HyperLoRA: Parameter-Efficient Adaptive Generation for Portrait Synthesis"

Python 489 28 Updated Jun 25, 2025