Skip to content
View hathibelagal-dev's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report hathibelagal-dev

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Flutter makes it easy and fast to build beautiful apps for mobile and beyond

Dart 175,673 30,154 Updated Mar 24, 2026

Robust Speech Recognition via Large-Scale Weak Supervision

Python 96,513 11,917 Updated Dec 15, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 89,123 13,596 Updated Mar 21, 2026

real time face swap and one-click video deepfake with only a single image

Python 80,300 11,716 Updated Mar 23, 2026

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 56,064 6,121 Updated Feb 9, 2026

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 39,057 4,687 Updated Aug 19, 2024

SOTA Open Source TTS

Python 28,712 2,403 Updated Mar 23, 2026

State-of-the-art 2D and 3D Face Analysis Project

Python 28,168 5,955 Updated Mar 18, 2026

🖍 Terminal string styling done right

JavaScript 23,084 935 Updated Jan 27, 2026

Wan: Open and Advanced Large-Scale Video Generative Models

Python 15,665 2,486 Updated Mar 5, 2026

Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation" (CVPR'25 Spotlight).

Python 12,039 1,135 Updated Nov 5, 2025

Spark-TTS Inference Code

Python 10,957 1,170 Updated Apr 9, 2025

Towards Human-Sounding Speech

Python 6,030 512 Updated Dec 5, 2025

A concise but complete full-attention transformer with a set of promising experimental features from various papers

Python 5,804 504 Updated Feb 12, 2026

Taming Stable Diffusion for Lip Sync!

Python 5,522 902 Updated Jun 20, 2025

SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.

Python 5,486 471 Updated May 12, 2025

​​Unlimited-length talking video generation​​ that supports image-to-video and video-to-video generation

Python 5,086 841 Updated Dec 18, 2025

NanoGPT (124M) in 2 minutes

Python 4,980 681 Updated Mar 17, 2026

Examples of ComfyUI workflows

HTML 4,032 1,282 Updated Nov 26, 2025

[ICCV 2025] Official implementations for paper: VACE: All-in-One Video Creation and Editing

Python 3,704 251 Updated Oct 17, 2025

Official implementation of "Sonic: Shifting Focus to Global Audio Perception in Portrait Animation"

Python 3,208 287 Updated Jan 8, 2026

[CVPR2025 Highlight] Video Generation Foundation Models: https://saiyan-world.github.io/goku/

Python 2,904 312 Updated Feb 19, 2025

Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.

Python 2,892 299 Updated Jan 26, 2026

[NeurIPS 2025] Let Them Talk: Audio-Driven Multi-Person Conversational Video Generation

Python 2,858 480 Updated Dec 18, 2025

[CVPR 2026] PersonaLive! : Expressive Portrait Image Animation for Live Streaming

Python 2,482 337 Updated Mar 5, 2026

[CVPR 2025] MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis

Python 2,130 246 Updated Feb 23, 2026

MLX native implementations of state-of-the-art generative image models

Python 1,921 131 Updated Mar 23, 2026

Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment

Python 1,498 95 Updated Sep 11, 2025

[TMLR] Memory-Guided Diffusion for Expressive Talking Video Generation

Python 1,075 104 Updated Aug 6, 2025
Next