Skip to content
View rogue-yogi's full-sized avatar

Organizations

@synchronicityAI

Block or report rogue-yogi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
64 stars written in Python
Clear filter

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 54,739 5,985 Updated Dec 30, 2025

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.

Python 51,610 4,264 Updated Feb 4, 2026

Universal memory layer for AI Agents

Python 46,619 5,120 Updated Feb 3, 2026

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 44,456 5,949 Updated Aug 16, 2024

The uncompromising Python code formatter

Python 41,349 2,718 Updated Jan 31, 2026

Instant voice cloning by MIT and MyShell. Audio foundation model.

Python 35,899 4,002 Updated Apr 19, 2025

Easily train a good VC model with voice data <= 10 mins!

Python 34,291 4,878 Updated Nov 24, 2024

Python packaging and dependency management made easy

Python 34,180 2,394 Updated Feb 1, 2026

one-click face swap

Python 30,506 6,913 Updated Aug 19, 2024

Industry leading face manipulation platform

Python 26,644 4,275 Updated Jan 29, 2026

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 24,425 2,723 Updated Aug 12, 2024

The open source developer platform to build AI agents and models with confidence. Enhance your AI applications with end-to-end tracking, observability, and evaluations, all in one integrated platform.

Python 23,985 5,233 Updated Feb 5, 2026

Prefect is a workflow orchestration framework for building resilient data pipelines in Python.

Python 21,542 2,095 Updated Feb 5, 2026

Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.

Python 20,890 2,226 Updated Mar 11, 2025

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

Python 18,484 2,281 Updated Dec 2, 2025

[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer

Python 17,789 3,696 Updated Nov 18, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 15,277 2,366 Updated Dec 15, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 14,049 1,680 Updated Dec 17, 2025

This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs

Python 12,814 2,777 Updated Jun 22, 2025

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 11,689 1,186 Updated Nov 21, 2025

Open Source framework for voice and multimodal conversational AI

Python 10,183 1,693 Updated Feb 5, 2026

Code for the paper Hybrid Spectrogram and Waveform Source Separation

Python 9,720 1,411 Updated Apr 24, 2024

Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, 20+ clouds, or on-prem).

Python 9,424 941 Updated Feb 5, 2026

Official repository for LTX-Video

Python 9,229 865 Updated Jan 5, 2026

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

Python 8,641 1,122 Updated Sep 14, 2024

Open Source Image and Video Restoration Toolbox for Super-resolution, Denoise, Deblurring, etc. Currently, it includes EDSR, RCAN, SRResNet, SRGAN, ESRGAN, EDVR, BasicVSR, SwinIR, ECBSR, etc. Also …

Python 8,103 1,390 Updated Jul 21, 2024

Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"

Python 7,042 488 Updated Mar 18, 2025

Python datetimes made easy

Python 6,615 414 Updated Jan 30, 2026

[ICLR 2026] RF-DETR is a real-time object detection and segmentation model architecture developed by Roboflow, SOTA on COCO, designed for fine-tuning.

Python 5,493 634 Updated Feb 5, 2026

Taming Stable Diffusion for Lip Sync!

Python 5,399 872 Updated Jun 20, 2025
Next