Stars
mlx image models for Apple Silicon machines
[AAAI 2025] EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
[ICCV 2025] Official Pytorch Implementation of FLOAT: Generative Motion Latent Flow Matching for Audio-driven Talking Portrait.
Official implementation of "Sonic: Shifting Focus to Global Audio Perception in Portrait Animation"
The power of Claude Code / GeminiCLI / CodexCLI + [Gemini / OpenAI / OpenRouter / Azure / Grok / Ollama / Custom Model / All Of The Above] working as one.
An open-source fighting game engine that supports MUGEN resources.
A connector for Claude Desktop to read and search an Obsidian vault.
Wan: Open and Advanced Large-Scale Video Generative Models
HighDoping / Wan2.1-Mac
Forked from bakhti-ai/Wan2.1Wan2.1 for Mac.
A TTS model capable of generating ultra-realistic dialogue in one pass.
OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.
ACE-Step: A Step Towards Music Generation Foundation Model
Lets make video diffusion practical!
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Di♪♪Rhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
A Conversational Speech Generation Model
Stylometric analysis of Satoshi & comparison of Satoshi with 75,000+ authors
BrainPress is a simple NextJS app to self-publish Obsidian vaults. It supports the new canvas files.
Reverse Engineering the Abstraction and Reasoning Corpus
⚡️ Convert, compress, resize, annotate, markup, draw, crop, rotate, flip, align images directly in Obsidian. Drag-resize, rename with variables, batch process. WEBP, JPG, PNG, HEIC, TIF.
A minimal GPU design in Verilog to learn how GPUs work from the ground up
Add Automatic Captions to YouTube Shorts with AI
The AI Podcast Studio: generate podcasts scripts and their audio version with a team of AI workers in a Podcast Studio 🎙️📜
🚀🎬 ShortGPT - Experimental AI framework for youtube shorts / tiktok channel automation
aider is AI pair programming in your terminal
Provides control over RME TotalMix master volume via OSC.