Stars
The best way to get AI coding agents to solve hard problems in complex codebases.
The AI framework that adds the engineering to prompt engineering (Python/TS/Ruby/Java/C#/Rust/Go compatible)
🦄 ai that works - every tuesday 10 AM PST
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
"RAG-Anything: All-in-One RAG Framework"
Unlimited-length talking video generation that supports image-to-video and video-to-video generation
A TTS model capable of generating ultra-realistic dialogue in one pass.
The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval
real time face swap and one-click video deepfake with only a single image
Python GUI tool for preparing video datasets (LORA, Wan, Hunyuan training). Features range clipping, cropping, FPS conversion & optional Gemini descriptions. (Enhanced refactor of HunyClip).
The official Python SDK for Model Context Protocol servers and clients
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Create images of a given character in different poses
Minimal reproduction of DeepSeek R1-Zero
Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚
The official implementation of CVPR'25 Oral paper "Go-with-the-Flow: Motion-Controllable Video Diffusion Models Using Real-Time Warped Noise"
Tools and methods for detecting and anonymizing Personally Identifiable Information (PII) using AI-driven approaches. This repository includes implementations of fine-tuned models and comparative e…
This repository provides tutorials and implementations for various Generative AI Agent techniques, from basic to advanced. It serves as a comprehensive guide for building intelligent, interactive A…
Explain complex systems using visuals and simple terms. Help you prepare for system design interviews.
An introduction to "Highly Accurate Dichotomous Image Segmentation" technique to achieve premium quality background removal from image.
🔊 Text-Prompted Generative Audio Model
This is the main repository for the MakeHuman application as such.
Module for automatic summarization of text documents and HTML pages.
Recognize handwritten text in scanned documents using MultiDimensional Recurrent Neural Networks