Stars
Complexity within, simplicity without.
IronClaw is an OpenClaw-inspired implementation in Rust focused on privacy and security.
The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trained model checkpoints, and example notebooks that show how t…
Translate the video from one language to another and embed dubbing & subtitles.
A machine learning-based video super resolution and frame interpolation framework. Est. Hack the Valley II, 2018.
Ghidra is a software reverse engineering (SRE) framework
🚀 Truly open-source AI avatar (digital human) toolkit for offline video generation and digital human cloning.
Windows desktop front end for Spleeter - AI source separation
A preliminary attempt at vibecoding in the field of AI writing, featuring function calling, RAG, MCP, skills, and human-in-the-loop. Perhaps usable as a small Cursor?
The world's first open-source multimodal creative assistant. A privacy-first, locally usable substitute for Canva and Manus.
Official PyTorch implementation of One-Minute Video Generation with Test-Time Training
A video processing framework with simplicity in mind
VapourSynth port of RemoveGrain and Repair plugins from Avisynth
iFlow cli is a comprehensive command-line intelligence that embeds in your terminal, analyzes your repositories, does coding tasks, interprets your needs across contexts, and boosts efficiency by p…
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
Gemini polling proxy service
Gemini ➜ OpenAI API proxy. Serverless!
Real-time interactive streaming digital human
This is the official implementation of our paper: "MiniMax-Remover: Taming Bad Noise Helps Video Object Removal"
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
A sound cloning tool with a web interface, using your voice or any sound to record audio
Real-time voice changer
A lightning-fast tool for extracting hardcoded subtitles from video. Reaches 10x-speed extraction with only an Apple M1 chip or an NVIDIA 3060 GPU.
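A GUI tool for extracting hard-coded subtitles (hardsubs) from videos and generating srt files. No third-party API is required; text recognition runs locally. A deep-learning-based subtitle extraction framework that covers subtitle region detection and subtitle content extraction.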
A free and open-source inpainting & image-upscaling tool powered by WebGPU and WASM, implemented entirely in the browser.
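A client that uses the LaMa model for image and video processing. Built on the IOPaint project, it supports batch mask adjustment for processing images in bulk and frame-by-frame mask adjustment for processing video.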
ACE-Step: A Step Towards Music Generation Foundation Model