Stars
💡 All-in-one open-source AI framework for semantic search, LLM orchestration and language model workflows
Make bilingual epub books Using AI translate
Open Source framework for voice and multimodal conversational AI
Real-Time and Accurate Full-Body Multi-Person Pose Estimation&Tracking System
A powerful framework for building realtime voice AI agents 🤖🎙️📹
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
Use LLMs to track and extract websites, RSS feeds, and social media
Text-audio foundation model from Boson AI
Free English to Chinese Dictionary Database
A extendable, replaceable Python algorithmic backtest && trading framework supporting multiple securities
Reverse engineering and pentesting for Android applications
Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.
A framework for Claude Opus to intelligently orchestrate subagents.
Tool to look for several security related Android application vulnerabilities
Have a natural, spoken conversation with AI!
LiYing is an automated photo processing program designed for automating the post-processing workflow of ID photos in general photo studios. | LiYing 是一套适用于自动化 完成一般照相馆后期证件照处理流程的照片自动处理的程序。
Bag of Tricks and A Strong Baseline for Deep Person Re-identification
Transcription, forced alignment, and audio indexing with OpenAI's Whisper
AI powered speech denoising and enhancement
The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.
EPUB to audiobook converter, optimized for Audiobookshelf, WebUI included
WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.
A python package to build AI-powered real-time audio applications
Training Large Language Model to Reason in a Continuous Latent Space
Here lieth a pioneer in open source sustainability. RIP