Stars
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)
A complete and graceful API for Wechat. 微信个人号接口、微信机器人及命令行微信,三十行即可自定义个人号机器人。
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…
A command-line productivity tool powered by AI large language models like GPT-4, will help you accomplish your tasks faster and more efficiently.
Talk to any LLM with hands-free voice interaction, voice interruption, and Live2D taking face running locally across platforms
Awesome IoT. A collaborative list of great resources about IoT Framework, Library, OS, Platform
CapsWriter 的离线版,一个好用的 PC 端的语音输入工具
[NO LONGER MAINTAINED] Command-line utility for auto-generating subtitles for any video file
MCP Server for Computer Use in Windows
The successor to reDuh, pwn a bastion webserver and create SOCKS proxies through the DMZ. Pivot and pwn.
A Python-based Xiaozhi AI for users who want the full Xiaozhi experience without owning specialized hardware.
WebSocket and WAMP in Python for Twisted and asyncio
Scrapy extension to write scraped items using Django models
Streaming ASR and TTS based on FastAPI+ sherpa-onnx
DEPRECATED, NOT MAINTAINED. An experiment in mixing up django and twisted
Creates useful lists of sites with age restricted content
Simple Python lib for home automation. Allows you to read values from sensors/pilight/llap etc. on the Pi and then trigger receivers like lights, fans, etc. into certain states based on rule defini…