Lists (2)
Sort Name ascending (A-Z)
Stars
💫 Toolkit to help you get started with Spec-Driven Development
Awesome curated collection of images and prompts built with the soon-to-launch Nano Banana Pro model. Browse limited early-access test cases that highlight Nano Banana Pro’s strengths in consistent…
ElevenLabs UI is a component library and custom registry built on top of shadcn/ui to help you build multimodal agents faster.
The world's best AI personal assistant for email. Open source app to help you reach inbox zero fast.
基于RecyclerView实现网格分页LayoutManager——PagerGridLayoutManager
An AI agent development platform with all-in-one visual tools, simplifying agent creation, debugging, and deployment like never before. Coze your way to AI Agent creation.
Turn detection for full-duplex dialogue communication
Open-source framework for conversational voice AI agents
🚀 The best real-time interactive AI avatar(digital human) with on-premise deployment and <1.5 s latency.
SRS is a simple, high-efficiency, real-time media server supporting RTMP, WebRTC, HLS, HTTP-FLV, HTTP-TS, SRT, MPEG-DASH, and GB28181, with codec support for H.264, H.265, AV1, VP9, AAC, Opus, and …
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
[ICLR 2025] Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation
AgentScope is a production-ready, easy-to-use agent framework with essential abstractions that work with rising model capability and built-in support for finetuning.
A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phone
Try to do Acoustic Echo Cancellation on Android with AEC modules from Speex and WebRTC.
Android Voice Activity Detection (VAD) library. Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.
Android Serial Port Assistant - Android 串口助手
cmliu / edgetunnel
Forked from zizifn/edgetunneledgetunnel 2.0 VLESS/Trojan 多功能面板
every websites have been tested and fixed, all can be running in localhost. After clone the repository enter the website's folder, simply start a local HTTP server such as live-server to run the we…
肖像大师 中文版 comfyui-portrait-master
[CVPR 2022] FaceFormer: Speech-Driven 3D Facial Animation with Transformers