Stars
My public programs and models - mostly combinatorial problems and puzzles
The open-source Playwright library for AI browser regression testing with intelligent caching, auto-healing, and multi-model verification.
AI-powered, vision-driven UI automation for every platform.
Linux SVSM (Secure VM Service Module) for secure x86 virtualization in Rust
MCP Server for Computer Use in Windows
Audio Split 基于双门限法的语音端点检测及语音分割
Low-level unprivileged sandboxing tool used by Flatpak and similar projects
VoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice Design, and True-to-Life Cloning
Label Studio is a multi-type data labeling and annotation tool with standardized output format
Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streaming speech generation, free-form voice design, and vivid voice…
Secure, Fast, and Extensible Sandbox runtime for AI agents.
π RuView turns commodity WiFi signals into real-time spatial intelligence, vital sign monitoring, and presence detection — all without a single pixel of video.
Lightweight and portable LLM sandbox runtime (code interpreter) Python library.
🧊 Open source LLM observability platform. One line of code to monitor, evaluate, and experiment. YC W23 🍓
Secure and fast microVMs for serverless computing.
Secure environments for developers and their agents
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
Open-source, secure environment with real-world tools for enterprise-grade agents.
OpenViking is an open-source context database designed specifically for AI Agents(such as openclaw). OpenViking unifies the management of context (memory, resources, and skills) that Agents need th…
The PyTorch-based audio source separation toolkit for researchers
The easy-to-use open source Business Intelligence and Embedded Analytics tool that lets everyone work with data 📊
Xiaomi Home Integration for Home Assistant
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Voice Activity Detector (VAD) : low-latency, high-performance and lightweight
JAXB-based Java library for Word docx, Powerpoint pptx, and Excel xlsx files