- Shanghai, China
- ym1076302261@163.com
- https://civitai.com/user/Y_Man
Stars
Voice-to-text dictation app with local (Nvidia Parakeet/Whisper) and cloud models (BYOK). Privacy-first and available cross-platform.
Game-Time: Evaluating Temporal Dynamics in Full-Duplex Spoken Language Models
Sub2API is an open-source relay platform that unifies Claude, OpenAI, Gemini, and Antigravity subscriptions into a single endpoint. It supports account sharing and cost-sharing, with seamless nativ…
Scripts for agents, shared between my repositories.
[ICLR 2026] UniVideo: Unified Understanding, Generation, and Editing for Videos
[ACL'26 Oral] Official implementation of "SegTune: Structured and Fine-Grained Control for Song Generation".
Lightweight FFmpeg and FFprobe for Node, built as modern WebAssembly for local media automation.
Fast CLI tool for finding and resuming Claude Code sessions
CCEdit: Creative and Controllable Video Editing via Diffusion Models
Free AI tool to remove watermarks from Google Veo AI-generated videos. Specialized for Veo watermark patterns, maintains HD quality, fast processing. Perfect for AI video creators, content marketer…
Open-source, ad-free Android multimedia recorder with background video recording, screen recording, live streaming, and remote camera control
OpenShot Video Editor is an award-winning free and open-source video editor for Linux, Mac, and Windows, and is dedicated to delivering high-quality video editing and animation solutions to the world.
OpenReel Video - Professional browser-based video editor. Open source CapCut alternative. 100% browser-based, no installation, no cloud uploads, no watermarks.
SIA is a Self Improving AI framework to autonomously improve the performance of any AI system (Model / Agent) on a benchmark task.
Ask the oracle when you're stuck. Invoke GPT-5 Pro with a custom context and files.
Local-first session intelligence and analytics for coding agents, supporting Claude Code, Codex, and more than 20 other agents. Also: 100x faster replacement for ccusage!
Open-source live-chat, email support, omni-channel desk. An alternative to Intercom, Zendesk, Salesforce Service Cloud etc. 🔥💬
🕵️‍♂️ Collect a dossier on a person by username from 3000+ sites
Security scanner for AI agent skills. Detect vulnerabilities, malicious patterns, and security risks.
Generate text, images, video, speech, and music by MiniMax.
Monocle is a framework for tracing GenAI app code. This repo contains the implementation of Monocle for GenAI apps written in Python.
A Scalable Agentic RL Training Framework for Deep Research Agent