-
Technical University of Munich
- Munich, Germany
- https://www.tianyusong.com
- https://orcid.org/0000-0002-8428-9651
- in/tianyu-song
Highlights
- Pro
Stars
On-device English medical speech-to-text — CLI for Omi Med STT v1 (MLX / NeMo / parakeet.cpp)
rsxdalv / chatterbox
Forked from resemble-ai/chatterboxSoTA open-source TTS
12 Lessons to Get Started Building AI Agents
The agent that grows with you
AriaNg, a modern web frontend making aria2 easier to use.
Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthr…
A Python framework for AI-driven character animation using neural networks.
omo/lazycodex: The coding agent for tokenmaxxers;the one and only agent harness for complex codebases. For your Codex, for your OpenCode
A set of Docker Containers for the Unity3D Based Virtual Agent Pipeline for NVIDIA Clara AGX Devices
Create stunning demos for free. Open-source, no subscriptions, no watermarks, and free for commercial use. An alternative to Screen Studio.
👻 Ghostty is a fast, feature-rich, and cross-platform terminal emulator that uses platform-native UI and GPU acceleration.
Curated academic CV templates and guidelines for PhD students, researchers, and faculty job applicants.
Built by a PhD whose memory was failing, whose diet was a mess, and whose anxiety had its own agenda. Most second brain tools ignore the fact that your brain doesn't work in isolation: your body an…
Autonomous coding agent as an SDK, IDE extension, or CLI assistant.
Official implementation of Kimodo, a kinematic motion diffusion model for high-quality human(oid) motion generation.
Unity SDK for real-time Audio-to-3D facial animation powered by AI. Convert speech audio into expressive 3D facial blendshapes with a simple API.
A highly customizable homepage (or startpage / application dashboard) with Docker and service API integrations.
Real-time text-to-speech with Qwen3-TTS
A browser extension for the Motrix Download Manager and its forks
A full-featured download manager — rebuilt from the ground up
Claude Autoresearch Skill — Autonomous goal-directed iteration for Claude Code. Inspired by Karpathy's autoresearch. Modify → Verify → Keep/Discard → Repeat forever.
AI agents running research on single-GPU nanochat training automatically
LLM inference server with continuous batching & SSD caching for Apple Silicon — managed from the macOS menu bar
提供一个人人会用的的路由、NAS系统 (目前活跃的分支是 istoreos-24.10,main或master分支不维护请勿使用)
Browse media content with your own rules on Android TV
An agentic skills framework & software development methodology that works.
groxaxo / Qwen3-TTS-Openai-Fastapi
Forked from QwenLM/Qwen3-TTSQwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streaming speech generation, free-form voice design, and vivid voice…
Open-source object detection for Python developers. Frictionless installation. Free for commercial use.