  • National Chiao Tung University
  • Taiwan

Highlights

  • Pro

21 stars written in Python

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

Python 340,001 55,004 Updated Mar 20, 2026

The agent engineering platform

Python 130,817 21,543 Updated Mar 24, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 74,117 14,698 Updated Mar 24, 2026

AI agents running research on single-GPU nanochat training automatically

Python 52,842 7,368 Updated Mar 21, 2026

An AI Hedge Fund Team

Python 49,519 8,611 Updated Mar 19, 2026

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 39,445 4,788 Updated Jun 2, 2025

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 31,738 3,346 Updated Mar 23, 2026

Universal LLM Deployment Engine with ML Compilation

Python 22,264 1,971 Updated Mar 18, 2026

Ongoing research training transformer models at scale

Python 15,777 3,738 Updated Mar 24, 2026

Proxy server to bypass Cloudflare protection

Python 13,210 1,060 Updated Jan 12, 2026

Minimal reproduction of DeepSeek R1-Zero

Python 12,974 1,582 Updated Feb 27, 2026

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

Python 10,361 978 Updated Mar 22, 2026

Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streaming speech generation, free-form voice design, and vivid voice…

Python 9,899 1,250 Updated Mar 17, 2026

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Python 7,844 1,387 Updated Dec 6, 2023

AIOS: AI Agent Operating System

Python 5,377 740 Updated Jan 22, 2026

Witness the aha moment of VLM with less than $3.

Python 4,045 286 Updated May 19, 2025

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:

Python 2,319 298 Updated May 11, 2025

[ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding

Python 1,323 80 Updated Mar 6, 2025

Turn detection for full-duplex dialogue communication

Python 542 36 Updated Dec 26, 2025

LLaMa/RWKV onnx models, quantization and testcase

Python 366 29 Updated Jul 6, 2023
Python 174 96 Updated Mar 18, 2026