🚀 Achieve faster Qwen3-0.6B inference with the MegaQwen CUDA megakernel, delivering 531 tok/s decode on an RTX 3090, 3.9x faster than HuggingFace.
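For context, the claimed 3.9x speedup implies a HuggingFace baseline of roughly 136 tok/s. A back-of-the-envelope check (derived from the numbers above, not an independently measured figure):

```python
# Figures from the project description
megaqwen_tok_s = 531.0  # MegaQwen decode throughput on RTX 3090
speedup = 3.9           # claimed speedup over the HuggingFace baseline

# Implied HuggingFace decode throughput
hf_baseline = megaqwen_tok_s / speedup
print(f"Implied HuggingFace baseline: {hf_baseline:.0f} tok/s")
```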
android privacy chatbot cloud-storage chinese android-studio pretrained-models large-language-models llm chatgpt-api comfyui tongyi qwen2 qwen-api qwen3-vl
Updated Apr 18, 2026 - CUDA