tkhe

Xiaolin Wang tkhe

12 followers · 44 following

Achievements

Starred repositories

triton-lang / triton

Development repository for the Triton language and compiler

MLIR 17,485 2,362 Updated Nov 6, 2025

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 62,339 11,080 Updated Nov 6, 2025

sgl-project / sglang

SGLang is a fast serving framework for large language models and vision language models.

Python 19,865 3,288 Updated Nov 6, 2025

huggingface / transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 152,180 31,061 Updated Nov 6, 2025

meta-pytorch / torchcomms

torchcomms: a modern PyTorch communications API

C++ 244 27 Updated Nov 6, 2025

huggingface / diffusers

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 31,506 6,476 Updated Nov 6, 2025

langbot-app / LangBot

🤩 Easy-to-use global IM bot platform designed for LLM era / 简单易用的大模型即时通信机器人开发平台 ⚡️ Bots for QQ / QQ频道 / Discord / LINE / WeChat(微信, 企业微信)/ Telegram / 飞书 / 钉钉 / Slack 🧩 Integrated with ChatGPT(GPT),…

Python 13,932 1,140 Updated Nov 6, 2025

langgenius / dify

Production-ready platform for agentic workflow development.

TypeScript 118,232 18,277 Updated Nov 6, 2025

Tencent / ncnn

ncnn is a high-performance neural network inference framework optimized for the mobile platform

C++ 22,238 4,346 Updated Nov 6, 2025

modelscope / DiffSynth-Studio

Enjoy the magic of Diffusion models!

Python 10,591 988 Updated Nov 6, 2025

opencv / opencv_contrib

Repository for OpenCV's extra modules

C++ 9,879 5,860 Updated Nov 6, 2025

opencv / opencv

Open Source Computer Vision Library

C++ 84,769 56,352 Updated Nov 6, 2025

fla-org / flash-linear-attention

🚀 Efficient implementations of state-of-the-art linear attention models

Python 3,762 295 Updated Nov 6, 2025

thu-ml / SageAttention

[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves speedup of 2-5x compared to FlashAttention, without losing end-to-end metrics across language, image, and video models.

Cuda 2,628 259 Updated Nov 6, 2025

Charles-Xie / awesome-described-object-detection

A curated list of papers and resources related to Described Object Detection, Open-Vocabulary/Open-World Object Detection and Referring Expression Comprehension. Updated frequently and pull request…

328 24 Updated Nov 6, 2025

jingyaogong / minimind

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT！🌏 Train a 26M-parameter GPT from scratch in just 2h!

Python 32,622 3,778 Updated Nov 6, 2025

thu-pacman / chitu

High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.

Python 1,325 88 Updated Nov 6, 2025

huggingface / pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 35,683 5,062 Updated Nov 6, 2025