Skip to content
View Weili17's full-sized avatar

Block or report Weili17

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

43 results for sponsorable starred repositories
Clear filter

A framework for efficient model inference with omni-modality models

Python 1,038 142 Updated Dec 20, 2025

Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs

Python 760 108 Updated Dec 21, 2025

vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization

Python 2,045 341 Updated Dec 20, 2025

Cost-efficient and pluggable Infrastructure components for GenAI inference

Go 4,474 500 Updated Dec 13, 2025

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 49,680 4,096 Updated Dec 20, 2025

Ohayou(おはよう), HTTP load generator, inspired by rakyll/hey with tui animation.

Rust 9,817 278 Updated Dec 18, 2025

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 2,436 328 Updated Dec 19, 2025

OpenFaaS - Serverless Functions Made Simple

Go 26,011 1,970 Updated Nov 1, 2025

Python bindings for llama.cpp

Python 9,836 1,259 Updated Aug 15, 2025

Large-scale text-video dataset. 10 million captioned short videos.

Python 668 40 Updated Aug 14, 2024

Accessible large language models via k-bit quantization for PyTorch.

Python 7,839 801 Updated Dec 12, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 65,839 12,094 Updated Dec 20, 2025

🐙 Guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.

MDX 68,009 7,206 Updated Dec 20, 2025

Python client for the etcd API v3

Python 441 194 Updated Dec 9, 2024

Reverse engineered ChatGPT API

Python 28,005 4,443 Updated Aug 2, 2023

Elegant and Powerfull. Powered by OpenAI and Vercel.

TypeScript 3,230 2,943 Updated Oct 16, 2024

Share, discover, and collect prompts from the community. Free and open source — self-host for your organization with complete privacy.

TypeScript 139,980 18,567 Updated Dec 20, 2025

TensorFlow, TensorFlow-Lite Pytorch, Torchvision, TensorRT Benchmarks

Python 23 4 Updated Nov 26, 2024

📚 Freely available programming books

Python 379,087 65,636 Updated Dec 16, 2025

Your ultimate Go microservices framework for the cloud-native era.

Go 25,241 4,137 Updated Dec 17, 2025

Bear is a tool that generates a compilation database for clang tooling.

C++ 6,080 347 Updated Dec 14, 2025

Universal configuration library parser

C 1,713 148 Updated Dec 9, 2025

Sol3 (sol2 v3.0) - a C++ <-> Lua API wrapper with advanced features and top notch performance - is here, and it's great! Documentation:

C++ 4,840 583 Updated Mar 7, 2025

✅ Solutions to LeetCode by Go, 100% test coverage, runtime beats 100% / LeetCode 题解

Go 33,774 5,767 Updated Dec 11, 2024

✍🏻 这里是写博客的地方 —— Halfrost-Field 冰霜之地

Go 13,254 1,892 Updated Dec 28, 2023

A General-purpose Task-parallel Programming System using Modern C++

C++ 11,492 1,345 Updated Dec 20, 2025

:octocat: 分享 GitHub 上有趣、入门级的开源项目。Share interesting, entry-level open source projects on GitHub.

Python 137,806 11,000 Updated Nov 28, 2025

📚 Modern C++ Tutorial: C++11/14/17/20 On the Fly | https://changkun.de/modern-cpp/

C++ 25,247 3,085 Updated Aug 17, 2024
Next