Starred repositories (jiweibo)

Lightweight Kubernetes

Go 32,651 2,636 Updated Apr 3, 2026

Achieve state of the art inference performance with modern accelerators on Kubernetes

Shell 2,905 385 Updated Apr 4, 2026

CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation

Python 888 65 Updated Mar 4, 2026

📚 "Building Agents from Scratch": a from-the-ground-up tutorial on agent principles and practice

Python 33,611 3,861 Updated Mar 30, 2026

You like pytorch? You like micrograd? You love tinygrad! ❤️

Python 32,137 4,024 Updated Apr 4, 2026

A list of free LLM inference resources accessible via API.

Python 17,871 1,777 Updated Mar 10, 2026

Write scalable load tests in plain Python 🚗💨

Python 27,674 3,195 Updated Apr 2, 2026

The open source coding agent.

TypeScript 137,034 15,017 Updated Apr 4, 2026

🌐 Make websites accessible for AI agents. Automate tasks online with ease.

Python 86,002 9,945 Updated Apr 3, 2026

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 3,917 552 Updated Mar 13, 2026

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 41,946 7,406 Updated Apr 4, 2026

cuTile is a programming model for writing parallel kernels for NVIDIA GPUs

Python 2,013 131 Updated Apr 4, 2026

A book for Learning the Foundations of LLMs

16,011 1,521 Updated Dec 12, 2025

LLM inference in C/C++

C++ 101,333 16,347 Updated Apr 4, 2026

🌸 A command-line fuzzy finder

Go 79,251 2,750 Updated Apr 4, 2026

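微舆: a multi-agent public opinion analysis assistant that anyone can use. It breaks through information cocoons, restores the true picture of public opinion, predicts future trends, and supports decision-making. Implemented from scratch, with no dependency on any framework.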

Python 40,147 7,450 Updated Mar 13, 2026

Official Implementation of EAGLE-1 (ICML'24), EAGLE-2 (EMNLP'24), and EAGLE-3 (NeurIPS'25).

Python 2,252 269 Updated Feb 20, 2026

A Datacenter Scale Distributed Inference Serving Framework

Rust 6,478 992 Updated Apr 4, 2026

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 18,874 1,714 Updated Jan 30, 2026

Move and resize windows on macOS with keyboard shortcuts and snap areas

Swift 28,718 907 Updated Apr 2, 2026
316 28 Updated Feb 26, 2026

The 500 AI Agents Projects is a curated collection of AI agent use cases across various industries. It showcases practical applications and provides links to open-source projects for implementation…

27,794 4,849 Updated Jan 13, 2026

Infisical is the open-source platform for secrets, certificates, and privileged access management.

TypeScript 25,694 1,785 Updated Apr 4, 2026

Sync notes between local and cloud with smart conflict: S3 (Amazon S3/Cloudflare R2/Backblaze B2/...), Dropbox, webdav (NextCloud/InfiniCLOUD/Synology/...), OneDrive, Google Drive (GDrive), Box, pC…

TypeScript 7,109 354 Updated Nov 10, 2024

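Xiaohongshu note | comment crawler, Douyin video | comment crawler, Kuaishou video | comment crawler, Bilibili video | comment crawler, Weibo post | comment crawler, Baidu Tieba post | Baidu Tieba comment-reply crawler, Zhihu Q&A article | comment crawler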

Python 47,275 10,168 Updated Apr 3, 2026

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 27,038 1,965 Updated Jan 9, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 75,242 15,160 Updated Apr 4, 2026

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 25,421 5,177 Updated Apr 4, 2026

Windows Subsystem for Linux

C++ 31,683 1,669 Updated Apr 4, 2026