Skip to content
View jiweibo's full-sized avatar
:octocat:
I may be slow to respond.
:octocat:
I may be slow to respond.

Block or report jiweibo

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Lightweight Kubernetes

Go 32,663 2,636 Updated Apr 3, 2026

Achieve state of the art inference performance with modern accelerators on Kubernetes

Shell 2,912 386 Updated Apr 4, 2026

CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation

Python 888 65 Updated Mar 4, 2026

📚 《从零开始构建智能体》——从零开始的智能体原理与实践教程

Python 33,737 3,884 Updated Mar 30, 2026

You like pytorch? You like micrograd? You love tinygrad! ❤️

Python 32,157 4,026 Updated Apr 5, 2026

A list of free LLM inference resources accessible via API.

Python 17,919 1,779 Updated Mar 10, 2026

Write scalable load tests in plain Python 🚗💨

Python 27,676 3,195 Updated Apr 2, 2026

The open source coding agent.

TypeScript 137,506 15,093 Updated Apr 5, 2026

🌐 Make websites accessible for AI agents. Automate tasks online with ease.

Python 86,098 9,955 Updated Apr 3, 2026

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 3,923 555 Updated Mar 13, 2026

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 41,954 7,406 Updated Apr 5, 2026

cuTile is a programming model for writing parallel kernels for NVIDIA GPUs

Python 2,013 132 Updated Apr 4, 2026

A book for Learning the Foundations of LLMs

16,014 1,521 Updated Dec 12, 2025

LLM inference in C/C++

C++ 101,526 16,383 Updated Apr 5, 2026

🌸 A command-line fuzzy finder

Go 79,278 2,751 Updated Apr 5, 2026

微舆:人人可用的多Agent舆情分析助手,打破信息茧房,还原舆情原貌,预测未来走向,辅助决策!从0实现,不依赖任何框架。

Python 40,166 7,456 Updated Mar 13, 2026

Official Implementation of EAGLE-1 (ICML'24), EAGLE-2 (EMNLP'24), and EAGLE-3 (NeurIPS'25).

Python 2,254 269 Updated Feb 20, 2026

A Datacenter Scale Distributed Inference Serving Framework

Rust 6,484 992 Updated Apr 5, 2026

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 18,884 1,715 Updated Jan 30, 2026

Move and resize windows on macOS with keyboard shortcuts and snap areas

Swift 28,730 908 Updated Apr 2, 2026
317 28 Updated Feb 26, 2026

The 500 AI Agents Projects is a curated collection of AI agent use cases across various industries. It showcases practical applications and provides links to open-source projects for implementation…

27,842 4,858 Updated Jan 13, 2026

Infisical is the open-source platform for secrets, certificates, and privileged access management.

TypeScript 25,713 1,786 Updated Apr 4, 2026

Sync notes between local and cloud with smart conflict: S3 (Amazon S3/Cloudflare R2/Backblaze B2/...), Dropbox, webdav (NextCloud/InfiniCLOUD/Synology/...), OneDrive, Google Drive (GDrive), Box, pC…

TypeScript 7,119 355 Updated Nov 10, 2024

小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频 | 评论爬虫、微博帖子 | 评论爬虫、百度贴吧帖子 | 百度贴吧评论回复爬虫 | 知乎问答文章|评论爬虫

Python 47,317 10,175 Updated Apr 3, 2026

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 27,043 1,967 Updated Jan 9, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 75,324 15,188 Updated Apr 5, 2026

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 25,439 5,194 Updated Apr 5, 2026

Windows Subsystem for Linux

C++ 31,685 1,670 Updated Apr 5, 2026
Next