Starred repositories

Achieve state of the art inference performance with modern accelerators on Kubernetes

Shell 2,853 379 Updated Mar 30, 2026

CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation

Python 879 65 Updated Mar 4, 2026

📚 "Building Agents from Scratch" (《从零开始构建智能体》): a from-scratch tutorial on agent principles and practice

Python 32,199 3,635 Updated Mar 30, 2026

You like pytorch? You like micrograd? You love tinygrad! ❤️

Python 32,020 4,013 Updated Mar 30, 2026

A list of free LLM inference resources accessible via API.

Python 17,467 1,731 Updated Mar 10, 2026

Write scalable load tests in plain Python 🚗💨

Python 27,654 3,196 Updated Mar 29, 2026

The open source coding agent.

TypeScript 132,949 14,284 Updated Mar 30, 2026

🌐 Make websites accessible for AI agents. Automate tasks online with ease.

Python 85,105 9,861 Updated Mar 30, 2026

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 3,856 535 Updated Mar 13, 2026

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 41,880 7,396 Updated Mar 30, 2026

cuTile is a programming model for writing parallel kernels for NVIDIA GPUs

Python 2,005 130 Updated Mar 30, 2026

A book for Learning the Foundations of LLMs

15,986 1,518 Updated Dec 12, 2025

LLM inference in C/C++

C++ 100,149 16,042 Updated Mar 30, 2026

🌸 A command-line fuzzy finder

Go 79,102 2,745 Updated Mar 30, 2026

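微舆: a multi-agent public-opinion analysis assistant that anyone can use. It breaks through information cocoons, restores the true picture of public opinion, predicts future trends, and supports decision-making. Implemented from scratch, with no dependency on any framework.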

Python 40,038 7,432 Updated Mar 13, 2026

Official Implementation of EAGLE-1 (ICML'24), EAGLE-2 (EMNLP'24), and EAGLE-3 (NeurIPS'25).

Python 2,248 264 Updated Feb 20, 2026

A Datacenter Scale Distributed Inference Serving Framework

Rust 6,443 973 Updated Mar 30, 2026

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 18,818 1,707 Updated Jan 30, 2026

Move and resize windows on macOS with keyboard shortcuts and snap areas

Swift 28,676 902 Updated Mar 20, 2026

316 28 Updated Feb 26, 2026

The 500 AI Agents Projects is a curated collection of AI agent use cases across various industries. It showcases practical applications and provides links to open-source projects for implementation…

27,520 4,795 Updated Jan 13, 2026

Infisical is the open-source platform for secrets, certificates, and privileged access management.

TypeScript 25,629 1,774 Updated Mar 30, 2026

Sync notes between local and cloud with smart conflict: S3 (Amazon S3/Cloudflare R2/Backblaze B2/...), Dropbox, webdav (NextCloud/InfiniCLOUD/Synology/...), OneDrive, Google Drive (GDrive), Box, pC…

TypeScript 7,074 351 Updated Nov 10, 2024

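Crawlers for Xiaohongshu notes and comments, Douyin videos and comments, Kuaishou videos and comments, Bilibili videos and comments, Weibo posts and comments, Baidu Tieba posts and comment replies, and Zhihu Q&A articles and comments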

Python 46,927 10,081 Updated Mar 24, 2026

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 27,015 1,948 Updated Jan 9, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 74,748 14,969 Updated Mar 30, 2026

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 25,241 5,066 Updated Mar 30, 2026

Windows Subsystem for Linux

C++ 31,616 1,669 Updated Mar 30, 2026

Manage and switch between multiple proxies quickly & easily.

CoffeeScript 7,113 320 Updated Oct 16, 2025