Starred repositories
Achieve state of the art inference performance with modern accelerators on Kubernetes
CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation
📚 "Building Agents from Scratch" — a from-scratch tutorial on agent principles and practice
You like pytorch? You like micrograd? You love tinygrad! ❤️
A list of free LLM inference resources accessible via API.
Write scalable load tests in plain Python 🚗💨
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
cuTile is a programming model for writing parallel kernels for NVIDIA GPUs
A book for Learning the Foundations of LLMs
微舆 (WeiYu): a multi-agent public opinion analysis assistant for everyone — breaking information bubbles, reconstructing the full picture of public sentiment, forecasting future trends, and supporting decision-making. Implemented from scratch, with no dependency on any framework.
Official Implementation of EAGLE-1 (ICML'24), EAGLE-2 (EMNLP'24), and EAGLE-3 (NeurIPS'25).
A Datacenter Scale Distributed Inference Serving Framework
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
Move and resize windows on macOS with keyboard shortcuts and snap areas
The 500 AI Agents Projects is a curated collection of AI agent use cases across various industries. It showcases practical applications and provides links to open-source projects for implementation.
Infisical is the open-source platform for secrets, certificates, and privileged access management.
Sync notes between local and cloud with smart conflict resolution: S3 (Amazon S3/Cloudflare R2/Backblaze B2/...), Dropbox, WebDAV (NextCloud/InfiniCLOUD/Synology/...), OneDrive, Google Drive (GDrive), Box, pC…
Crawlers for Xiaohongshu notes and comments, Douyin videos and comments, Kuaishou videos and comments, Bilibili videos and comments, Weibo posts and comments, Baidu Tieba posts and comment replies, and Zhihu Q&A articles and comments.
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
A high-throughput and memory-efficient inference and serving engine for LLMs
SGLang is a high-performance serving framework for large language models and multimodal models.
zero-peak / ZeroOmega
Forked from FelisCatus/SwitchyOmega. Manage and switch between multiple proxies quickly & easily.