Skip to content
View jr-shen's full-sized avatar
🤔
🤔

Highlights

  • Pro

Block or report jr-shen

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Techniques and numbers for estimating system's performance from first-principles

Rust 5,249 216 Updated Mar 21, 2026

AI agents running research on single-GPU nanochat training automatically

Python 86,798 12,568 Updated Mar 26, 2026

Fully autonomous & self-evolving research from idea to paper. Chat an Idea. Get a Paper. 🦞

Python 13,412 1,571 Updated Jun 3, 2026

A vector indexing library to bring fast, fresh and filtered search to your database

Rust 1,844 427 Updated Jun 14, 2026

Fast, small, and fully autonomous AI personal assistant infrastructure, any OS, any platform — deploy anywhere, swap anything 🦀

Rust 31,907 4,719 Updated Jun 15, 2026

Toolkit to run Python benchmarks

Python 937 97 Updated Jun 2, 2026

Concurrency permutation testing tool for Rust.

Rust 2,727 141 Updated Feb 20, 2026
Python 282 53 Updated Jun 9, 2026

Get Started From Here. The main repo for the whole open-rdma project. Including introduction, hands-on guide, new events and many other things.

293 48 Updated May 22, 2026

Transaction traces generated by FissLock's generator.

1 Updated Dec 19, 2024

Installs OFED from Mellanox Repositories

Jinja 7 4 Updated Apr 8, 2022

Reparent a running program to a new terminal

C 6,281 230 Updated Nov 20, 2025

CLI for interacting with clash

Rust 273 31 Updated Apr 2, 2024

Asterinas aims to be a production-grade Linux alternative—memory safe, high-performance, and more.

Rust 4,672 313 Updated Jun 15, 2026

Provide with pre-build flash-attention 2 and 3 package wheels on Linux and Windows using GitHub Actions

Python 1,530 67 Updated Jun 15, 2026

Nix, the purely functional package manager

C++ 17,082 1,942 Updated Jun 14, 2026

Manage your dotfiles across multiple diverse machines, securely.

Go 20,199 647 Updated Jun 9, 2026

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 4,398 698 Updated May 17, 2026

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 9,969 1,055 Updated May 7, 2026

A scheduling framework for multitasking over diverse XPUs, including GPUs, NPUs, ASICs, and FPGAs

C 174 27 Updated May 9, 2026

FalconFS is a high-performance distributed file system (DFS) designed for AI workloads.

C++ 63 19 Updated May 19, 2026

A GPU benchmark tool for evaluating GPUs and CPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL, OpenMP)

C++ 461 73 Updated May 31, 2026

Meta's fleetwide profiler framework

C++ 348 22 Updated Jun 12, 2026

High-Throughput, Cost-Effective Billion-Scale Vector Search with a Single GPU [SIGMOD'26]

Cuda 27 5 Updated Apr 22, 2026

A low-latency, billion-scale, and updatable graph-based vector store on SSD.

C++ 139 42 Updated May 27, 2026

PiKV: KV Cache Management System for Mixture of Experts [Efficient ML System]

Python 58 8 Updated Jun 12, 2026

Distributed KV cache scheduling & offloading libraries

Go 156 139 Updated Jun 14, 2026

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

Python 6,497 604 Updated Jun 15, 2026

FlashInfer: Kernel Library for LLM Serving

Python 5,792 1,049 Updated Jun 15, 2026

LongBench v2 and LongBench (ACL 25'&24')

Python 1,192 134 Updated Jan 15, 2025
Next