Skip to content
View jr-shen's full-sized avatar
🤔
🤔

Highlights

  • Pro

Block or report jr-shen

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

AI agents running research on single-GPU nanochat training automatically

Python 62,373 8,719 Updated Mar 26, 2026

Fully autonomous & self-evolving research from idea to paper. Chat an Idea. Get a Paper. 🦞

Python 9,752 1,067 Updated Mar 31, 2026

Graph-structured Indices for Scalable, Fast, Fresh and Filtered Approximate Nearest Neighbor Search

Rust 1,738 391 Updated Mar 31, 2026

Toolkit to run Python benchmarks

Python 920 97 Updated Mar 21, 2026

Concurrency permutation testing tool for Rust.

Rust 2,654 133 Updated Feb 20, 2026
Python 198 38 Updated Mar 31, 2026

RoCE v2 hardware and software implementation

187 41 Updated Sep 26, 2024

Transaction traces generated by FissLock's generator.

1 Updated Dec 19, 2024

Installs OFED from Mellanox Repositories

Jinja 7 4 Updated Apr 8, 2022

Reparent a running program to a new terminal

C 6,219 230 Updated Nov 20, 2025

CLI for interacting with clash

Rust 256 28 Updated Apr 2, 2024

Asterinas aims to be a production-grade Linux alternative—memory safe, high-performance, and more.

Rust 4,394 286 Updated Mar 31, 2026

Provide with pre-build flash-attention package wheels on Linux and Windows platforms using GitHub Actions

Python 1,181 60 Updated Mar 31, 2026

Nix, the purely functional package manager

C++ 16,463 1,878 Updated Mar 31, 2026

Manage your dotfiles across multiple diverse machines, securely.

Go 18,821 619 Updated Mar 30, 2026

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 3,882 542 Updated Mar 13, 2026

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 9,788 1,024 Updated Mar 30, 2026

A scheduling framework for multitasking over diverse XPUs, including GPUs, NPUs, ASICs, and FPGAs

C 163 22 Updated Jan 13, 2026

FalconFS is a high-performance distributed file system (DFS) designed for AI workloads.

C++ 60 18 Updated Mar 25, 2026

A GPU benchmark tool for evaluating GPUs and CPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL, OpenMP)

C++ 452 71 Updated Feb 7, 2026

Meta's fleetwide profiler framework

C++ 346 22 Updated Sep 22, 2025

High-Throughput, Cost-Effective Billion-Scale Vector Search with a Single GPU [to appear in SIGMOD'26]

Cuda 21 5 Updated Jan 16, 2026

A low-latency, billion-scale, and updatable graph-based vector store on SSD.

Jupyter Notebook 106 40 Updated Mar 28, 2026

PiKV: KV Cache Management System for Mixture of Experts [Efficient ML System]

Python 49 5 Updated Feb 24, 2026

Distributed KV cache scheduling & offloading libraries

Go 122 105 Updated Mar 31, 2026

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

Python 5,448 493 Updated Mar 31, 2026

FlashInfer: Kernel Library for LLM Serving

Python 5,244 835 Updated Mar 31, 2026

LongBench v2 and LongBench (ACL 25'&24')

Python 1,137 124 Updated Jan 15, 2025

This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?

Python 1,492 125 Updated Nov 13, 2025

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 10,393 774 Updated Mar 30, 2026
Next