Skip to content
View hopkins516's full-sized avatar
🏠
Working from home
🏠
Working from home

Block or report hopkins516

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Collect some CS textbooks for learning.

6 264 Updated Mar 14, 2022

Splits single Nvidia GPU into multiple partitions with complete compute and memory isolation (wrt to performace) between the partitions

C 164 40 Updated Apr 21, 2019

The repo is finally unlocked. enjoy the party! The fastest repo in history to surpass 100K stars ⭐. Join Discord: https://discord.gg/5TUQKqFWd Built in Rust using oh-my-codex.

Rust 183,853 107,980 Updated Apr 13, 2026

A modern GUI client based on Tauri, designed to run in Windows, macOS and Linux for tailored proxy experience

TypeScript 110,508 8,038 Updated Apr 14, 2026

First open-source KVTC implementation (NVIDIA, ICLR 2026) -- 8-32x KV cache compression via PCA + adaptive quantization + entropy coding

Python 12 2 Updated Apr 1, 2026

UniCNet is a cycle-accurate simulator supporting effienct simulation for composable chiplet networks.

C++ 4 Updated Jan 29, 2026
TypeScript 118 21 Updated Mar 18, 2026

decrypt NVML debug log files

Python 1 Updated Mar 29, 2024

Checkpoint-engine is a simple middleware to update model weights in LLM inference engines

Python 941 82 Updated Feb 28, 2026

High Performance KV Cache Store for LLM

C 53 8 Updated Apr 6, 2026

Cost-efficient and pluggable Infrastructure components for GenAI inference

Go 4,721 548 Updated Apr 13, 2026

NVLeak: Off-Chip Side-Channel Attacks via Non-Volatile Memory Systems [USENIX Security '23]

TeX 20 2 Updated Nov 17, 2022

A tutorial for getting started with the Deep Learning Accelerator (DLA) on NVIDIA Jetson

Python 367 37 Updated May 19, 2022

https://github.com/eunomia-bpf homepage, documents and blogs

TypeScript 209 36 Updated Mar 17, 2026

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 159,352 32,868 Updated Apr 14, 2026

《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程

Jupyter Notebook 29,709 2,934 Updated Apr 13, 2026

A comprehensive toolkit for GPU Communications Libraries performance testing and data analysis.

Python 10 1 Updated Jan 6, 2026

cluster data collected from production clusters in Alibaba for cluster management research

Jupyter Notebook 2,021 461 Updated Mar 12, 2026
475 34 Updated Nov 3, 2023

vArmor is a cloud native container sandbox system based on AppArmor/BPF/Seccomp. It also includes multiple built-in protection rules that are ready to use out of the box.

Go 456 53 Updated Apr 13, 2026

A book for Learning the Foundations of LLMs

16,037 1,530 Updated Dec 12, 2025

Machine Learning Engineering Open Book

Python 17,692 1,121 Updated Mar 16, 2026

hpc 教程,包含集合通信(mpi、nccl)、cuda 编程、向量化 SIMD、RDMA 通信等

Cuda 408 44 Updated Apr 7, 2026
C 202 67 Updated Mar 9, 2026

Using Persistent Memory Region in NVMe SSD to boost KVStore accessing

C++ 2 1 Updated Jul 15, 2024

SMDK, Scalable Memory Development Kit, is developed for Samsung CXL(Compute Express Link) Memory Expander to enable full-stack Software-Defined Memory system

C 319 66 Updated Dec 9, 2024
Next