Skip to content
View jren73's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Block or report jren73

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

CXL Memory Resource Kit top-level repository

Shell 63 12 Updated Oct 31, 2022

DFlash: Block Diffusion for Flash Speculative Decoding

Python 550 34 Updated Feb 6, 2026

Ongoing research training transformer models at scale

Python 15,219 3,601 Updated Feb 17, 2026

Official inference framework for 1-bit LLMs

Python 28,484 2,331 Updated Feb 3, 2026

IOR and mdtest

C 464 190 Updated Jul 9, 2025

Analyze the inference of Large Language Models (LLMs). Analyze aspects like computation, storage, transmission, and hardware roofline model in a user-friendly interface.

Python 620 81 Updated Sep 11, 2024

Numbers every LLM developer should know

4,284 138 Updated Jan 16, 2024

The NAS Parallel Benchmarks for evaluating C++ parallel programming frameworks on shared-memory architectures

C++ 1 1 Updated Jan 8, 2024

A benchmarking suite to evaluate the performance of persistent memory access (PerMA-Bench @ VLDB '22)

C++ 20 3 Updated Sep 3, 2022

Prefetching and efficient data path for memory disaggregation

C 69 24 Updated Jul 16, 2020

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Python 6,182 570 Updated Aug 22, 2025

Repo for summer research@W&M.

C++ 1 Updated Jul 3, 2024
Python 9 7 Updated Sep 19, 2023

LLM papers I'm reading, mostly on inference and model compression

750 38 Updated Dec 21, 2023

REMORA: REsource MOnitoring for Remote Applications

Shell 64 18 Updated Nov 11, 2025

Latency and Memory Analysis of Transformer Models for Training and Inference

Python 477 56 Updated Apr 19, 2025

CoRM: Compactable Remote Memory over RDMA

C++ 20 2 Updated Jun 18, 2021

📺 Discover the latest machine learning / AI courses on YouTube.

17,091 2,093 Updated Jan 22, 2024

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Python 181,847 46,218 Updated Feb 17, 2026

🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading

Python 9,943 588 Updated Sep 7, 2024

Anomaly detection related books, papers, videos, and toolboxes. Last update late 2025 for LLM and VLM works!

Python 9,170 1,804 Updated Nov 25, 2025

Running large language models on a single GPU for throughput-oriented scenarios.

Python 9,381 591 Updated Oct 28, 2024

A list of papers about distributed consensus.

2,609 216 Updated Aug 8, 2024

🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.

16,890 1,546 Updated Feb 13, 2023

像黑客一样使用命令行

TeX 1,454 82 Updated Nov 18, 2022

NAS Parallel Benchmarks for evaluating GPU and APIs

C++ 29 10 Updated Sep 29, 2025

Examples demonstrating available options to program multiple GPUs in a single node or a cluster

Cuda 868 147 Updated Sep 26, 2025

Pytorch domain library for recommendation systems

Python 2,470 604 Updated Feb 17, 2026

Trainable, memory-efficient, and GPU-friendly PyTorch reproduction of AlphaFold 2

Python 3,272 662 Updated Dec 16, 2025
Next