luckyq

luckyq

2 followers · 4 following

Achievements

Starred repositories

caitaozhan / EB1A-DIY

EB1A DIY Collection

18 6 Updated Jun 8, 2026

sarasultanphd / EB1A

I self petitioned my EB1A and got approved. This repository contains my original petition, RFE response, and link to resources I used.

33 31 Updated May 6, 2026

ChenLiu-1996 / CitationMap

A simple pip-installable Python tool to generate your HTML citation world map from your Google Scholar ID.

Python 715 66 Updated Jun 15, 2026

ShawhinT / ai-tutor-skill

Example Claude skill for explaining technical AI concepts.

Python 78 44 Updated Dec 21, 2025

JunhuiLi1017 / DIY-NIW-EB1A

DIY for NIW/EB1A

34 12 Updated Jan 19, 2024

rkinas / triton-resources

A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code.

Python 489 32 Updated Mar 10, 2025

ai-dynamo / dynamo

A Datacenter Scale Distributed Inference Serving Framework

Rust 7,317 1,264 Updated Jun 23, 2026

aws-neuron / neuronx-distributed

Python 66 22 Updated Apr 9, 2026

flashinfer-ai / flashinfer

FlashInfer: Kernel Library for LLM Serving

Python 5,839 1,070 Updated Jun 22, 2026

THUDM / slime

slime is an LLM post-training framework for RL Scaling.

Python 6,684 964 Updated Jun 22, 2026

NVIDIA-NeMo / RL

Scalable toolkit for efficient model reinforcement

Python 1,752 432 Updated Jun 23, 2026

zhaochenyang20 / Awesome-ML-SYS-Tutorial

My learning notes for ML SYS.

Python 6,566 448 Updated Jun 18, 2026

aliireza / ddio-bench

Reexamining Direct Cache Access to Optimize I/O Intensive Applications for Multi-hundred-gigabit Networks

Makefile 103 22 Updated Sep 2, 2021

NVIDIA / TileGym

Helpful kernel tutorials, examples and SKILLs for tile-based GPU programming

Python 759 78 Updated Jun 17, 2026

Azure / AzurePublicDataset

Microsoft Azure Traces

Jupyter Notebook 1,149 182 Updated Jun 3, 2026

ryscheng / paper-template

LaTeX templates for papers

TeX 55 19 Updated Feb 9, 2019

verl-project / verl

verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

Python 22,085 4,110 Updated Jun 22, 2026

dmemsys / awesome-disaggregated-memory

A collection of awesome researchers and papers about disaggregated memory.

190 18 Updated Apr 3, 2026

GT-CHIPS / gem5_chips

gem5 repository to study chiplet-based systems

C++ 90 19 Updated Apr 18, 2019

deepseek-ai / DeepEP

DeepEP: an efficient expert-parallel communication library

Cuda 9,751 1,294 Updated Jun 15, 2026

Zjh-819 / LLMDataHub

A quick guide (especially) for trending instruction finetuning datasets

3,395 238 Updated Nov 28, 2023

microsoft / msccl

Microsoft Collective Communication Library

C++ 391 34 Updated Sep 20, 2023

deepspeedai / Megatron-DeepSpeed

Forked from NVIDIA/Megatron-LM

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python 2,254 366 Updated Aug 14, 2025

antgroup / glake

GLake: optimizing GPU memory management and IO transmission.

Python 502 45 Updated Mar 24, 2025

Hannibal046 / Awesome-LLM

Awesome-LLM: a curated list of Large Language Model

26,963 2,599 Updated Jul 31, 2025

NVIDIA / Megatron-LM

Ongoing research training transformer models at scale

Python 16,796 4,106 Updated Jun 23, 2026

NVIDIA / nccl-tests

NCCL Tests

Cuda 1,561 380 Updated Jun 22, 2026

sihyeong / Awesome-LLM-Inference-Engine

218 18 Updated Apr 27, 2026

zyqCSL / sinan-local

C 32 15 Updated Jan 21, 2021

delimitrou / DeathStarBench

Open-source benchmark suite for cloud microservices

Lua 933 498 Updated Jul 9, 2024

luckyq

Starred repositories

Linux

LaTeX