Skip to content
View luckyq's full-sized avatar

Block or report luckyq

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

EB1A DIY Collection

18 6 Updated Jun 8, 2026

I self petitioned my EB1A and got approved. This repository contains my original petition, RFE response, and link to resources I used.

33 31 Updated May 6, 2026

A simple pip-installable Python tool to generate your HTML citation world map from your Google Scholar ID.

Python 715 66 Updated Jun 15, 2026

Example Claude skill for explaining technical AI concepts.

Python 78 44 Updated Dec 21, 2025

DIY for NIW/EB1A

34 12 Updated Jan 19, 2024

A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code.

Python 489 32 Updated Mar 10, 2025

A Datacenter Scale Distributed Inference Serving Framework

Rust 7,317 1,264 Updated Jun 23, 2026

FlashInfer: Kernel Library for LLM Serving

Python 5,839 1,070 Updated Jun 22, 2026

slime is an LLM post-training framework for RL Scaling.

Python 6,684 964 Updated Jun 22, 2026

Scalable toolkit for efficient model reinforcement

Python 1,752 432 Updated Jun 23, 2026

My learning notes for ML SYS.

Python 6,566 448 Updated Jun 18, 2026

Reexamining Direct Cache Access to Optimize I/O Intensive Applications for Multi-hundred-gigabit Networks

Makefile 103 22 Updated Sep 2, 2021

Helpful kernel tutorials, examples and SKILLs for tile-based GPU programming

Python 759 78 Updated Jun 17, 2026

Microsoft Azure Traces

Jupyter Notebook 1,149 182 Updated Jun 3, 2026

LaTeX templates for papers

TeX 55 19 Updated Feb 9, 2019

verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

Python 22,085 4,110 Updated Jun 22, 2026

A collection of awesome researchers and papers about disaggregated memory.

190 18 Updated Apr 3, 2026

gem5 repository to study chiplet-based systems

C++ 90 19 Updated Apr 18, 2019

DeepEP: an efficient expert-parallel communication library

Cuda 9,751 1,294 Updated Jun 15, 2026

A quick guide (especially) for trending instruction finetuning datasets

3,395 238 Updated Nov 28, 2023

Microsoft Collective Communication Library

C++ 391 34 Updated Sep 20, 2023

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python 2,254 366 Updated Aug 14, 2025

GLake: optimizing GPU memory management and IO transmission.

Python 502 45 Updated Mar 24, 2025

Awesome-LLM: a curated list of Large Language Model

26,963 2,599 Updated Jul 31, 2025

Ongoing research training transformer models at scale

Python 16,796 4,106 Updated Jun 23, 2026

NCCL Tests

Cuda 1,561 380 Updated Jun 22, 2026
C 32 15 Updated Jan 21, 2021

Open-source benchmark suite for cloud microservices

Lua 933 498 Updated Jul 9, 2024
Next