rl-explainer

Svelte 192 4 Updated Mar 9, 2026

HPC tutorial covering collective communication (MPI, NCCL), CUDA programming, SIMD vectorization, RDMA communication, and more

Cuda 556 58 Updated Apr 27, 2026

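Sharing AI Infra knowledge and coding exercises: getting started with the PyTorch/vLLM/SGLang frameworks⚡️, performance acceleration🚀, large-model fundamentals🧠, AI software and hardware🔧, and more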

Jupyter Notebook 2,578 233 Updated May 30, 2026
Python 71 7 Updated Jun 8, 2026

slime is an LLM post-training framework for RL Scaling.

Python 6,109 893 Updated Jun 13, 2026

ArcticInference: vLLM plugin for high-throughput, low-latency inference

Python 448 63 Updated Jun 12, 2026

CPM.cu is a lightweight, high-performance CUDA implementation for LLMs, optimized for end-device inference and featuring cutting-edge techniques in sparse architecture, speculative sampling and qua…

Cuda 241 26 Updated Jan 14, 2026

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 3,228 289 Updated Jun 13, 2026

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉

Cuda 11,247 1,149 Updated May 29, 2026

Distributed Compiler based on Triton for Parallel Systems

Python 1,459 151 Updated Apr 22, 2026
Python 169 19 Updated Dec 27, 2024

Fast low-bit matmul kernels in Triton

Python 471 34 Updated May 15, 2026

GSemSplat: Generalizable Semantic 3D Gaussian Splatting from Uncalibrated Image Pairs

Python 8 Updated Apr 7, 2025

Solve Visual Understanding with Reinforced VLMs

Python 5,983 380 Updated Mar 12, 2026

A very simple GRPO implementation for reproducing r1-like LLM thinking.

Python 1,690 133 Updated Nov 21, 2025

[BMVC 2025] Occam’s LGS: An Efficient Approach for Language Gaussian Splatting

Python 67 3 Updated Nov 18, 2025

Explainability for Vision Transformers

Python 1,088 108 Updated Mar 12, 2022

Curated list of papers and resources focused on 3D Gaussian Splatting, intended to keep pace with the anticipated surge of research in the coming months.

HTML 8,686 535 Updated Jun 10, 2026

Official implementation of the paper "LangSplat: 3D Language Gaussian Splatting" [CVPR2024 Highlight]

Python 1,061 112 Updated Oct 10, 2025

A curated list for Efficient Large Language Models

Python 2,020 165 Updated Jun 17, 2025

🎓Automatically updates LLM inference systems papers daily using GitHub Actions (updates every 12 hours)

Python 12 1 Updated Jun 8, 2026

Puzzles for learning Triton, play it with minimal environment configuration!

Python 709 101 Updated Mar 17, 2026

Efficient Triton Kernels for LLM Training

Python 6,430 539 Updated Jun 12, 2026

Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 12,781 1,305 Updated Nov 4, 2025
Python 107 8 Updated Sep 9, 2024

Material for gpu-mode lectures

Jupyter Notebook 6,172 623 Updated May 9, 2026

FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batchsizes of 16-32 tokens.

Python 1,085 88 Updated Sep 4, 2024

How to optimize some algorithms in CUDA.

Cuda 3,083 279 Updated Jun 9, 2026

GPTQ inference Triton kernel

Jupyter Notebook 322 21 Updated May 18, 2023

Hands-on model tuning with TVM, profiled on a Mac M1, x86 CPU, and GTX-1080 GPU.

Jupyter Notebook 51 11 Updated Jun 15, 2023