View Hygge02's full-sized avatar

World-R1: Reinforcing 3D Constraints for Text-to-Video Generation

Python 21 Updated Apr 27, 2026

PTX ISA 9.1 documentation converted to searchable markdown. Includes Claude Code skill for CUDA development.

Python 3 Updated Dec 24, 2025

A curated list for feed-forward 3D scene modeling, including research directions, datasets, and applications.

175 4 Updated Apr 22, 2026

A kernel library written in tilelang

Python 1,277 104 Updated Apr 23, 2026

This is the official PyTorch implementation of "Video-BLADE: Block-Sparse Attention Meets Step Distillation for Efficient Video Generation."

Python 1 Updated Nov 6, 2025

Project homepage of Pyramid sparse attention

TeX 4 Updated Dec 14, 2025

This is the official PyTorch implementation of "BLADE: Block-Sparse Attention Meets Step Distillation for Efficient Video Generation."

Python 42 5 Updated Oct 9, 2025

A project implementing various agentic RL methods on top of the Slime post-training framework

Python 365 18 Updated Apr 11, 2026

TriAttention — Efficient long reasoning with trigonometric KV cache compression. Enables OpenClaw local deployment on memory-constrained GPUs.

Python 656 53 Updated Apr 23, 2026

[CVPRW 2026 Oral] Less Detail, Better Answers: Degradation-Driven Prompting for VQA

Python 20 1 Updated Apr 25, 2026

A collection of specialized agent skills for AI infrastructure development, enabling Claude Code to write, optimize, and debug high-performance systems.

Python 121 6 Updated Apr 15, 2026

PTX ISA 9.1 documentation converted to searchable markdown. Includes Claude Code skill for CUDA development.

Python 185 34 Updated Dec 24, 2025

Tilus is a tile-level kernel programming language with explicit control over shared memory and registers.

Python 481 25 Updated Apr 27, 2026

🔥 LeetCode for PyTorch — practice implementing softmax, attention, GPT-2 and more from scratch with instant auto-grading. Jupyter-based, self-hosted or try online.

Jupyter Notebook 3,577 301 Updated Mar 27, 2026

LM engine is a library for pretraining/finetuning LLMs

Python 165 29 Updated Apr 26, 2026

A Model Context Protocol (MCP) server for creating, reading, and manipulating Microsoft Word documents. This server enables AI assistants to work with Word documents through a standardized interfac…

Python 1,898 252 Updated Dec 31, 2025

Official repository of paper [FALQON: Accelerating LoRA Fine-tuning with Low-Bit Floating-Point Arithmetic, NeurIPS 2025]

Python 21 Updated Dec 2, 2025
Python 130 13 Updated Feb 17, 2026

Artifact for PPoPP'26 "RoMeo: Mitigating Dual-dimensional Outliers with Rotated Mixed Precision Quantization"

Python 9 Updated Jan 9, 2026

Flash attention tutorial written in Python, Triton, CUDA, and CUTLASS

Cuda 506 52 Updated Jan 20, 2026

DFloat11 [NeurIPS '25]: Lossless Compression of LLMs and DiTs for Efficient GPU Inference

Python 623 38 Updated Nov 24, 2025

A library of GPU kernels for sparse matrix operations.

C++ 286 53 Updated Nov 24, 2020

SpInfer: Leveraging Low-Level Sparsity for Efficient Large Language Model Inference on GPUs

Cuda 64 15 Updated Mar 25, 2025

Helpful kernel tutorials, examples and SKILLs for tile-based GPU programming

Python 706 67 Updated Apr 27, 2026

⚡FlashRAG: A Python Toolkit for Efficient RAG Research (WWW2025 Resource)

Python 3,469 297 Updated Apr 10, 2026

Accelerating MoE with IO and Tile-aware Optimizations

Python 661 79 Updated Apr 27, 2026

[ASPLOS'26] Taming the Long-Tail: Efficient Reasoning RL Training with Adaptive Drafter

Python 163 17 Updated Feb 27, 2026

The official repository for PTQTP implementation

12 Updated Sep 24, 2025