Popular repositories

- SageAttention (Cuda, forked from thu-ml/SageAttention)
  Quantized Attention achieves speedups of 2-5x and 3-11x over FlashAttention and xformers, respectively, without losing end-to-end metrics across language, image, and video models.
- radial-attention (Python, forked from mit-han-lab/radial-attention)
  Radial Attention official implementation.
- Jenga (Python, forked from dvlab-research/Jenga)
  Official implementation: Training-Free Efficient Video Generation via Dynamic Token Carving.
- moon-lite-attention (Python, forked from Dao-AILab/flash-attention)
  Fast and memory-efficient exact attention (fork of flash-attention).
- Wan2.1-pv-skip (Python, forked from Wan-Video/Wan2.1)
  Wan: Open and Advanced Large-Scale Video Generative Models.
All five repositories are public forks under the moonmath-ai organization.
People

This organization has no public members.