This repository contains multiple implementations of Flash Attention optimized with Triton kernels, showcasing progressive performance improvements through hardware-aware optimizations. The impleme…

Python 11 1 Updated Mar 26, 2026

SchedMD / slurm

Slurm: A Highly Scalable Workload Manager

C 4,066 858 Updated Jun 23, 2026

Vchitect / ShotBench

ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models

Python 97 3 Updated Sep 12, 2025

HKUDS / RAG-Anything

"RAG-Anything: All-in-One RAG Framework"

Python 21,517 2,513 Updated Jun 15, 2026

HKUDS / LightRAG

[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"

Python 36,877 5,199 Updated Jun 21, 2026

xlite-dev / LeetCUDA

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉

Cuda 11,303 1,163 Updated Jun 23, 2026

MQN-80 / mindnlp

Forked from candle-org/mindnlp

Easy-to-use and high-performance NLP and LLM framework based on MindSpore, compatible with models and datasets of 🤗Huggingface.

Jupyter Notebook 1 Updated Mar 26, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

MQN-80 MQN-80

Achievements

Achievements

Block or report MQN-80

Lists (1)

🚀 My stack

Stars

Natfii / UnrealClaude

Z1zs / Causal-Embed

SUSTechBruce / LOOK-M

AIoT-MLSys-Lab / MEDA

HarryWu99 / llm_kvcache_sparsity

Wuxb02 / assist-agent

microsoft / agent-lightning

Leezekun / MMSci

memovai / mem

SkyworkAI / UniPic

OpenGVLab / Docopilot

mayubo2333 / MMLongBench-Doc

jfalcou / eve

GeeeekExplorer / nano-vllm

MQN-80 / black-myth-agent

lyj20071013 / Triton-FlashAttention

SchedMD / slurm

Vchitect / ShotBench

HKUDS / RAG-Anything

HKUDS / LightRAG

xlite-dev / LeetCUDA

MQN-80 / mindnlp

illuin-tech / vidore-benchmark

bytedance / Dolphin

illuin-tech / colpali

PaddlePaddle / PaddleOCR

Alibaba-NLP / VRAG

Obmutescence / COCO-MINI

lupantech / PromptPG

SpursGoZmy / Tabular-LLM