-
flexible-flash-attention Public
Forked from Dao-AILab/flash-attentionFast and memory-efficient exact attention
-
pytorch Public
Forked from pytorch/pytorchTensors and Dynamic neural networks in Python with strong GPU acceleration
Python Other UpdatedMay 4, 2026 -
Megatron-LM Public
Forked from NVIDIA/Megatron-LMOngoing research training transformer models at scale
Python Other UpdatedDec 24, 2025 -
-
-
cutlass-ffa-0.0.3 Public
Forked from NVIDIA/cutlassCUDA Templates for Linear Algebra Subroutines
C++ Other UpdatedFeb 28, 2025 -
-
-
-
grouped_gemm Public
Forked from fanshiqing/grouped_gemmPyTorch bindings for CUTLASS grouped GEMM.
Cuda Apache License 2.0 UpdatedJul 18, 2024 -
-
-
-
ColossalAI Public
Forked from hpcaitech/ColossalAIMaking large AI models cheaper, faster and more accessible
Python Apache License 2.0 UpdatedNov 1, 2023 -
Paddle Public
Forked from PaddlePaddle/PaddlePArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)
C++ Apache License 2.0 UpdatedOct 30, 2023 -
-
v2ray-core Public
Forked from v2ray/v2ray-coreA platform for building proxies to bypass network restrictions.
Go MIT License UpdatedOct 18, 2023 -
-
-
PaDiff Public
Forked from PaddlePaddle/PaDiffPaddle Automatically Diff Precision Toolkits.
Python UpdatedSep 24, 2023 -
PaddleNLP Public
Forked from PaddlePaddle/PaddleNLP👑 Easy-to-use and powerful NLP library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search, ❓ Ques…
Python Apache License 2.0 UpdatedSep 24, 2023 -
hpds Public
Forked from xiaoniaoyouhuajiang/hpdshigh performance data structure for python
Python MIT License UpdatedSep 20, 2023 -
PaPerf Public
Forked from Xreki/PaPerfA tools to automatically analysis and compare the layer-wise perf.
Python Apache License 2.0 UpdatedSep 18, 2023 -
tvm Public
Forked from apache/tvmOpen deep learning compiler stack for cpu, gpu and specialized accelerators
Python Apache License 2.0 UpdatedAug 8, 2023 -
PHTrans Public
Forked from lseventeen/PHTrans[MICCAI2022] PHTrans: Parallelly Aggregating Global and Local Representations for Medical Image Segmentation
Python Apache License 2.0 UpdatedJun 28, 2023 -
-
TransUNet Public
Forked from Beckschen/TransUNetThis repository includes the official project of TransUNet, presented in our paper: TransUNet: Transformers Make Strong Encoders for Medical Image Segmentation.
Python Apache License 2.0 UpdatedJun 28, 2023 -
Pytorch-UNet Public
Forked from milesial/Pytorch-UNetPyTorch implementation of the U-Net for image semantic segmentation with high quality images
Python GNU General Public License v3.0 UpdatedJun 7, 2023 -
-