t-head
Popular repositories
-
FlagTree Public Forked from flagos-ai/FlagTree
FlagTree is a unified compiler supporting multiple AI chip backends for custom Deep Learning operations, which is forked from triton-lang/triton.
-
pytorch-for-sail Public Forked from pytorch/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Python
-
vllm-for-sail Public Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Python 1
-
sglang-for-sail Public Forked from sgl-project/sglang
SGLang is a high-performance serving framework for large language models and multimodal models.
Python
-
tensorflow-for-sail Public Forked from tensorflow/tensorflow
An Open Source Machine Learning Framework for Everyone
C++
-
xformers-for-sail Public Forked from facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
Python
Repositories
- vllm-for-sail Public Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
- cutlass-for-sail Public Forked from NVIDIA/cutlass
CUDA Templates and Python DSLs for High-Performance Linear Algebra
- FlagTree Public Forked from flagos-ai/FlagTree
FlagTree is a unified compiler supporting multiple AI chip backends for custom Deep Learning operations, which is forked from triton-lang/triton.
- xformers-for-sail Public Forked from facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
- pytorch-for-sail Public Forked from pytorch/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
- vllm-flash-attention-for-sail Public Forked from vllm-project/flash-attention
Fast and memory-efficient exact attention
- flash-attention-for-sail Public Forked from Dao-AILab/flash-attention
Fast and memory-efficient exact attention
- sglang-for-sail Public Forked from sgl-project/sglang
SGLang is a high-performance serving framework for large language models and multimodal models.
- tensorflow-for-sail Public Forked from tensorflow/tensorflow
An Open Source Machine Learning Framework for Everyone