Stars
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance…
Tensors and Dynamic neural networks in Python with strong GPU acceleration
MSCCL++: A GPU-driven communication stack for scalable AI applications
NLP DNN Toolkit - Building Your NLP DNN Models Like Playing Lego
Resource scheduling and cluster management for AI
An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.
Visual Studio Tools for AI is a free Visual Studio extension to build, test, and deploy deep learning / AI solutions. It seamlessly integrates with Azure Machine Learning for robust experimentation…