Skip to content
View ptrendx's full-sized avatar

Organizations

@apache @NVIDIA

Block or report ptrendx

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
4 stars written in Python
Clear filter

🚀 Level up your GitHub profile readme with customizable cards including LOC statistics!

Python 16,170 120 Updated Jan 25, 2025

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Python 9,984 1,665 Updated Nov 6, 2025

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance…

Python 2,887 540 Updated Nov 6, 2025

distributed-embeddings is a library for building large embedding based models in Tensorflow 2.

Python 46 12 Updated Oct 17, 2023