Demystifying Datapath Accelerator Enhanced Off-path SmartNIC [ICNP24]

C++ 57 13 Updated Dec 5, 2024

Large Scale Failure recovery algorithms

Python 3 Updated Jan 26, 2017

Example of multi-process, multi-GPU training using Torch-parallel, nVidia-nccl, and nVidia-MPS

Lua 17 4 Updated Sep 22, 2016

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 41,868 4,760 Updated Mar 22, 2026

PEAKS: Power Efficiency Aware Kubernetes Scheduler

Jupyter Notebook 39 4 Updated Mar 19, 2026

GTNS is a discrete-event network simulator targeted primarily at research and educational use. GTNS is written in the Visual C++ programming language and supports different network topologies. This si…

C++ 21 8 Updated Apr 13, 2021

Kubernetes training from basics to advanced

Go 62 6 Updated Mar 19, 2026

Locust load-testing tool on OpenFaaS

Python 9 2 Updated Nov 29, 2017

FEDML - The unified and scalable ML library for large-scale distributed training, model serving, and federated learning. FEDML Launch, a cross-cloud scheduler, further enables running any AI jobs o…

Python 4,019 767 Updated Oct 28, 2025

A library for Partially Homomorphic Encryption in Python

Python 633 142 Updated Aug 4, 2023
Python 10 2 Updated Apr 19, 2022

Declarative cluster management using constraint programming, where constraints are described using SQL.

Java 103 18 Updated May 9, 2023

AWS virtual GPU device plugin provides the capability to use smaller virtual GPUs for your machine learning inference workloads

Jupyter Notebook 203 31 Updated Nov 22, 2023

Reference implementations of MLPerf® training benchmarks

Python 1,749 585 Updated Mar 12, 2026

Determined is an open-source machine learning platform that simplifies distributed training, hyperparameter tuning, experiment tracking, and resource management. Works with PyTorch and TensorFlow.

Go 3,216 372 Updated Mar 20, 2025

Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.

Python 14,679 2,251 Updated Dec 1, 2025

YOLO3D: End-to-end real-time 3D Oriented Object Bounding Box Detection from LiDAR Point Cloud (ECCV 2018)

Python 310 46 Updated Aug 7, 2020
Python 1 2 Updated Jun 15, 2023

Kubernetes networking based on Open vSwitch

Go 1,781 457 Updated Mar 22, 2026

Borg cluster traces from Google

TeX 1,041 209 Updated Feb 17, 2026

Microsoft Azure Traces

Jupyter Notebook 1,092 176 Updated Dec 6, 2025

P4_16 reference compiler

C++ 815 510 Updated Mar 20, 2026
C++ 2 1 Updated May 9, 2020

Sources and examples for ASPLOS20 paper

C++ 14 7 Updated Jul 21, 2020

Set of Experiments for Lambda NIC project

P4 1 2 Updated Apr 23, 2021
Shell 3 3 Updated Dec 10, 2021

Open API for IP Applications to Offload TCP/UDP Session Packet Processing to Hardware

C 22 15 Updated Apr 7, 2023