Skip to content
View idevasena's full-sized avatar

Block or report idevasena

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval

Python 67 11 Updated Sep 10, 2024

Running large language models on a single GPU for throughput-oriented scenarios.

Python 9,372 584 Updated Oct 28, 2024

First Latency-Aware Competitive LLM Agent Benchmark

Python 23 2 Updated Jun 3, 2025

KV cache visualizations

JavaScript 3 Updated Nov 4, 2025

NVIDIA Inference Xfer Library (NIXL)

C++ 702 177 Updated Nov 5, 2025
Python 3 Updated Oct 11, 2025
C++ 10 Updated Oct 15, 2025

The simplest, highest-throughput Python interface to S3, GCS & Azure Storage, powered by Rust.

Python 595 24 Updated Oct 29, 2025

vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization

Python 1,905 313 Updated Nov 5, 2025

Supercharge Your LLM with the Fastest KV Cache Layer

Python 5,891 690 Updated Nov 5, 2025

Part of the sai3 project that delivers multi-protocol storage access for AI/ML workflows. This project provides a CLI, along with Rust and Python libraries for AI/ML storage workflows. Supporting S…

Rust 7 Updated Nov 4, 2025

MLPerf Client is a benchmark for Windows and macOS, focusing on client form factors in ML inference scenarios.

C++ 55 4 Updated Oct 9, 2025

Cutting-edge tool that unlocks the full potential of semantic chunking

Python 18 4 Updated Sep 18, 2025

All-in-Storage Solution based on DiskANN for DRAM-free Approximate Nearest Neighbor Search

C++ 86 11 Updated Jun 30, 2025

Basic benchmark for Vector Databases

Python 4 3 Updated Oct 27, 2025

MLPerf® Storage Benchmark Suite

Python 167 51 Updated Nov 4, 2025

MLPerf™ Storage Benchmark Suite

Python 3 1 Updated Jul 31, 2025

Carefully crafted Alpine Docker image with glibc (~12MB)

Dockerfile 764 184 Updated Oct 27, 2025

An I/O benchmark for deep Learning applications

Python 93 45 Updated Oct 28, 2025

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 9,440 957 Updated Oct 24, 2025

Build userspace NVMe drivers and storage applications with CUDA support

C 399 54 Updated Dec 18, 2023

Newsletter to help busy software engineers become good at system design 👇

17,136 1,896 Updated Nov 3, 2025

This Repository Contains Solution to the Assignments of the Generative Adversarial Networks (GANs) Specialization from deeplearning.ai on Coursera Taught by Sharon Zhou, Eda Zhou, Eric Zelikman

Jupyter Notebook 35 11 Updated Mar 16, 2023

Programming assignments and quizzes from all courses within the GANs specialization offered by deeplearning.ai

Jupyter Notebook 480 326 Updated Jul 1, 2021

Tools to enable the development of mlperf storage

Python 2 Updated Mar 19, 2025

StyleGAN - Official TensorFlow Implementation

Python 14,388 3,179 Updated Apr 10, 2024

Collective Knowledge (CK), Collective Mind (CM/CMX) and MLPerf automations: community-driven projects to facilitate collaborative and reproducible research and to learn how to run AI, ML, and other…

Python 633 121 Updated Sep 13, 2025

MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.

Python 2,072 187 Updated Jun 30, 2025
Next