doraa7

Yannam C Chiranjeevi doraa7

I help clients across industries set up their IT Systems from scratch(Cloud Native Kubernetes). Looking to help Companies in their DevOps/SRE/K8S Journeys.

12 followers · 96 following

Vincisive Technologies
Bengaluru
09:54 (UTC +05:30)
in/ycchiranjeevi

Lists (3)

Sort

Stars

envoyproxy / ai-gateway

Manages Unified Access to Generative AI Services built on Envoy Gateway

Go 1,889 321 Updated Aug 2, 2026

kserve / open-inference-protocol

Repository for open inference protocol specification

77 15 Updated May 12, 2025

bentoml / BentoML

The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!

Python 8,749 999 Updated Jul 20, 2026

SeldonIO / MLServer

An inference server for your machine learning models, including support for multiple frameworks, multi-model serving and more

Python 898 238 Updated Aug 2, 2026

NVIDIA-AI-Blueprints / rag

This NVIDIA RAG blueprint serves as a reference solution for a foundational Retrieval Augmented Generation (RAG) pipeline.

Python 722 308 Updated Jul 29, 2026

NVIDIA / TensorRT-LLM

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

Python 14,283 2,632 Updated Aug 3, 2026

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 87,998 20,187 Updated Aug 3, 2026

ggml-org / whisper.cpp

Port of OpenAI's Whisper model in C/C++

C++ 52,534 5,976 Updated Jul 31, 2026

ggml-org / llama.cpp

LLM inference in C/C++

C++ 122,483 21,265 Updated Aug 2, 2026

corundum / corundum

Open source FPGA-based NIC and platform for in-network compute

Verilog 2,411 545 Updated Jul 5, 2024

deepseek-ai / 3FS

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 10,095 1,073 Updated May 7, 2026

ava-orange-education / Mastering-Computer-Vision-with-PyTorch-2.0

Mastering Computer Vision with PyTorch 2.0, published by Orange, AVA®

Jupyter Notebook 3 6 Updated Jan 18, 2025

ThoenigAdrian / NeuralNetworksCudaTutorial

Implement Neural Networks in Cuda from Scratch

C++ 23 3 Updated May 17, 2024

kubernetes-sigs / karpenter

Karpenter is a Kubernetes Node Autoscaler built for flexibility, performance, and simplicity.

Go 2,068 542 Updated Jul 31, 2026

doraa7 / neural-network-cuda

Forked from BobMcDear/neural-network-cuda

Neural network from scratch in CUDA/C++

Cuda 1 Updated Jan 17, 2025

BobMcDear / neural-network-cuda

Neural network from scratch in CUDA/C++

Cuda 95 18 Updated Sep 8, 2025

rapidsai / cuml

cuML - RAPIDS Machine Learning Library

Python 5,243 649 Updated Jul 31, 2026

CppCon / CppCon2018

Slides and other materials from CppCon 2018

C++ 1,447 177 Updated Apr 11, 2019

BlazingDB / blazingsql

BlazingSQL is a lightweight, GPU accelerated, SQL engine for Python. Built on RAPIDS cuDF.

C++ 2,011 182 Updated Sep 16, 2022

vdesai2014 / inference-optimization-blog-post

Jupyter Notebook 92 7 Updated Feb 29, 2024

deepspeedai / DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 42,850 4,922 Updated Aug 3, 2026

NVIDIA / Megatron-LM

Ongoing research training transformer models at scale

Python 17,302 4,322 Updated Aug 3, 2026

EleutherAI / gpt-neox

An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries

Python 7,447 1,119 Updated Jun 11, 2026

everpeace / kube-openmpi

Open MPI jobs on Kubernetes

Makefile 120 25 Updated Apr 17, 2018

ofiwg / libfabric

Open Fabric Interfaces

C 819 513 Updated Aug 1, 2026

aws / aws-ofi-nccl

This is a plugin which lets EC2 developers use libfabric as network provider while running NCCL applications.

C++ 229 101 Updated Jul 31, 2026

data-prep-kit / data-prep-kit

Open source project for data preparation for GenAI applications

HTML 952 252 Updated Jul 14, 2026

NVIDIA / cccl

CUDA Core Compute Libraries

C++ 2,447 453 Updated Aug 3, 2026

NVIDIA / cuvs

cuVS - a library for vector search and clustering on the GPU

Cuda 829 215 Updated Aug 2, 2026

NVIDIA / raft

RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for more easily writing …

Cuda 1,034 245 Updated Jul 31, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly