Skip to content
View dzhulgakov's full-sized avatar

Block or report dzhulgakov

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Optimizing inference proxy for LLMs

Python 3,395 267 Updated Mar 19, 2026

Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

Python 3,740 312 Updated May 21, 2025

[ICLR 2024] Lemur: Open Foundation Models for Language Agents

Python 557 34 Updated Oct 28, 2023

A list of startups that have employee-friendly terms for exercising your options past 90 days.

1,194 140 Updated Feb 11, 2026

AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.

Python 4,713 382 Updated Mar 16, 2026

Development repository for the Triton language and compiler

MLIR 18,787 2,711 Updated Mar 28, 2026

A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters.

HTML 938 240 Updated Mar 27, 2026

miniz: Single C source file zlib-replacement library, originally from code.google.com/p/miniz

C++ 2,700 399 Updated Feb 13, 2026

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 98,609 27,332 Updated Mar 28, 2026

Write PyTorch code at the level of individual examples, then run it efficiently on minibatches.

Python 485 23 Updated Feb 12, 2022

Convert TensorFlow, Keras, Tensorflow.js and Tflite models to ONNX

Jupyter Notebook 2,521 461 Updated Mar 4, 2026

Demo of running NNs across different frameworks

Jupyter Notebook 1,656 355 Updated Oct 8, 2022

The convertor/conversion of deep learning models for different deep learning frameworks/softwares.

3,245 482 Updated Jun 26, 2023

Original Python version of Intel® Nervana™ Graph

Python 214 38 Updated Oct 5, 2022