Skip to content
View alex0dd's full-sized avatar

Highlights

  • Pro

Block or report alex0dd

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

SGLang Omni: High-Performance Multi-Stage Pipeline Framework for Omni Models

Python 181 72 Updated Apr 11, 2026

Fastest, smallest, and fully autonomous AI assistant infrastructure written in Zig

Zig 7,171 841 Updated Apr 10, 2026

Run OpenClaw more securely inside NVIDIA OpenShell with managed inference

TypeScript 18,977 2,322 Updated Apr 12, 2026

Hundreds of models & providers. One command to find what runs on your hardware.

Rust 22,927 1,367 Updated Apr 11, 2026

A curated list of awesome LLM and AI Agent Skills, resources and tools for customising AI Agent workflows - that works with Claude Code, Codex, Gemini CLI and your custom AI Agents

Python 1,111 133 Updated Dec 25, 2025

[ECCV2024] This is an official inference code of the paper "Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering" and "Glyph-ByT5-v2: A Strong Aesthetic Baseline for Accurate Mu…

Jupyter Notebook 622 31 Updated Sep 5, 2025

[NeurIPS 2025] 4KAgent: Agentic Any Image to 4K Super-Resolution. An intelligent computer vision agent that can magically restore any image to perfect-4K!

Python 783 44 Updated Sep 24, 2025

NVIDIA Inference Xfer Library (NIXL)

C++ 971 288 Updated Apr 11, 2026

Docker configuration for running VLLM on dual DGX Sparks

Shell 977 173 Updated Apr 12, 2026

One-command vLLM installation for NVIDIA DGX Spark with Blackwell GB10 GPUs (sm_121 architecture)

Shell 79 13 Updated Oct 28, 2025

A framework for efficient model inference with omni-modality models

Python 4,234 734 Updated Apr 11, 2026

The Ultimate Linux micro distribution written in JavaScript! A very functional minimal userspace for Linux written in... pure JavaScript! Not quite, but almost. It's good, I promise!

JavaScript 293 12 Updated Dec 24, 2025

FlashInfer: Kernel Library for LLM Serving

Python 5,371 889 Updated Apr 11, 2026

Native and Compact Structured Latents for 3D Generation

Python 5,315 620 Updated Jan 10, 2026

Helpful kernel tutorials and examples for tile-based GPU programming

Python 699 60 Updated Apr 11, 2026

cuTile is a programming model for writing parallel kernels for NVIDIA GPUs

Python 2,013 130 Updated Apr 11, 2026

Wan: Open and Advanced Large-Scale Video Generative Models

Python 15,189 1,848 Updated Mar 17, 2026

Official inference repo for FLUX.1 models

Python 25,395 1,873 Updated Jul 31, 2025

Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model

Python 1,300 54 Updated Jun 8, 2025

xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism

Python 2,592 314 Updated Apr 9, 2026

Collective communications library with various primitives for multi-machine training.

C++ 1,417 353 Updated Mar 20, 2026

SAM 3D Objects

Python 6,408 761 Updated Mar 12, 2026

Machine Learning Systems

JavaScript 23,558 2,825 Updated Apr 11, 2026

A Jupyter - Three.js bridge

JavaScript 989 194 Updated Oct 10, 2024

CUDA Python: Performance meets Productivity

Cython 3,214 269 Updated Apr 12, 2026

Index your Gmail Inbox with Elasticsearch

Python 2,057 160 Updated Mar 12, 2026

Open-source library for scalable, reproducible evaluation of AI models and benchmarks.

Python 253 41 Updated Apr 11, 2026

Command line tool to create and query container image manifest list/indexes

Go 836 99 Updated Mar 30, 2026

DC-Gen: Post-Training Diffusion Acceleration with Deeply Compressed Latent Space

Python 365 11 Updated Oct 5, 2025

[ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models

Python 3,784 242 Updated Mar 7, 2026
Next