Starred repositories

SGLang Omni: High-Performance Multi-Stage Pipeline Framework for Omni Models

Python 182 74 Updated Apr 11, 2026

Fastest, smallest, and fully autonomous AI assistant infrastructure written in Zig

Zig 7,176 840 Updated Apr 10, 2026

Run OpenClaw more securely inside NVIDIA OpenShell with managed inference

TypeScript 19,041 2,329 Updated Apr 12, 2026

Hundreds of models & providers. One command to find what runs on your hardware.

Rust 23,025 1,373 Updated Apr 12, 2026

A curated list of awesome LLM and AI Agent Skills, resources and tools for customising AI Agent workflows - that works with Claude Code, Codex, Gemini CLI and your custom AI Agents

Python 1,113 134 Updated Dec 25, 2025

[ECCV2024] This is an official inference code of the paper "Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering" and "Glyph-ByT5-v2: A Strong Aesthetic Baseline for Accurate Mu…

Jupyter Notebook 622 31 Updated Sep 5, 2025

[NeurIPS 2025] 4KAgent: Agentic Any Image to 4K Super-Resolution. An intelligent computer vision agent that can magically restore any image to perfect-4K!

Python 784 44 Updated Sep 24, 2025

NVIDIA Inference Xfer Library (NIXL)

C++ 972 288 Updated Apr 12, 2026

Docker configuration for running VLLM on dual DGX Sparks

Shell 985 176 Updated Apr 12, 2026

One-command vLLM installation for NVIDIA DGX Spark with Blackwell GB10 GPUs (sm_121 architecture)

Shell 79 13 Updated Oct 28, 2025

A framework for efficient model inference with omni-modality models

Python 4,245 735 Updated Apr 12, 2026

The Ultimate Linux micro distribution written in JavaScript! A very functional minimal userspace for Linux written in... pure JavaScript! Not quite, but almost. It's good, I promise!

JavaScript 293 12 Updated Dec 24, 2025

FlashInfer: Kernel Library for LLM Serving

Python 5,375 890 Updated Apr 11, 2026

Native and Compact Structured Latents for 3D Generation

Python 5,319 622 Updated Jan 10, 2026

Helpful kernel tutorials and examples for tile-based GPU programming

Python 699 60 Updated Apr 12, 2026

cuTile is a programming model for writing parallel kernels for NVIDIA GPUs

Python 2,014 130 Updated Apr 11, 2026

Wan: Open and Advanced Large-Scale Video Generative Models

Python 15,202 1,850 Updated Mar 17, 2026

Official inference repo for FLUX.1 models

Python 25,398 1,873 Updated Jul 31, 2025

Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model

Python 1,300 54 Updated Jun 8, 2025

xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism

Python 2,592 314 Updated Apr 9, 2026

Collective communications library with various primitives for multi-machine training.

C++ 1,418 353 Updated Mar 20, 2026

SAM 3D Objects

Python 6,414 761 Updated Mar 12, 2026

Machine Learning Systems

JavaScript 23,564 2,828 Updated Apr 12, 2026

A Jupyter - Three.js bridge

JavaScript 989 194 Updated Oct 10, 2024

CUDA Python: Performance meets Productivity

Cython 3,215 269 Updated Apr 12, 2026

Index your Gmail Inbox with Elasticsearch

Python 2,057 160 Updated Mar 12, 2026

Open-source library for scalable, reproducible evaluation of AI models and benchmarks.

Python 253 41 Updated Apr 11, 2026

Command line tool to create and query container image manifest list/indexes

Go 836 99 Updated Mar 30, 2026

DC-Gen: Post-Training Diffusion Acceleration with Deeply Compressed Latent Space

Python 365 11 Updated Oct 5, 2025

[ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models

Python 3,784 242 Updated Mar 7, 2026