alexqdh

Starred repositories (each entry: description, then language, stars, forks, and last update, where shown)

⚡ HugoBlox: Markdown sites in minutes. Academic/resume/lab/portfolio for AI researchers & startups. Premium templates. Deploy to GitHub Pages now in 1-click 👇

HTML 9,250 2,959 Updated Dec 18, 2025

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance…

Python 3,022 586 Updated Dec 22, 2025

A next.js web application that integrates AI capabilities with draw.io diagrams. This app allows you to create, modify, and enhance diagrams through natural language commands and AI-assisted visual…

TypeScript 14,654 1,503 Updated Dec 23, 2025

Contexts Optical Compression

Python 21,554 1,927 Updated Oct 25, 2025

The best ChatGPT that $100 can buy.

Python 39,135 4,955 Updated Dec 23, 2025

Domain-specific language designed to streamline the development of high-performance GPU/CPU/accelerator kernels

C++ 4,293 355 Updated Dec 23, 2025

Qianfan-VL: Domain-Enhanced Universal Vision-Language Models

176 13 Updated Sep 22, 2025

Bridge Megatron-Core to Hugging Face/Reinforcement Learning

Python 174 40 Updated Dec 12, 2025

Nano vLLM

Python 10,041 1,256 Updated Nov 3, 2025

C++ 335 33 Updated Dec 20, 2025

An extremely fast Python package and project manager, written in Rust.

Rust 75,517 2,377 Updated Dec 23, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 21,922 3,847 Updated Dec 23, 2025

C 6 Updated Jul 6, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,734 2,877 Updated Dec 23, 2025

Certbot is EFF's tool to obtain certs from Let's Encrypt and (optionally) auto-enable HTTPS on your server. It can also act as a client for any other CA that uses the ACME protocol.

Python 32,706 3,474 Updated Dec 18, 2025

Jupyter Notebook 110 19 Updated Sep 24, 2025

Distributed Compiler based on Triton for Parallel Systems

Python 1,288 114 Updated Dec 16, 2025

MAGI-1: Autoregressive Video Generation at Scale

Python 3,618 228 Updated Jun 17, 2025

mimalloc is a compact general purpose allocator with excellent performance.

C 12,294 1,036 Updated Dec 22, 2025

Lightweight in-process concurrent programming

C++ 1,786 261 Updated Dec 4, 2025

A lightweight data processing framework built on DuckDB and 3FS.

Python 4,875 431 Updated Mar 5, 2025

A bidirectional pipeline parallelism algorithm for computation-communication overlap in DeepSeek V3/R1 training.

Python 2,892 310 Updated Mar 10, 2025

Expert Parallelism Load Balancer

Python 1,322 195 Updated Mar 24, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 5,991 778 Updated Dec 23, 2025

AI chat for any model.

TypeScript 32,853 9,461 Updated Aug 3, 2024

DeepEP: an efficient expert-parallel communication library

Cuda 8,827 1,036 Updated Dec 23, 2025

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 11,932 922 Updated Dec 15, 2025

Fully open reproduction of DeepSeek-R1

Python 25,749 2,406 Updated Nov 24, 2025

Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals

Python 13,169 431 Updated Dec 22, 2025

QLoRA: Efficient Finetuning of Quantized LLMs

Jupyter Notebook 10,793 869 Updated Jun 10, 2024