jdebache

Julien Debache jdebache

11 followers · 30 following

Zurich
in/julien-debache-354759127

Achievements

Highlights

vllm Public
Forked from vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python Apache License 2.0 Updated May 18, 2026
flashinfer Public
Forked from flashinfer-ai/flashinfer

FlashInfer: Kernel Library for LLM Serving

Python Apache License 2.0 Updated May 13, 2026
dynamo Public
Forked from ai-dynamo/dynamo

A Datacenter Scale Distributed Inference Serving Framework

Rust Other Updated Apr 22, 2026
TensorRT-LLM Public
Forked from NVIDIA/TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

Python Other Updated Feb 7, 2026
recipes Public
Forked from vllm-project/recipes

Common recipes to run vLLM

Jupyter Notebook Apache License 2.0 Updated Dec 3, 2025
transformers Public
Forked from huggingface/transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python Apache License 2.0 Updated Oct 31, 2025
lm-evaluation-harness Public
Forked from EleutherAI/lm-evaluation-harness

A framework for few-shot evaluation of language models.

Python MIT License Updated Oct 29, 2025
numba Public
Forked from numba/numba

NumPy aware dynamic Python compiler using LLVM

Python BSD 2-Clause "Simplified" License Updated Oct 22, 2025
llvmlite Public
Forked from numba/llvmlite

A lightweight LLVM python binding for writing JIT compilers

Python BSD 2-Clause "Simplified" License Updated Oct 22, 2025
tokenizers Public
Forked from huggingface/tokenizers

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

Rust Apache License 2.0 Updated Oct 16, 2025
FlashMLA Public
Forked from deepseek-ai/FlashMLA

FlashMLA: Efficient MLA kernels

C++ MIT License Updated Oct 10, 2025
cutlass Public
Forked from NVIDIA/cutlass

CUDA Templates for Linear Algebra Subroutines

C++ Other Updated Aug 6, 2025
pytorch Public
Forked from pytorch/pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python Other Updated Aug 4, 2025
buck2 Public
Forked from facebook/buck2

Build system, successor to Buck

Rust Apache License 2.0 Updated Jun 23, 2025
spdlog Public
Forked from gabime/spdlog

Fast C++ logging library.

C++ Other Updated Apr 16, 2025
mscclpp Public
Forked from microsoft/mscclpp

MSCCL++: A GPU-driven communication stack for scalable AI applications

C++ MIT License Updated Apr 14, 2025
dlpack Public
Forked from dmlc/dlpack

common in-memory tensor structure

C++ Apache License 2.0 Updated Apr 14, 2025
gallery Public

Starlark Updated Apr 13, 2025
nvbench Public
Forked from NVIDIA/nvbench

CUDA Kernel Benchmarking Library

Cuda Apache License 2.0 Updated Apr 10, 2025
rules_cuda Public
Forked from bazel-contrib/rules_cuda

Starlark implementation of bazel rules for CUDA.

Starlark MIT License Updated Apr 4, 2025
TensorRT-Model-Optimizer Public
Forked from NVIDIA/Model-Optimizer

nvidia-modelopt is a unified library of state-of-the-art model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for do…

Python Other Updated Apr 3, 2025
bazel-compile-commands-extractor Public
Forked from hedronvision/bazel-compile-commands-extractor

Goal: Enable awesome tooling for Bazel users of the C language family.

Python Other Updated Oct 8, 2024
rules_cc Public
Forked from bazelbuild/rules_cc

C++ Rules for Bazel

Starlark Apache License 2.0 Updated Aug 25, 2024
vscode-bazel Public
Forked from bazel-contrib/vscode-bazel

Bazel support for Visual Studio Code

TypeScript Apache License 2.0 Updated Jan 2, 2024
repros Public

C# 1 Updated Dec 27, 2023
missing-deps Public

Starlark Updated Dec 15, 2023
noff-privacy Public

HTML Updated Dec 15, 2023
lawrencium Public

C++ Updated Dec 14, 2023
tensorflow Public
Forked from tensorflow/tensorflow

An Open Source Machine Learning Framework for Everyone

C++ Apache License 2.0 Updated Dec 5, 2023
TensorRT Public
Forked from NVIDIA/TensorRT

NVIDIA® TensorRT™, an SDK for high-performance deep learning inference, includes a deep learning inference optimizer and runtime that delivers low latency and high throughput for inference applicat…

C++ Apache License 2.0 Updated Dec 3, 2023

Julien Debache jdebache

Achievements

Achievements

Highlights

vllm Public

Uh oh!

flashinfer Public

Uh oh!

dynamo Public

Uh oh!

TensorRT-LLM Public

Uh oh!

recipes Public

Uh oh!

transformers Public

Uh oh!

lm-evaluation-harness Public

Uh oh!

numba Public

Uh oh!

llvmlite Public

Uh oh!

tokenizers Public

Uh oh!

FlashMLA Public

Uh oh!

cutlass Public

Uh oh!

pytorch Public

Uh oh!

buck2 Public

Uh oh!

spdlog Public

Uh oh!

mscclpp Public

Uh oh!

dlpack Public

Uh oh!

gallery Public

Uh oh!

nvbench Public

Uh oh!

rules_cuda Public

Uh oh!

TensorRT-Model-Optimizer Public

Uh oh!

bazel-compile-commands-extractor Public

Uh oh!

rules_cc Public

Uh oh!

vscode-bazel Public

Uh oh!

repros Public

Uh oh!

missing-deps Public

Uh oh!

noff-privacy Public

Uh oh!

lawrencium Public

Uh oh!

tensorflow Public

Uh oh!

TensorRT Public

Uh oh!