Stars
[ICML 2025] Official PyTorch Implementation of "History-Guided Video Diffusion"
Official codebase for "Self Forcing: Bridging Training and Inference in Autoregressive Video Diffusion" (NeurIPS 2025 Spotlight)
Code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"
Collections of optimizers, LR schedulers, and loss functions in PyTorch
Reference PyTorch implementation and models for DINOv3
The GrandTour Dataset: A Legged Robotics Dataset in the Wild
Official implementation of the paper "Zeroth-Order Adaptive Neuron Alignment Based Pruning without Re-Training", [TMLR & Workshop (SLLM) @ ICLR 2025]
Combining Grouped-Query Attention (https://arxiv.org/abs/2305.13245) with Deformable Attention (https://arxiv.org/abs/2201.00520) in PyTorch.
[NeurIPS 2025 Spotlight] TPA: Tensor ProducT ATTenTion Transformer (T6) (https://arxiv.org/abs/2501.06425)
Implementation of Deformable Attention in PyTorch from the paper "Vision Transformer with Deformable Attention"
TransMLA: Multi-Head Latent Attention Is All You Need (NeurIPS 2025 Spotlight)
Flexible and powerful tensor operations for readable and reliable code (for PyTorch, JAX, TensorFlow, and others)
Helpful tools and examples for working with flex-attention
The open-source implementation of grouped-query attention from the paper "GQA: Training Generalized Multi-Query Transformer Models from Multi-Head Checkpoints"
(Unofficial) PyTorch implementation of grouped-query attention (GQA) from "GQA: Training Generalized Multi-Query Transformer Models from Multi-Head Checkpoints" (https://arxiv.org/pdf/2305.13245.pdf)
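The core idea behind both GQA entries above, a small number of key/value heads each shared by a group of query heads, can be sketched in a few lines (a NumPy illustration under arbitrary head counts and shapes, not the repos' actual code):

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def grouped_query_attention(q, k, v):
    """q: (Hq, S, D) query heads; k, v: (Hkv, S, D) with Hq % Hkv == 0.
    Each key/value head is shared by Hq // Hkv query heads."""
    hq, _, d = q.shape
    group = hq // k.shape[0]
    # replicate each KV head across its group of query heads
    k = np.repeat(k, group, axis=0)
    v = np.repeat(v, group, axis=0)
    scores = q @ k.transpose(0, 2, 1) / np.sqrt(d)   # (Hq, S, S)
    return softmax(scores) @ v                        # (Hq, S, D)

# hypothetical sizes: 8 query heads sharing 2 KV heads over 6 tokens
rng = np.random.default_rng(0)
q = rng.standard_normal((8, 6, 4))
k = rng.standard_normal((2, 6, 4))
v = rng.standard_normal((2, 6, 4))
out = grouped_query_attention(q, k, v)
print(out.shape)  # (8, 6, 4)
```

With `Hkv == 1` this degenerates to multi-query attention, and with `Hkv == Hq` to standard multi-head attention; GQA interpolates between the two to shrink the KV cache with little quality loss.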
Several types of attention modules written in PyTorch for learning purposes
Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚
You like pytorch? You like micrograd? You love tinygrad! ❤️
Development repository for the Triton language and compiler
The official implementation of the paper "What Matters in Transformers? Not All Attention is Needed".
MineStudio: A Streamlined Package for Minecraft AI Agent Development
This repository contains a collection of surveys, datasets, papers, and code for predictive uncertainty estimation in deep learning models.
[CVPR 2025] DEIM: DETR with Improved Matching for Fast Convergence