AbelHutten, Zurich

[ICML 2025] Official PyTorch Implementation of "History-Guided Video Diffusion"

Python 581 29 Updated Jul 1, 2025
Python 707 31 Updated Dec 5, 2024

Official codebase for "Self Forcing: Bridging Training and Inference in Autoregressive Video Diffusion" (NeurIPS 2025 Spotlight)

Python 2,982 219 Updated Sep 12, 2025

code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"

Python 1,114 60 Updated Nov 9, 2025

optimizer & lr scheduler & loss function collections in PyTorch

Python 377 34 Updated Dec 21, 2025
Python 122 5 Updated Jun 11, 2025

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 8,872 654 Updated Nov 20, 2025

The GrandTour Dataset: A Legged Robotics Dataset in the Wild

Jupyter Notebook 91 4 Updated Sep 2, 2025
Python 327 41 Updated Nov 26, 2025

Official implementation of the paper "Zeroth-Order Adaptive Neuron Alignment Based Pruning without Re-Training", [TMLR & Workshop (SLLM) @ ICLR 2025]

Python 7 Updated Oct 27, 2025

Combining Grouped-Query Attention (https://arxiv.org/abs/2305.13245) with Deformable Attention (https://arxiv.org/abs/2201.00520) in PyTorch.

Python 1 Updated Mar 15, 2025

[NeurIPS 2025 Spotlight] TPA: Tensor ProducT ATTenTion Transformer (T6) (https://arxiv.org/abs/2501.06425)

Python 435 36 Updated Dec 16, 2025

Implementation of Deformable Attention in PyTorch from the paper "Vision Transformer with Deformable Attention"

Python 370 33 Updated Feb 3, 2025

TransMLA: Multi-Head Latent Attention Is All You Need (NeurIPS 2025 Spotlight)

Python 418 25 Updated Sep 23, 2025

Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)

Python 9,328 390 Updated Nov 24, 2025

Helpful tools and examples for working with flex-attention

Python 1,092 67 Updated Dec 18, 2025

An open-source implementation of grouped-query attention from the paper "GQA: Training Generalized Multi-Query Transformer Models from Multi-Head Checkpoints"

Python 15 1 Updated Dec 11, 2023

(Unofficial) PyTorch implementation of grouped-query attention (GQA) from "GQA: Training Generalized Multi-Query Transformer Models from Multi-Head Checkpoints" (https://arxiv.org/pdf/2305.13245.pdf)

Python 186 11 Updated May 9, 2024

several types of attention modules written in PyTorch for learning purposes

Python 52 11 Updated Oct 1, 2024

Open Machine Learning Compiler Framework

Python 12,945 3,740 Updated Dec 21, 2025

Tutorials on tinygrad

Python 444 31 Updated Oct 10, 2025

Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚

Python 34,945 2,357 Updated Dec 20, 2025

You like pytorch? You like micrograd? You love tinygrad! ❤️

Python 30,906 3,785 Updated Dec 21, 2025

Development repository for the Triton language and compiler

MLIR 17,891 2,462 Updated Dec 21, 2025

The official implementation of the paper "What Matters in Transformers? Not All Attention is Needed".

Python 186 22 Updated Nov 14, 2025

MineStudio: A Streamlined Package for Minecraft AI Agent Development

Python 312 25 Updated Oct 12, 2025

The Modular Platform (includes MAX & Mojo)

Mojo 25,369 2,742 Updated Dec 21, 2025

This repository contains a collection of surveys, datasets, papers, and code for predictive uncertainty estimation in deep learning models.

776 75 Updated Dec 5, 2025

[CVPR 2025] DEIM: DETR with Improved Matching for Fast Convergence

Python 1,360 176 Updated Sep 26, 2025