Skip to content
View mfuntowicz's full-sized avatar

Organizations

@huggingface

Block or report mfuntowicz

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

HMLL - High-Performance Model Loading Library for Efficient AI Model I/O

C++ 11 Updated Mar 20, 2026

Frame profiler

C++ 15,641 1,046 Updated Apr 11, 2026

Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends

Python 2,375 450 Updated Apr 12, 2026

Hugging Face Jobs

Python 19 6 Updated Jul 11, 2025

A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks …

Python 2,453 350 Updated Apr 13, 2026

A simple, performant and scalable Jax LLM!

Python 2,230 505 Updated Apr 13, 2026

A pytorch quantization backend for optimum

Python 1,036 85 Updated Apr 2, 2026

A retargetable MLIR-based machine learning compiler and runtime toolkit.

C++ 3,709 883 Updated Apr 13, 2026

Transformer related optimization, including BERT, GPT

C++ 6,412 935 Updated Mar 27, 2024

The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.

C++ 1,788 673 Updated Apr 9, 2026
MLIR 423 74 Updated Feb 24, 2026

Backward compatible ML compute opset inspired by HLO/MHLO

MLIR 640 190 Updated Apr 10, 2026

Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)

Python 209 271 Updated Apr 3, 2026

Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search

Go 43,763 3,959 Updated Apr 13, 2026

State-of-the-Art Text Embeddings

Python 18,544 2,770 Updated Apr 10, 2026

Blazing fast training of 🤗 Transformers on Graphcore IPUs

Python 87 33 Updated Apr 3, 2026

a debugger for async rust!

Rust 4,489 164 Updated Apr 9, 2026

AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.

Python 2,592 451 Updated Apr 11, 2026

Hydra is a framework for elegantly configuring complex applications

Python 10,313 828 Updated Feb 7, 2026

SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on PyTorch, TensorFlow, and ONNX Runtime

Python 2,612 301 Updated Apr 9, 2026

Dapr user documentation, used to build docs.dapr.io

Shell 1,014 777 Updated Apr 10, 2026

[ARCHIVED] The C++ Standard Library for your entire system. See https://github.com/NVIDIA/cccl

C++ 2,308 191 Updated Feb 7, 2024

Simple Python client for the Hugging Face Inference API

Python 75 9 Updated Aug 18, 2020

The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior in an extensible and framework agnostic interface.

TypeScript 3,650 372 Updated Mar 26, 2026

OpenVINO™ is an open source toolkit for optimizing and deploying AI inference

C++ 10,071 3,175 Updated Apr 13, 2026

Cross-platform CLI and Python drivers for AIO liquid coolers and other devices

Python 2,580 263 Updated Mar 24, 2026

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 42,067 4,792 Updated Apr 12, 2026
Next