Skip to content
View lhoestq's full-sized avatar
🤗
🤗

Organizations

@huggingface

Block or report lhoestq

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A WebGL-powered Jupyter Widget for Niivue based on anywidget

Python 44 10 Updated Dec 19, 2025

The simplest, fastest repository for training/finetuning small-sized VLMs.

Python 4,432 432 Updated Oct 27, 2025

Curate High Quality Datasets, Train, Evaluate and Ship! 🚀

Python 676 45 Updated Dec 22, 2025

Apache DataFusion SQL Query Engine

Rust 8,178 1,834 Updated Dec 23, 2025

Apache DataFusion Python Bindings

Python 540 134 Updated Dec 14, 2025

Training LLMs to reason and analyze data with notebooks

Python 58 7 Updated Sep 10, 2025

Train LLM on Hugging Face infra

Python 67 9 Updated Nov 13, 2025

parquet file parser for javascript

JavaScript 726 36 Updated Dec 20, 2025

Jan is an open source alternative to ChatGPT that runs 100% offline on your computer.

TypeScript 39,835 2,450 Updated Dec 23, 2025

A VSCode extension to use Hugging Face Inference Providers in Copilot Chat

TypeScript 50 41 Updated Oct 30, 2025

Synthetic Online Conversations

Python 3 Updated Aug 17, 2025

Embedding Atlas is a tool that provides interactive visualizations for large embeddings. It allows you to visualize, cross-filter, and search embeddings and metadata.

TypeScript 4,470 239 Updated Dec 23, 2025

PySpark custom data source for Hugging Face Datasets

Python 22 6 Updated Aug 12, 2025

Low-level communication with Reachy Mini motors

Rust 46 7 Updated Dec 16, 2025

Apache OpenDAL: One Layer, All Storage.

Rust 4,756 698 Updated Dec 22, 2025

PyTorch media decoding and encoding

Python 879 80 Updated Dec 22, 2025

Set up your GitHub Actions workflow with ffmpeg

JavaScript 135 24 Updated Mar 22, 2024

Fast parquet command line tool with many functions, nailed it!

Rust 63 4 Updated Dec 21, 2025

A lightweight, local-first, and 🆓 experiment tracking library from Hugging Face 🤗

Python 1,160 73 Updated Dec 19, 2025

Metadata extraction and validation in scientific papers

Python 11 3 Updated Dec 22, 2025

Build, enrich, and transform datasets using AI models with no code

TypeScript 1,608 137 Updated Oct 23, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 64,347 7,797 Updated Dec 21, 2025

The official repository of Mozilla's Firefox web browser.

JavaScript 10,813 770 Updated Dec 23, 2025

Efficient BM25 with DuckDB 🦆

Python 59 2 Updated Dec 20, 2024

Craft conversational datasets (JSONL format with rich metadata) using LLMs. Specify parameters manually or use a creative brief for LLM-generated arguments with automatic topic/scenario variation. …

Python 10 1 Updated Apr 17, 2025

Plug-and-play document AI with zero-shot models.

Python 120 8 Updated Dec 22, 2025

ParquetToHuggingFace processes raw audio data, converts it into Parquet files, and uploads them to Hugging Face. The README explains how to set up the environment, configure paths, and run the scri…

Python 9 3 Updated May 16, 2025

OSX hfjobs menubar app

Swift 5 Updated Dec 18, 2025

MCP server for Hugging Face dataset viewer

Python 30 13 Updated Apr 25, 2025
Next