Skip to content
View younesbelkada's full-sized avatar
:octocat:
Working from home
:octocat:
Working from home

Block or report younesbelkada

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Inference repo for Falcon-Perception and Falcon-OCR model, early-fusion, natively multimodal, dense Autoregressive Transformer models.

Python 593 57 Updated Apr 27, 2026

Run BitNet b1.58 ternary LLMs with WebGPU β€” in browsers and native apps

TypeScript 16 Updated Mar 8, 2026

Inference server for MioTTS, a lightweight and fast LLM-based TTS model.

Python 133 19 Updated Feb 14, 2026
9 Updated Jan 5, 2026

A framework for few-shot evaluation of language models.

Python 12,387 3,241 Updated Apr 30, 2026

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 26,790 5,641 Updated Apr 30, 2026

Ongoing research training transformer models at scale

Python 16,203 3,893 Updated Apr 30, 2026

Renderer for the harmony response format to be used with gpt-oss

Rust 4,350 277 Updated Apr 8, 2026

Docker Model Runner

Go 562 120 Updated Apr 30, 2026

Pure Rust engine for BitNet LLMs β€” Conversion, Inference, Training and Research. With streaming and GPU/CPU support

Rust 12 1 Updated Jul 11, 2025

250+ Fine-tuning & RL Notebooks for text, vision, audio, embedding, TTS models.

Jupyter Notebook 5,291 866 Updated Apr 30, 2026

Jan is an open source alternative to ChatGPT that runs 100% offline on your computer.

TypeScript 42,290 2,830 Updated Apr 30, 2026

πŸš€ Efficient implementations for emerging model architectures

Python 5,016 515 Updated Apr 30, 2026

Easily fine-tune, evaluate and deploy gpt-oss, Qwen3, DeepSeek-R1, or any open source LLM / VLM!

Python 9,217 759 Updated Apr 30, 2026

All information and news with respect to Falcon-H1 series

116 14 Updated Oct 9, 2025

Lightweight toolkit package to train and fine-tune 1.58bit Language models

Python 130 9 Updated Apr 30, 2026

Build compute kernels and load them from the Hub.

Python 619 79 Updated Apr 30, 2026

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,987 287 Updated May 15, 2025

Automatic evals for LLMs

HTML 591 81 Updated Feb 24, 2026

A version manager for neovim

Rust 2,090 61 Updated Apr 30, 2026

An extremely fast Python linter and code formatter, written in Rust.

Rust 47,314 2,034 Updated Apr 30, 2026
Python 1,135 53 Updated Jan 10, 2026

Segment Anything for Microscopy

Jupyter Notebook 680 101 Updated Apr 29, 2026

Minimalistic 4D-parallelism distributed training framework for education purpose

Python 2,170 182 Updated Aug 26, 2025

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

Python 134,982 19,189 Updated Apr 24, 2026

Rigourous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024

Python 1,732 195 Updated Oct 2, 2025

MLX: An array framework for Apple silicon

C++ 25,882 1,733 Updated Apr 28, 2026

Official inference framework for 1-bit LLMs

Python 38,751 3,513 Updated Mar 10, 2026

Baichuan-Omni: Towards Capable Open-source Omni-modal LLM 🌊

273 7 Updated Jan 27, 2025

Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard and designing lighteval!

Jupyter Notebook 2,100 121 Updated Dec 3, 2025
Next