Skip to content
View NouamaneTazi's full-sized avatar

Organizations

@huggingface @Hugging-Face-Supporter @embeddings-benchmark

Block or report NouamaneTazi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 94,983 14,552 Updated May 17, 2026

My continuously updated Machine Learning, Probabilistic Models and Deep Learning notes and demos (2000+ slides) 我不间断更新的机器学习,概率模型和深度学习的讲义(2000+页)和视频链接

Jupyter Notebook 9,653 1,770 Updated Mar 4, 2026

A kernel library written in tilelang

Python 1,524 126 Updated Apr 23, 2026

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

Python 6,229 570 Updated May 17, 2026

Open source repository of plugins primarily intended for knowledge workers to use in Claude Cowork

Python 12,260 1,491 Updated May 16, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 372,614 77,237 Updated May 17, 2026

Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…

Shell 124,343 20,472 Updated May 15, 2026

Mawaqit integration - salat time and nearest mosque - in Home Assistant

Python 101 25 Updated Apr 10, 2026

Accelerating MoE with IO and Tile-aware Optimizations

Python 684 86 Updated May 14, 2026

LM engine is a library for pretraining/finetuning LLMs

Python 171 29 Updated May 17, 2026

DeepEP: an efficient expert-parallel communication library

Cuda 9,631 1,244 Updated May 13, 2026

FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batchsizes of 16-32 tokens.

Python 1,076 88 Updated Sep 4, 2024

frozen-in-time version of our Paper Finder agent for reproducing evaluation results

Python 241 29 Updated Mar 17, 2026

Easily embed, cluster and semantically label text datasets

Python 605 48 Updated Mar 28, 2024

Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard and designing lighteval!

Jupyter Notebook 2,111 122 Updated Dec 3, 2025

Distributed Compiler based on Triton for Parallel Systems

Python 1,426 142 Updated Apr 22, 2026

A debugging and profiling tool that can trace and visualize python code execution

Python 7,640 467 Updated Feb 16, 2026

A flexible and efficient training framework for large-scale alignment tasks

Python 453 39 Updated Oct 23, 2025

torchcomms: a modern PyTorch communications API

C++ 363 147 Updated May 17, 2026

CSCS User Lab Day – Meet the Swiss National Supercomputing Centre

Jupyter Notebook 13 10 Updated Sep 12, 2025

The best ChatGPT that $100 can buy.

Python 53,569 7,203 Updated May 5, 2026

NCCL Tests

Cuda 1,523 370 Updated Apr 13, 2026

Post-training with Tinker

Python 3,294 419 Updated May 17, 2026

iperf3: A TCP, UDP, and SCTP network bandwidth measurement tool

C 8,485 1,416 Updated May 14, 2026

A tool for bandwidth measurements on NVIDIA GPUs.

C++ 699 81 Updated Apr 8, 2026

Collection of best practices, reference architectures, model training examples and utilities to train large models on AWS.

Shell 423 191 Updated May 17, 2026

Analyze computation-communication overlap in V3/R1.

1,156 148 Updated Mar 21, 2025

Pipeline Parallelism Emulation and Visualization

Python 82 9 Updated Jan 8, 2026

Bridge Megatron-Core to Hugging Face/Reinforcement Learning

Python 211 72 Updated May 17, 2026
Next