Skip to content
View henrique's full-sized avatar
:shipit:
:shipit:

Highlights

  • Pro

Block or report henrique

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Box64 - Linux Userspace x86_64 Emulator with a twist, targeted at ARM64, RV64 and LoongArch Linux devices

C 5,464 430 Updated Jun 12, 2026

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 7,370 1,043 Updated Jun 4, 2026
Python 607 59 Updated May 21, 2026

GitHub Copilot LLM Gateway is a companion extension for GitHub Copilot that adds support for self-hosted open-source models. It seamlessly integrates with the Copilot chat experience, allowing you …

TypeScript 36 18 Updated Jun 10, 2026
Rust 8 10 Updated Jun 9, 2026

Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.

Python 66,355 5,940 Updated Jun 12, 2026
Shell 1 1 Updated Mar 10, 2026

A CSCS theme for slidev presentations

Vue 2 1 Updated Mar 25, 2025

Markdown template for making a slide deck

CSS 4 2 Updated May 26, 2025

Real-time global intelligence dashboard. AI-powered news aggregation, geopolitical monitoring, and infrastructure tracking in a unified situational awareness interface

TypeScript 56,356 9,008 Updated Jun 12, 2026

A new simplified version of the GPU saturation scorer for CSCS workloads

C 4 1 Updated Jun 8, 2026

Inference of Machine Learning weather forecasting models

Python 41 30 Updated Jun 12, 2026

Training library for Megatron-based models with bidirectional Hugging Face conversion capability

Python 720 361 Updated Jun 12, 2026

cuTile is a programming model for writing parallel kernels for NVIDIA GPUs

Python 2,068 140 Updated Jun 12, 2026

UCCL is an efficient communication library for GPUs, covering collectives, P2P (e.g., KV cache transfer, RL weight transfer), and EP (e.g., GPU-driven)

C++ 1,413 157 Updated Jun 12, 2026

DeepEP: an efficient expert-parallel communication library

Cuda 9,724 1,283 Updated Jun 11, 2026
Rust 23 7 Updated Mar 2, 2026
Python 38 3 Updated Sep 29, 2025

"rsync for cloud storage" - Google Drive, S3, Dropbox, Backblaze B2, One Drive, Swift, Hubic, Wasabi, Google Cloud Storage, Azure Blob, Azure Files, Yandex Files

Go 57,825 5,148 Updated Jun 11, 2026
Python 1 2 Updated Mar 4, 2026

A concise, beginner-friendly introduction to the core ideas of linear algebra.

Jupyter Notebook 1,977 64 Updated Mar 16, 2026

Burn is a next generation tensor library and Deep Learning Framework that doesn't compromise on flexibility, efficiency and portability.

Rust 15,408 939 Updated Jun 12, 2026

Fault tolerance for PyTorch (HSDP, LocalSGD, DiLoCo, Streaming DiLoCo)

Python 511 68 Updated Jun 9, 2026

Build compute kernels and load them from the Hub.

Python 690 105 Updated Jun 12, 2026

Verification and viszualization suite for the SwissAI + Climate models

Python 6 Updated Jun 12, 2026

Tool for generating high quality Synthetic datasets

Python 1,598 219 Updated Oct 28, 2025

Singularity recipes for containers directly provided to LUMI user or used as part of the environment

Shell 1 2 Updated Sep 16, 2025

CUDA Python: Performance meets Productivity

Cython 3,288 297 Updated Jun 12, 2026

CSCS public documentation

Shell 34 48 Updated Jun 11, 2026

Build, test and deploy containers at CSCS

Dockerfile 2 4 Updated Jan 5, 2024
Next