Skip to content
View philass's full-sized avatar

Organizations

@groq @onnx

Block or report philass

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Home for "How To Scale Your Model", a short blog-style textbook about scaling LLMs on TPUs

HTML 900 129 Updated Mar 15, 2026

Exports of popular models in StableHLO

MLIR 3 Updated Jan 16, 2025

Want a faster ML processor? Do it yourself! -- A framework for playing with custom opcodes to accelerate TensorFlow Lite for Microcontrollers (TFLM). . . . . . Online tutorial: https://google.githu…

Verilog 549 157 Updated Feb 26, 2026

Unified compiler/runtime for interfacing with PyTorch Dynamo.

Python 100 45 Updated Apr 8, 2026
Rust 8 Updated Jan 21, 2024

A mini x86 linux debugger for teaching purposes

C++ 649 107 Updated Aug 2, 2024

Playground with Neural Networks from scratch, using CPP

C++ 2 Updated Jun 20, 2021

GroqFlow provides an automated tool flow for compiling machine learning and linear algebra workloads into Groq programs and executing those programs on GroqChip™ processors.

Python 119 21 Updated Jul 31, 2025

Must read research papers and links to tools and datasets that are related to using machine learning for compilers and systems optimisation

1,662 177 Updated Jan 21, 2026

Enabling Flexible FPGA High-Level Synthesis of Tensorflow Deep Neural Networks

Verilog 623 102 Updated Jan 3, 2020

💥💻💥 A data-parallel functional programming language

Haskell 2,693 198 Updated Apr 10, 2026

Representation and Reference Lowering of ONNX Models in MLIR Compiler Infrastructure

C++ 998 414 Updated Apr 9, 2026

A list of ICs and IPs for AI, Machine Learning and Deep Learning.

PHP 1,704 278 Updated Jun 5, 2024

Porting Postgres Server to WASM [WIP]

C 16 1 Updated Mar 6, 2021

Automatically install github workflows for different types of projects (auto publish npm, pip, etc.)

JavaScript 2 Updated Nov 19, 2024

Emscripten: An LLVM-to-WebAssembly Compiler

C++ 27,307 3,513 Updated Apr 11, 2026

Threads and Atomics in WebAssembly

WebAssembly 749 54 Updated Feb 10, 2026
C 2 Updated Aug 19, 2020

A pure, low-level tensor program representation enabling tensor program optimization via program rewriting. See the web demo at https://gussmith23.github.io/glenside-web-demo/

Rust 74 10 Updated May 30, 2025

Open Machine Learning Compiler Framework

Python 13,265 3,848 Updated Apr 10, 2026

Python library using the Futhark C backend via CFFI

Python 26 9 Updated Jul 4, 2025

A microbenchmark support library

C++ 10,125 1,760 Updated Apr 10, 2026

Awk on the GPU

C++ 28 Updated Apr 2, 2024

Multi-backend GPU query engine written with Futhark

Futhark 18 Updated Jun 22, 2022

8-ary heap

C++ 2 2 Updated May 30, 2020