Organizations

@apache @awslabs @amzn @dmlc @data-apis

AI agents running research on single-GPU nanochat training automatically

Python 49,811 6,949 Updated Mar 21, 2026

Secure and fast microVMs for serverless computing.

Rust 33,211 2,309 Updated Mar 20, 2026

slime is an LLM post-training framework for RL Scaling.

Python 4,895 657 Updated Mar 22, 2026

The open source coding agent.

TypeScript 128,003 13,533 Updated Mar 22, 2026

GPU Cluster Monitoring (GCM): Large-Scale AI Research Cluster Monitoring

Python 214 33 Updated Mar 20, 2026

Machine Learning Engineering Open Book

Python 17,482 1,108 Updated Mar 16, 2026

MLIR-based partitioning system

MLIR 174 32 Updated Mar 20, 2026

Programmer's guide about how to cook at home (Simplified Chinese only).

Dockerfile 98,983 10,879 Updated Mar 21, 2026

Python 83 15 Updated May 27, 2025

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 12,526 1,005 Updated Feb 6, 2026

A simple toolchain for moving Remarkable highlights to Readwise

Python 62 2 Updated Oct 28, 2021

A curated list of projects related to the reMarkable tablet

7,305 250 Updated Mar 4, 2026

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Python 18,056 2,910 Updated Nov 3, 2025

A prize for finding tasks that cause large language models to show inverse scaling

619 27 Updated Oct 11, 2023

A programming framework for agentic AI

Python 56,025 8,432 Updated Mar 21, 2026

The agent engineering platform

Python 130,601 21,516 Updated Mar 22, 2026

PyTorch compiler that accelerates training and inference. Get built-in optimizations for performance, memory, parallelism, and easily write your own.

Python 1,449 109 Updated Mar 17, 2026

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Python 4,929 372 Updated Dec 7, 2024

Open Academic Research on Improving LLaMA to SOTA LLM

Python 1,610 103 Updated Aug 30, 2023

A playbook for systematically maximizing the performance of deep learning models.

29,930 2,425 Updated Jun 18, 2024

f.k.a. Awesome ChatGPT Prompts. Share, discover, and collect prompts from the community. Free and open source — self-host for your organization with complete privacy.

HTML 153,830 20,233 Updated Mar 22, 2026

Fast and memory-efficient exact attention

Python 22,895 2,541 Updated Mar 22, 2026

Training and serving large-scale neural networks with auto parallelization.

Python 3,188 362 Updated Dec 9, 2023

A list of ICs and IPs for AI, Machine Learning and Deep Learning.

PHP 1,702 279 Updated Jun 5, 2024

Animation engine for explanatory math videos

Python 85,460 7,177 Updated Mar 14, 2026

Distributed Xarray with Apache Beam

Python 166 11 Updated Jan 14, 2026

Library for 8-bit optimizers and quantization routines.

780 48 Updated Aug 18, 2022

Qiskit is an open-source SDK for working with quantum computers at the level of extended quantum circuits, operators, and primitives.

Python 7,162 2,805 Updated Mar 22, 2026

Making large AI models cheaper, faster and more accessible

Python 41,370 4,522 Updated Mar 16, 2026

Full description can be found here: https://discuss.huggingface.co/t/pretrain-gpt-neo-for-open-source-github-copilot-model/7678?u=ncoop57

Python 3,281 220 Updated Jan 18, 2022