Skip to content
View szha's full-sized avatar

Organizations

@apache @awslabs @amzn @dmlc @data-apis

Block or report szha

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

AI agents running research on single-GPU nanochat training automatically

Python 65,831 9,421 Updated Mar 26, 2026

Secure and fast microVMs for serverless computing.

Rust 33,455 2,331 Updated Apr 2, 2026

slime is an LLM post-training framework for RL Scaling.

Python 5,121 691 Updated Apr 5, 2026

The open source coding agent.

TypeScript 137,256 15,046 Updated Apr 5, 2026

GPU Cluster Monitoring (GCM): Large-Scale AI Research Cluster Monitoring

Python 221 34 Updated Apr 2, 2026

Machine Learning Engineering Open Book

Python 17,610 1,116 Updated Mar 16, 2026

MLIR-based partitioning system

MLIR 177 33 Updated Apr 3, 2026

程序员在家做饭方法指南。Programmer's guide about how to cook at home (Simplified Chinese only).

Dockerfile 99,176 10,891 Updated Mar 21, 2026
Python 83 16 Updated May 27, 2025

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 12,551 1,003 Updated Mar 31, 2026

A simple toolchain for moving Remarkable highlights to Readwise

Python 62 2 Updated Oct 28, 2021

A curated list of projects related to the reMarkable tablet

7,326 250 Updated Mar 4, 2026

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Python 18,129 2,912 Updated Mar 26, 2026

A prize for finding tasks that cause large language models to show inverse scaling

618 27 Updated Oct 11, 2023

A programming framework for agentic AI

Python 56,700 8,525 Updated Apr 2, 2026

The agent engineering platform

Python 132,390 21,839 Updated Apr 4, 2026

PyTorch compiler that accelerates training and inference. Get built-in optimizations for performance, memory, parallelism, and easily write your own.

Python 1,451 112 Updated Mar 30, 2026

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Python 4,933 372 Updated Dec 7, 2024

Open Academic Research on Improving LLaMA to SOTA LLM

Python 1,610 103 Updated Aug 30, 2023

A playbook for systematically maximizing the performance of deep learning models.

29,981 2,422 Updated Jun 18, 2024

f.k.a. Awesome ChatGPT Prompts. Share, discover, and collect prompts from the community. Free and open source — self-host for your organization with complete privacy.

HTML 157,462 20,621 Updated Apr 5, 2026

Fast and memory-efficient exact attention

Python 23,136 2,583 Updated Apr 4, 2026

Training and serving large-scale neural networks with auto parallelization.

Python 3,187 361 Updated Dec 9, 2023

A list of ICs and IPs for AI, Machine Learning and Deep Learning.

PHP 1,702 277 Updated Jun 5, 2024

Animation engine for explanatory math videos

Python 85,768 7,199 Updated Mar 26, 2026

Distributed Xarray with Apache Beam

Python 165 11 Updated Jan 14, 2026

Library for 8-bit optimizers and quantization routines.

780 47 Updated Aug 18, 2022

Qiskit is an open-source SDK for working with quantum computers at the level of extended quantum circuits, operators, and primitives.

Python 7,208 2,829 Updated Apr 5, 2026

Making large AI models cheaper, faster and more accessible

Python 41,370 4,520 Updated Mar 30, 2026

Full description can be found here: https://discuss.huggingface.co/t/pretrain-gpt-neo-for-open-source-github-copilot-model/7678?u=ncoop57

Python 3,275 220 Updated Jan 18, 2022
Next