Organizations

@apache @awslabs @amzn @dmlc @data-apis

AI agents running research on single-GPU nanochat training automatically

Python 62,450 8,730 Updated Mar 26, 2026

Secure and fast microVMs for serverless computing.

Rust 33,388 2,324 Updated Mar 31, 2026

slime is an LLM post-training framework for RL Scaling.

Python 5,051 677 Updated Mar 29, 2026

The open source coding agent.

TypeScript 133,512 14,364 Updated Mar 31, 2026

GPU Cluster Monitoring (GCM): Large-Scale AI Research Cluster Monitoring

Python 219 35 Updated Mar 30, 2026

Machine Learning Engineering Open Book

Python 17,587 1,114 Updated Mar 16, 2026

MLIR-based partitioning system

MLIR 176 33 Updated Mar 31, 2026

Programmer's guide to cooking at home (Simplified Chinese only).

Dockerfile 99,083 10,888 Updated Mar 21, 2026

Python 83 16 Updated May 27, 2025

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 12,546 1,006 Updated Mar 31, 2026

A simple toolchain for moving Remarkable highlights to Readwise

Python 62 2 Updated Oct 28, 2021

A curated list of projects related to the reMarkable tablet

7,318 250 Updated Mar 4, 2026

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Python 18,107 2,911 Updated Mar 26, 2026

A prize for finding tasks that cause large language models to show inverse scaling

618 27 Updated Oct 11, 2023

A programming framework for agentic AI

Python 56,509 8,494 Updated Mar 29, 2026

The agent engineering platform

Python 131,782 21,720 Updated Mar 31, 2026

PyTorch compiler that accelerates training and inference. Get built-in optimizations for performance, memory, parallelism, and easily write your own.

Python 1,450 111 Updated Mar 30, 2026

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Python 4,934 371 Updated Dec 7, 2024

Open Academic Research on Improving LLaMA to SOTA LLM

Python 1,610 103 Updated Aug 30, 2023

A playbook for systematically maximizing the performance of deep learning models.

29,953 2,428 Updated Jun 18, 2024

f.k.a. Awesome ChatGPT Prompts. Share, discover, and collect prompts from the community. Free and open source — self-host for your organization with complete privacy.

HTML 155,222 20,386 Updated Mar 31, 2026

Fast and memory-efficient exact attention

Python 23,065 2,568 Updated Mar 31, 2026

Training and serving large-scale neural networks with auto parallelization.

Python 3,187 362 Updated Dec 9, 2023

A list of ICs and IPs for AI, Machine Learning and Deep Learning.

PHP 1,702 277 Updated Jun 5, 2024

Animation engine for explanatory math videos

Python 85,661 7,197 Updated Mar 26, 2026

Distributed Xarray with Apache Beam

Python 165 11 Updated Jan 14, 2026

Library for 8-bit optimizers and quantization routines.

780 48 Updated Aug 18, 2022

Qiskit is an open-source SDK for working with quantum computers at the level of extended quantum circuits, operators, and primitives.

Python 7,191 2,819 Updated Mar 31, 2026

Making large AI models cheaper, faster and more accessible

Python 41,374 4,522 Updated Mar 30, 2026

Full description can be found here: https://discuss.huggingface.co/t/pretrain-gpt-neo-for-open-source-github-copilot-model/7678?u=ncoop57

Python 3,279 220 Updated Jan 18, 2022