
Organizations

@apache @awslabs @amzn @dmlc @data-apis

AI agents running research on single-GPU nanochat training automatically

Python 62,671 8,766 Updated Mar 26, 2026

Secure and fast microVMs for serverless computing.

Rust 33,394 2,323 Updated Mar 31, 2026

slime is an LLM post-training framework for RL Scaling.

Python 5,053 677 Updated Mar 29, 2026

The open source coding agent.

TypeScript 133,679 14,389 Updated Mar 31, 2026

GPU Cluster Monitoring (GCM): Large-Scale AI Research Cluster Monitoring

Python 219 35 Updated Mar 30, 2026

Machine Learning Engineering Open Book

Python 17,587 1,114 Updated Mar 16, 2026

MLIR-based partitioning system

MLIR 176 33 Updated Mar 31, 2026

A programmer's guide to cooking at home (Simplified Chinese only).

Dockerfile 99,085 10,887 Updated Mar 21, 2026

Python 83 16 Updated May 27, 2025

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 12,547 1,005 Updated Mar 31, 2026

A simple toolchain for moving Remarkable highlights to Readwise

Python 62 2 Updated Oct 28, 2021

A curated list of projects related to the reMarkable tablet

7,317 250 Updated Mar 4, 2026

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Python 18,108 2,911 Updated Mar 26, 2026

A prize for finding tasks that cause large language models to show inverse scaling

618 27 Updated Oct 11, 2023

A programming framework for agentic AI

Python 56,520 8,496 Updated Mar 29, 2026

The agent engineering platform

Python 131,806 21,730 Updated Mar 31, 2026

PyTorch compiler that accelerates training and inference. Get built-in optimizations for performance, memory, parallelism, and easily write your own.

Python 1,451 111 Updated Mar 30, 2026

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Python 4,934 371 Updated Dec 7, 2024

Open Academic Research on Improving LLaMA to SOTA LLM

Python 1,610 103 Updated Aug 30, 2023

A playbook for systematically maximizing the performance of deep learning models.

29,956 2,428 Updated Jun 18, 2024

f.k.a. Awesome ChatGPT Prompts. Share, discover, and collect prompts from the community. Free and open source — self-host for your organization with complete privacy.

HTML 155,288 20,396 Updated Mar 31, 2026

Fast and memory-efficient exact attention

Python 23,065 2,569 Updated Mar 31, 2026

Training and serving large-scale neural networks with auto parallelization.

Python 3,187 362 Updated Dec 9, 2023

A list of ICs and IPs for AI, Machine Learning and Deep Learning.

PHP 1,702 277 Updated Jun 5, 2024

Animation engine for explanatory math videos

Python 85,669 7,195 Updated Mar 26, 2026

Distributed Xarray with Apache Beam

Python 165 11 Updated Jan 14, 2026

Library for 8-bit optimizers and quantization routines.

780 48 Updated Aug 18, 2022

Qiskit is an open-source SDK for working with quantum computers at the level of extended quantum circuits, operators, and primitives.

Python 7,192 2,820 Updated Mar 31, 2026

Making large AI models cheaper, faster and more accessible

Python 41,373 4,521 Updated Mar 30, 2026

Full description can be found here: https://discuss.huggingface.co/t/pretrain-gpt-neo-for-open-source-github-copilot-model/7678?u=ncoop57

Python 3,279 220 Updated Jan 18, 2022