View leogao2's full-sized avatar

Open-source release accompanying Gao et al. 2025

Python 515 54 Updated Dec 11, 2025

Small language model built in Minecraft.

Python 703 40 Updated Sep 28, 2025

A small library for combinatorial iteration over dicts, useful for config/hyperparameter sweep management

Python 5 2 Updated Feb 8, 2025

Evaluation of measurement tampering detection techniques on the datasets from Benchmarks for Detecting Measurement Tampering

Python 8 4 Updated Sep 7, 2023

A debugging and profiling tool that can trace and visualize Python code execution

Python 7,621 468 Updated Feb 16, 2026

Keeping language models honest by directly eliciting knowledge encoded in their activations.

Python 218 30 Updated Apr 27, 2026

A simple and efficient tool to parallelize Pandas operations on all available CPUs

Python 3,807 213 Updated Jul 9, 2024

A dataset of alignment research and code to reproduce it

HTML 78 17 Updated Jun 22, 2023

Starlark Language

Python 2,961 176 Updated Feb 6, 2026

Dataset of Canada goose images with annotations of bounding boxes with object classes, suitable for testing object detection algorithms.

Jupyter Notebook 40 5 Updated Aug 2, 2018

Massively-Parallel Natural Extension of Reference Frame

Jupyter Notebook 34 4 Updated Jan 18, 2023

Simple, elegant, Pythonic functional programming.

Python 4,325 141 Updated Feb 16, 2026

Model parallel transformers in JAX and Haiku

Python 6,366 884 Updated Jan 21, 2023
Python 164 19 Updated Mar 5, 2021

PyTorch package for the discrete VAE used for DALL·E.

Python 10,867 1,885 Updated Jan 31, 2024

A Python library for integrating model-based and judgmental forecasting

Jupyter Notebook 110 24 Updated Dec 21, 2021

DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.

Python 172 48 Updated Sep 26, 2025

OpenAI's DALL-E for large-scale training in mesh-tensorflow.

Python 431 46 Updated Feb 12, 2022

CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image

Jupyter Notebook 33,350 3,992 Updated Mar 25, 2026

An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries

Python 7,424 1,110 Updated Apr 13, 2026

URL downloader supporting checkpointing and continuous checksumming.

Python 19 6 Updated Nov 29, 2023

Using queues, tqdm-multiprocess supports multiple worker processes, each with multiple tqdm progress bars, displaying them cleanly through the main process. It offers similar functionality for pyth…

Python 43 2 Updated Jan 6, 2021

A framework for few-shot evaluation of language models.

Python 12,353 3,238 Updated Apr 27, 2026
Python 95 19 Updated Jul 16, 2022

An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.

Python 8,281 960 Updated Feb 25, 2022
Python 1,646 149 Updated Apr 27, 2023

Download, parse, and filter data from Court Listener, part of the FreeLaw projects. Data-ready for The-Pile.

Python 15 4 Updated Jun 3, 2023

Universal markup converter

Haskell 43,755 3,834 Updated Apr 24, 2026

Babysit your preemptible TPUs

Python 86 15 Updated Dec 3, 2022

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 160,046 33,048 Updated Apr 28, 2026