Skip to content
View leogao2's full-sized avatar

Block or report leogao2

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
37 results for source starred repositories
Clear filter

Small language model built in Minecraft.

Python 649 37 Updated Sep 28, 2025

a small library for combinatorial iters of dicts, useful for config/hyperparameter sweep management

Python 5 2 Updated Feb 8, 2025

Evaluation of measurement tampering detection techniques on the datasets from Benchmarks for Detecting Measurement Tampering

Python 7 4 Updated Sep 7, 2023

A debugging and profiling tool that can trace and visualize python code execution

Python 7,292 464 Updated Nov 4, 2025

Keeping language models honest by directly eliciting knowledge encoded in their activations.

Python 212 32 Updated Nov 3, 2025

A simple and efficient tool to parallelize Pandas operations on all available CPUs

Python 3,794 213 Updated Jul 9, 2024

A dataset of alignment research and code to reproduce it

HTML 78 18 Updated Jun 22, 2023

Starlark Language

Python 2,784 172 Updated Sep 10, 2025

Dataset of Canada goose images with annotations of bounding boxes with object classes, suitable for testing object detection algorithms.

Jupyter Notebook 39 5 Updated Aug 2, 2018

Massively-Parallel Natural Extension of Reference Frame

Jupyter Notebook 32 4 Updated Jan 18, 2023

Simple, elegant, Pythonic functional programming.

Python 4,272 130 Updated Apr 21, 2025

Model parallel transformers in JAX and Haiku

Python 6,352 892 Updated Jan 21, 2023
Python 159 18 Updated Mar 5, 2021

PyTorch package for the discrete VAE used for DALL·E.

Python 10,875 1,905 Updated Jan 31, 2024

A Python library for integrating model-based and judgmental forecasting

Jupyter Notebook 108 24 Updated Dec 21, 2021

Open-AI's DALL-E for large scale training in mesh-tensorflow.

Python 433 46 Updated Feb 12, 2022

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 31,426 3,819 Updated Jul 23, 2024

An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries

Python 7,325 1,090 Updated Sep 26, 2025

URL downloader supporting checkpointing and continuous checksumming.

Python 19 7 Updated Nov 29, 2023

Using queues, tqdm-multiprocess supports multiple worker processes, each with multiple tqdm progress bars, displaying them cleanly through the main process. It offers similar functionality for pyth…

Python 43 2 Updated Jan 6, 2021

A framework for few-shot evaluation of language models.

Python 10,533 2,829 Updated Oct 29, 2025
Python 92 18 Updated Jul 16, 2022
Python 1,610 143 Updated Apr 27, 2023

Download, parse, and filter data from Court Listener, part of the FreeLaw projects. Data-ready for The-Pile.

Python 14 5 Updated Jun 3, 2023

Universal markup converter

Haskell 40,048 3,674 Updated Nov 5, 2025

Babysit your preemptible TPUs

Python 86 15 Updated Dec 3, 2022

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 152,132 31,050 Updated Nov 5, 2025

Sacred is a tool to help you configure, organize, log and reproduce experiments developed at IDSIA.

Python 4,341 388 Updated Oct 22, 2025

Downloads 2020 English Wikipedia articles as plaintext

Python 24 4 Updated Mar 25, 2023

downloads and parses subtitle dataset from opensubtitles.org

Python 16 4 Updated Apr 19, 2024
Next