Skip to content
View leogao2's full-sized avatar

Block or report leogao2

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
28 results for source starred repositories written in Python
Clear filter

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 152,211 31,076 Updated Nov 7, 2025

PyTorch package for the discrete VAE used for DALL·E.

Python 10,876 1,905 Updated Jan 31, 2024

A framework for few-shot evaluation of language models.

Python 10,553 2,832 Updated Oct 29, 2025

An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries

Python 7,328 1,093 Updated Sep 26, 2025

A debugging and profiling tool that can trace and visualize python code execution

Python 7,302 464 Updated Nov 5, 2025

Model parallel transformers in JAX and Haiku

Python 6,352 892 Updated Jan 21, 2023

Sacred is a tool to help you configure, organize, log and reproduce experiments developed at IDSIA.

Python 4,341 388 Updated Oct 22, 2025

Simple, elegant, Pythonic functional programming.

Python 4,272 130 Updated Apr 21, 2025

A simple and efficient tool to parallelize Pandas operations on all available CPUs

Python 3,794 213 Updated Jul 9, 2024

Python 3.9 to JavaScript compiler - Lean, fast, open!

Python 2,905 218 Updated Jun 16, 2025

Starlark Language

Python 2,785 172 Updated Sep 10, 2025
Python 1,611 144 Updated Apr 27, 2023

Small language model built in Minecraft.

Python 649 37 Updated Sep 28, 2025

Open-AI's DALL-E for large scale training in mesh-tensorflow.

Python 433 46 Updated Feb 12, 2022

Simple migration engine for Peewee

Python 374 88 Updated Nov 5, 2025

Keeping language models honest by directly eliciting knowledge encoded in their activations.

Python 212 32 Updated Nov 3, 2025
Python 159 18 Updated Mar 5, 2021
Python 92 18 Updated Jul 16, 2022

Babysit your preemptible TPUs

Python 86 15 Updated Dec 3, 2022

Using queues, tqdm-multiprocess supports multiple worker processes, each with multiple tqdm progress bars, displaying them cleanly through the main process. It offers similar functionality for pyth…

Python 43 2 Updated Jan 6, 2021

Downloads 2020 English Wikipedia articles as plaintext

Python 24 4 Updated Mar 25, 2023

URL downloader supporting checkpointing and continuous checksumming.

Python 19 7 Updated Nov 29, 2023

downloads and parses subtitle dataset from opensubtitles.org

Python 16 4 Updated Apr 19, 2024

Download, parse, and filter data from Court Listener, part of the FreeLaw projects. Data-ready for The-Pile.

Python 14 5 Updated Jun 3, 2023

Script for downloading GitHub.

Python 13 46 Updated Sep 24, 2020

Evaluation of measurement tampering detection techniques on the datasets from Benchmarks for Detecting Measurement Tampering

Python 7 4 Updated Sep 7, 2023

a small library for combinatorial iters of dicts, useful for config/hyperparameter sweep management

Python 5 2 Updated Feb 8, 2025

Script/utility that tests sample cases from Kattis locally, built for UAPSC.

Python 3 Updated Mar 10, 2019