Skip to content
View leogao2's full-sized avatar

Block or report leogao2

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Simple migration engine for Peewee

Python 18 6 Updated Nov 7, 2025

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 152,192 31,066 Updated Nov 6, 2025

Universal markup converter

Haskell 40,070 3,676 Updated Nov 5, 2025

A debugging and profiling tool that can trace and visualize python code execution

Python 7,299 464 Updated Nov 5, 2025

Simple migration engine for Peewee

Python 374 88 Updated Nov 5, 2025

Keeping language models honest by directly eliciting knowledge encoded in their activations.

Python 212 32 Updated Nov 3, 2025

A framework for few-shot evaluation of language models.

Python 10,550 2,832 Updated Oct 29, 2025

Sacred is a tool to help you configure, organize, log and reproduce experiments developed at IDSIA.

Python 4,341 388 Updated Oct 22, 2025

Small language model built in Minecraft.

Python 649 37 Updated Sep 28, 2025

An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries

Python 7,326 1,093 Updated Sep 26, 2025

DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.

Python 170 48 Updated Sep 26, 2025

Starlark Language

Python 2,785 172 Updated Sep 10, 2025

Python 3.9 to JavaScript compiler - Lean, fast, open!

Python 2,905 218 Updated Jun 16, 2025

Simple, elegant, Pythonic functional programming.

Python 4,272 130 Updated Apr 21, 2025

a small library for combinatorial iters of dicts, useful for config/hyperparameter sweep management

Python 5 2 Updated Feb 8, 2025

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 31,445 3,821 Updated Jul 23, 2024

A simple and efficient tool to parallelize Pandas operations on all available CPUs

Python 3,794 213 Updated Jul 9, 2024

downloads and parses subtitle dataset from opensubtitles.org

Python 16 4 Updated Apr 19, 2024

PyTorch package for the discrete VAE used for DALL·E.

Python 10,875 1,905 Updated Jan 31, 2024

Transforms PDF, Documents and Images into Enriched Structured Data

JavaScript 6,019 318 Updated Dec 3, 2023

URL downloader supporting checkpointing and continuous checksumming.

Python 19 7 Updated Nov 29, 2023

Evaluation of measurement tampering detection techniques on the datasets from Benchmarks for Detecting Measurement Tampering

Python 7 4 Updated Sep 7, 2023

German part-of-speech dictionary

Shell 45 7 Updated Sep 6, 2023

A dataset of alignment research and code to reproduce it

HTML 78 18 Updated Jun 22, 2023

Download, parse, and filter data from Court Listener, part of the FreeLaw projects. Data-ready for The-Pile.

Python 14 5 Updated Jun 3, 2023
Python 1,611 144 Updated Apr 27, 2023

Downloads 2020 English Wikipedia articles as plaintext

Python 24 4 Updated Mar 25, 2023

Athens is no longer maintainted. Athens was an open-source, collaborative knowledge graph, backed by YC W21

Clojure 6,306 403 Updated Feb 3, 2023

Model parallel transformers in JAX and Haiku

Python 6,352 892 Updated Jan 21, 2023

Massively-Parallel Natural Extension of Reference Frame

Jupyter Notebook 32 4 Updated Jan 18, 2023
Next