Skip to content
View ethancaballero's full-sized avatar

Block or report ethancaballero

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Scaling Laws for Linear Complexity Language Models

10 Updated Jun 23, 2024

A 1D analogue of the MNIST dataset for measuring spatial biases and answering Science of Deep Learning questions.

Jupyter Notebook 235 38 Updated Oct 9, 2024

A toolkit for scaling law research โš–

Python 54 4 Updated Jan 27, 2025

Functional local implementations of main model parallelism approaches

Jupyter Notebook 95 6 Updated Feb 21, 2023
Python 210 18 Updated Oct 10, 2022

A prize for finding tasks that cause large language models to show inverse scaling

619 27 Updated Oct 11, 2023

Pen and paper exercises in machine learning

TeX 2,575 216 Updated May 21, 2024

Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.

Shell 1,007 103 Updated Jul 29, 2024
Python 4,202 573 Updated Mar 19, 2024

Code for "The Intrinsic Dimension of Images and Its Impact on Learning" - ICLR 2021 Spotlight https://openreview.net/forum?id=XJk19XzGq2J

Python 71 9 Updated Apr 16, 2024

PyTorch implementation for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)

Jupyter Notebook 2,062 350 Updated Jul 14, 2024

PyTorch - FID calculation with proper image resizing and quantization steps [CVPR 2022]

Python 1,127 78 Updated Aug 2, 2025

Compute FID scores with PyTorch.

Python 3,817 523 Updated Jul 3, 2024

Release for Improved Denoising Diffusion Probabilistic Models

Python 3,759 531 Updated Jul 18, 2024

Implementation of Denoising Diffusion Probabilistic Model in Pytorch

Python 10,323 1,255 Updated Aug 4, 2025

Learned Hyperparameter Optimizers

Python 59 13 Updated Jun 1, 2021

Denoising Diffusion Probabilistic Models

Python 4,949 464 Updated Aug 29, 2023

code for Scaling Laws for Language Transfer Learning

Python 9 1 Updated Apr 18, 2021

Repository for reproducing `Model-Based Robust Deep Learning`

Python 16 5 Updated Jan 22, 2021

PyTorch Implementation of OpenAI's Image GPT

Python 260 33 Updated Oct 3, 2023

An implementation of the BADGE batch active learning algorithm.

Python 210 35 Updated Jun 5, 2024

PyTorch extensions for high performance and large scale training.

Python 3,392 294 Updated Apr 26, 2025

Code from the article: "The Role of Disentanglement in Generalisation" (ICLR, 2021).

Jupyter Notebook 21 Updated May 28, 2022
Python 4 Updated Apr 21, 2019

Implementation of Perceiver, General Perception with Iterative Attention, in Pytorch

Python 1,187 137 Updated Aug 22, 2023

Scaling scaling laws with board games.

Python 54 12 Updated Jul 17, 2023

Use python's argparse module in shell scripts

Shell 176 26 Updated Apr 28, 2024

Repository for theory and methods for Out-of-Distribution (OoD) generalization

Jupyter Notebook 63 16 Updated Mar 4, 2022
Next