Skip to content
View link-er's full-sized avatar

Block or report link-er

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

AI agents running research on single-GPU nanochat training automatically

Python 58,570 8,114 Updated Mar 26, 2026

Normalizing flows for neuro-symbolic AI

Python 21 2 Updated Oct 9, 2025

🚀🚀 「大模型」2小时完全从0训练64M的小参数GPT!🌏 Train a 64M-parameter GPT from scratch in just 2h!

Python 44,216 5,307 Updated Mar 27, 2026

Examples and tutorials to help developers build AI systems

Python 3,888 1,389 Updated Mar 5, 2026

This repo is meant to serve as a guide for Machine Learning/AI technical interviews.

Jupyter Notebook 7,960 1,426 Updated Nov 28, 2025

About A collection of AWESOME things about information geometry Topics

184 16 Updated Jul 4, 2024

Machine Learning Engineering Open Book

Python 17,560 1,116 Updated Mar 16, 2026

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 89,361 13,641 Updated Mar 26, 2026

Retrieval Augmented Generation (RAG) chatbot powered by Weaviate

Python 7,623 841 Updated Jul 14, 2025

Understanding Deep Learning - Simon J.D. Prince

Jupyter Notebook 9,263 2,179 Updated Feb 24, 2026

General Purpose Risk Modeling and Prediction Toolkit for Policy and Social Good Problems

Jupyter Notebook 199 61 Updated Mar 18, 2026

Master Federated Learning in 2 Hours—Run It on Your PC!

Python 2,096 407 Updated Jan 25, 2026

Information plane analysis of dropout networks

Python 1 Updated Feb 23, 2023

ICLR 2022, "FedBABU: Toward Enhanced Representation for Federated Image Classification"

Python 54 12 Updated Mar 21, 2022

A modern look at the relationship between sharpness and generalization [ICML 2023]

Jupyter Notebook 44 4 Updated Sep 11, 2023

Experiments and implementations of the methods described in the paper "Relative Flatness and Generalization".

Python 8 Updated May 14, 2024

PyTorch implementation for the ICLR 2020 paper "Understanding the Limitations of Variational Mutual Information Estimators"

Jupyter Notebook 77 10 Updated Feb 12, 2020

nanoGPT-like codebase for LLM training

Python 116 38 Updated Nov 7, 2025

Google's Operations Research tools:

C++ 13,280 2,373 Updated Mar 27, 2026

Code for visualizing the loss landscape of neural nets

Python 3,161 436 Updated Apr 5, 2022

explore DNNs via Infomration

Python 266 91 Updated Mar 10, 2020

Implementation of Information Dropout

Python 39 9 Updated Jul 12, 2017

PyHessian is a Pytorch library for second-order based analysis and training of Neural Networks

Jupyter Notebook 777 127 Updated Jul 10, 2025

ASDL: Automatic Second-order Differentiation Library for PyTorch

Python 192 18 Updated Dec 5, 2024

BackPACK - a backpropagation package built on top of PyTorch which efficiently computes quantities other than the gradient.

Python 610 57 Updated Nov 28, 2025

A tool to quantify and report the carbon footprint of machine learning computations and communication

Jupyter Notebook 22 6 Updated Sep 5, 2023

Computing various measures and generalization bounds on convolutional and fully connected networks

Python 35 10 Updated Dec 13, 2018

Official code for "In Search of Robust Measures of Generalization" (NeurIPS 2020)

Python 28 4 Updated Dec 22, 2020

Code for the anonymous submission "Cockpit: A Practical Debugging Tool for Training Deep Neural Networks"

Python 31 2 Updated Nov 24, 2020

A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning

20,298 2,533 Updated Mar 26, 2026
Next