michaelnny

28 stars written in Python

Public repository for Agent Skills

Python 105,544 11,673 Updated Mar 25, 2026

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 98,622 27,330 Updated Mar 29, 2026

Robust Speech Recognition via Large-Scale Weak Supervision

Python 96,839 11,941 Updated Mar 27, 2026

Animation engine for explanatory math videos

Python 85,615 7,188 Updated Mar 26, 2026

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 41,937 4,768 Updated Mar 29, 2026

TensorFlow code and pre-trained models for BERT

Python 39,942 9,706 Updated Jul 23, 2024

A toolkit for developing and comparing reinforcement learning algorithms.

Python 37,115 8,706 Updated Mar 26, 2026

The official Python library for the OpenAI API

Python 30,321 4,667 Updated Mar 29, 2026

The official Meta Llama 3 GitHub site

Python 29,299 3,529 Updated Jan 26, 2025

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 25,161 5,045 Updated Mar 29, 2026

Code for the paper "Language Models are Unsupervised Multitask Learners"

Python 24,716 5,865 Updated Aug 14, 2024

A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.

Python 23,821 9,812 Updated Sep 1, 2025

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 13,370 894 Updated Dec 17, 2024

An educational resource to help anyone learn deep reinforcement learning.

Python 11,679 2,446 Updated Aug 5, 2024

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python 8,456 773 Updated May 31, 2024

AlphaFold 3 inference pipeline.

Python 7,777 1,162 Updated Mar 10, 2026

Code for the paper "Evaluating Large Language Models Trained on Code"

Python 3,176 443 Updated Jan 17, 2025

Post-training with Tinker

Python 2,993 364 Updated Mar 29, 2026
Python 1,414 100 Updated Mar 27, 2026

Code for the paper Fine-Tuning Language Models from Human Preferences

Python 1,382 171 Updated Jul 25, 2023

Self-contained, minimalistic implementation of diffusion models with Pytorch.

Python 1,159 147 Updated Jun 28, 2022

Code for "Learning to summarize from human feedback"

Python 1,060 153 Updated Sep 5, 2023

Code for the paper "Exploration by Random Network Distillation"

Python 933 163 Updated Oct 1, 2020

A suite of test scenarios for multi-agent reinforcement learning.

Python 808 155 Updated Mar 28, 2026

DQN Zoo is a collection of reference implementations of reinforcement learning agents developed at DeepMind based on the Deep Q-Network (DQN) agent.

Python 496 82 Updated Apr 6, 2024

Demonstrations of Loss of Plasticity and Implementation of Continual Backpropagation

Python 371 82 Updated Sep 2, 2025