-
Princeton University
Stars
Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.
A guidance language for controlling large language models.
Kalman Filter book using Jupyter Notebook. Focuses on building intuition and experience, not formal proofs. Includes Kalman filters,extended Kalman filters, unscented Kalman filters, particle filte…
QLoRA: Efficient Finetuning of Quantized LLMs
Code release for NeRF (Neural Radiance Fields)
Implementations from the free course Deep Reinforcement Learning with Tensorflow and PyTorch
[ICLR 2023] ReAct: Synergizing Reasoning and Acting in Language Models
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
[NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web" -- the first LLM-based web agent and benchmark for generalist web agents
Real numbers, data science and chaos: How to fit any dataset with a single parameter
Some notes on Causal Inference, with examples in python
Gumbel-Softmax Variational Autoencoder with Keras
Reliably download millions of images efficiently
[NeurIPS 2022] DataMUX: Data Multiplexing for Neural Networks
[ICLR 2024] COLLIE: Systematic Construction of Constrained Text Generation Tasks
Action Grammars for Hierarchical Reinforcement Learning