- Cambridge
- alexjchan.com
- @AlexJChan
Highlights
- Pro
-
-
-
attention-based-credit Public
Code for the paper: Dense Reward for Free in Reinforcement Learning from Human Feedback (ICML 2024) by Alex J. Chan, Hao Sun, Samuel Holt, and Mihaela van der Schaar
-
transductive-dropout Public
Unlabelled Data Improves Bayesian Uncertainty Calibration under Covariate Shift (ICML 2020) by Alex J. Chan, Ahmed M. Alaa, Zhaozhi Qian, and Mihaela van der Schaar.
-
trl Public
Forked from huggingface/trlTrain transformer language models with reinforcement learning.
Python Apache License 2.0 UpdatedApr 2, 2024 -
deepspeed_llama Public
Forked from mbalesni/deepspeed_llamaFinetuning LLaMA with DeepSpeed
-
-
-
TruthfulQA Public
Forked from sylinrl/TruthfulQATruthfulQA: Measuring How Models Imitate Human Falsehoods
Jupyter Notebook Apache License 2.0 UpdatedNov 7, 2022 -
synthetic-model-combination Public
Synthetic Model Combination: An Instance-wise Approach to Unsupervised Ensemble Learning (NeurIPS 2022) by Alex J. Chan and Mihaela van der Schaar.
-
-
-
inverse-online Public
Inverse Online Learning: Understanding Non-Stationary and Reactionary Policies (ICLR 2022) by Alex J. Chan, Alicia Curth, and Mihaela van der Schaar.
-
medkit-learn Public
The Medkit-Learn(ing) Environment: Medical Decision Modelling through Simulation (NeurIPS 2021) by Alex J. Chan, Ioana Bica, Alihan Huyuk, Daniel Jarrett, and Mihaela van der Schaar.
-
scalable-birl Public
Scalable Bayesian Inverse Reinforcement Learning (ICLR 2021) by Alex J. Chan and Mihaela van der Schaar.
-
-
mphil-thesis Public
Supplementary code for my MPhil thesis.
Jupyter Notebook MIT License UpdatedAug 20, 2020 -
AML_bayes_opt Public
Supporting code for the Advanced Machine Learning module, MPhil Machine Learning and Machine Intelligence
Jupyter Notebook UpdatedApr 7, 2020 -
MCMC-Project Public
Code for my project comparing theoretical bounds with practical convergence diagnostics in MCMC.
Jupyter Notebook MIT License UpdatedAug 27, 2018 -
rnn-handwriting-generation Public
Forked from snowkylin/rnn-handwriting-generationHandwriting generation by RNN with TensorFlow, based on "Generating Sequences With Recurrent Neural Networks" by Alex Graves
Python UpdatedJan 17, 2017