🎯
Focusing
Hi, I am Sihao, a CS PhD student at Georgia Tech.
- Georgia Institute of Technology
- Atlanta, GA, USA
- bayi-hu.github.io
Stars
7 stars written in Jupyter Notebook
Welcome to the Llama Cookbook! This is your go-to guide for building with Llama: getting started with inference, fine-tuning, and RAG. We also show you how to solve end to end problems using Llama mode…
Playing Pokemon Red with Reinforcement Learning
Chess reinforcement learning by AlphaGo Zero methods.
Chat凉宫春日 (Chat Haruhi Suzumiya), an open-source role-playing chatbot by Cheng Li, Ziang Leng, and others.
Benchmark LLMs by fighting in Street Fighter 3! The new way to evaluate the quality of an LLM.
Teaming the most diverse LLMs for ensemble learning, with a genetic algorithm using an RL focal metric, and combining the outputs for MCQ and OEQ tasks using TOPLA MLP and LED models.