Lichang-Chen

Lichang Chen Lichang-Chen

38 followers · 21 following

University of Maryland
College Park
lichang-chen.github.io

Achievements

Organizations

Lichang-Chen.github.io Public

The github personal webpage for Lichang Chen.

HTML Updated Oct 26, 2025
ADRS Public
Forked from UCB-ADRS/ADRS

AI-Driven Research For Systems (ADRS)

Jupyter Notebook Updated Oct 16, 2025
CS234-Reinforcement-Learning Public
Forked from Rhyme0730/CS234-Reinforcement-Learning

This repo mainly contains CS234 assignment's coding problems

Python Updated Feb 4, 2025
RLHF-Reward-Modeling Public
Forked from RLHFlow/RLHF-Reward-Modeling

Recipes to train reward model for RLHF.

Python Apache License 2.0 Updated Dec 9, 2024
Reflection_Tuning Public
Forked from tianyi-lab/Reflection_Tuning

[ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning

Python Updated Sep 6, 2024
ODIN Public

ODIN: Disentangled Reward Mitigates Hacking in RLHF (ICML 2024)

Python 6 Updated Sep 5, 2024
real-world-data Public

real-world-data

Updated Aug 8, 2024
AlpaGasus Public

A better Alpaca Model Trained with Less Data (only 9k instructions of the original set)

filtering-data instruction-following data-centric-ai large-language-models

HTML 24 3 Updated Jul 26, 2024
InstructZero Public

Official Implementation of InstructZero; the first framework to optimize bad prompts of ChatGPT(API LLMs) and finally obtain good prompts!

black-box-optimization vicuna black-box-tuning chatgpt large-language-model instruction-optimization

Python 197 14 Updated Jul 23, 2024
LLaVA Public
Forked from haotian-liu/LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python Apache License 2.0 Updated Mar 24, 2024
HallusionBench Public
Forked from tianyi-lab/HallusionBench

[CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models

Python BSD 3-Clause "New" or "Revised" License Updated Mar 17, 2024
LLaVA-RLHF Public
Forked from llava-rlhf/LLaVA-RLHF

Aligning LMMs with Factually Augmented RLHF

Python GNU General Public License v3.0 Updated Nov 1, 2023
claude2-alpaca Public

First instruction-tuning dataset distilled from Claude2 (52k Alpaca prompts)!

Python 12 3 Updated Oct 22, 2023
stanford_alpaca Public
Forked from tatsu-lab/stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python Apache License 2.0 Updated Jun 7, 2023
reward-trl Public
Forked from huggingface/trl

Train transformer language models with reinforcement learning.

Python 1 Apache License 2.0 Updated Apr 14, 2023
Chain-of-ThoughtsPapers Public
Forked from Timothyxxx/Chain-of-ThoughtsPapers

A trend starts from "Chain of Thought Prompting Elicits Reasoning in Large Language Models".

Updated Nov 16, 2022
zero_shot_cot Public
Forked from kojima-takeshi188/zero_shot_cot

Prod Env

Python 1 Updated Jul 31, 2022
system-design-primer Public
Forked from donnemartin/system-design-primer

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

Python Other Updated Apr 28, 2022
minmax-opt-smooth-adversary Public
Forked from fiezt/minmax-opt-smooth-adversary

Jupyter Notebook Updated Jun 2, 2021
OUTLOOK_ZJU_MAIL Public

Updated Nov 21, 2019
zjuthesis Public template
Forked from TheNetAdmin/zjuthesis

Zhejiang University Graduation Thesis/Design LaTeX template.

TeX MIT License Updated Nov 4, 2019
DeepLearning-500-questions Public
Forked from CrownX/DeepLearning-500-questions

深度学习500问，以问答形式对常用的概率知识、线性代数、机器学习、深度学习、计算机视觉等热点问题进行阐述，以帮助自己及有需要的读者。全书分为18个章节，近30万字。由于水平有限，书中不妥之处恳请广大读者批评指正。未完待续............ 如有意合作，联系scutjy2015@163.com 版权所有，违权必究 Tan 2018.06

TeX GNU General Public License v3.0 Updated Mar 7, 2019

Lichang Chen Lichang-Chen

Achievements

Achievements

Organizations

Lichang-Chen.github.io Public

Uh oh!

ADRS Public

Uh oh!

CS234-Reinforcement-Learning Public

Uh oh!

RLHF-Reward-Modeling Public

Uh oh!

Reflection_Tuning Public

Uh oh!

ODIN Public

Uh oh!

real-world-data Public

Uh oh!

AlpaGasus Public

Uh oh!

InstructZero Public

Uh oh!

LLaVA Public

Uh oh!

HallusionBench Public

Uh oh!

LLaVA-RLHF Public

Uh oh!

claude2-alpaca Public

Uh oh!

stanford_alpaca Public

Uh oh!

reward-trl Public

Uh oh!

Chain-of-ThoughtsPapers Public

Uh oh!

zero_shot_cot Public

Uh oh!

system-design-primer Public

Uh oh!

minmax-opt-smooth-adversary Public

Uh oh!

OUTLOOK_ZJU_MAIL Public

Uh oh!

zjuthesis Public template

Uh oh!

DeepLearning-500-questions Public

Uh oh!