-
RLVE Public
Forked from Zhiyuan-Zeng/RLVE
[Preprint] RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments
Python MIT License Updated Nov 12, 2025 -
RL-Compositionality Public
Forked from PRIME-RL/RL-Compositionality
FROM $f(x)$ AND $g(x)$ TO $f(g(x))$: LLMs Learn New Skills in RL by Composing Old Ones
-
CRAFT Public
Code for ICLR 2024 paper "CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets"
-
alpaca_eval Public
Forked from tatsu-lab/alpaca_eval
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
Jupyter Notebook Apache License 2.0 Updated Sep 30, 2023 -
FastChat Public
Forked from lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Python Apache License 2.0 Updated Sep 27, 2023 -
OOD_NLP Public
[NeurIPS 2023 D&B Track] Code and data for paper "Revisiting Out-of-distribution Robustness in NLP: Benchmarks, Analysis, and LLMs Evaluations".
-
PLMCalibration Public
Code for ACL 2023 paper "A Close Look into the Calibration of Pre-trained Language Models"
-
OpenAttack Public
Forked from thunlp/OpenAttack
An Open-Source Package for Textual Adversarial Attack.
Python MIT License Updated Mar 13, 2023 -
FactMix Public
Code for COLING 2022 paper "FactMix: Using a Few Labeled In-domain Examples to Generalize to Cross-domain Named Entity Recognition"