HAE-RAE 는 언어모델 평가를 연구하는 오픈소스 단체입니다. 이전까지 HAERAE-BENCH, KMMLU, csat-qa, k2-eval 등의 벤치마크를 공개하였습니다.
Pinned Loading
Repositories
Showing 8 of 8 repositories
- nlp-arxiv-translator Public
HAE-RAE/nlp-arxiv-translator’s past year of commit activity - lm-evaluation-harness Public Forked from EleutherAI/lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
HAE-RAE/lm-evaluation-harness’s past year of commit activity