Stars
This is the github to open source benchmark AdvancedIF, see LAMA L1387358RCRO
Design and analyze optimal deep learning models.
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
Repository for code and models for the paper "Extrapolative Controlled Sequence Generation via Iterative Refinement"
GPQA: A Graduate-Level Google-Proof Q&A Benchmark
Synthetic question-answering dataset to formally analyze the chain-of-thought output of large language models on a reasoning task.
A curated list of radiology report generation (medical report generation) and related areas. :-)
Estimating the COVID risk of ordinary activities
Analysis of NLU test sets with IRT
Article-summary entailment annotations for agreement-oriented multidoc summarization
Repository for the code associated with the paper: Unsupervised Extractive Summarization using Mutual Information
Datasets, SOTA results of every fields of Chinese NLP
hhexiy / refdb
Forked from percyliang/refdbStores paper references, outputs to bib/html, does basic sanity checking on bib entries
ENGINE: Energy-Based Inference Networks for Non-Autoregressive Machine Translation
An ML framework to accelerate research and its path to production.
woollysocks / ParlAI
Forked from facebookresearch/ParlAIA framework for training and evaluating AI models on a variety of openly available dialogue datasets.
Unsupervised Word Segmentation for Neural Machine Translation and Text Generation
Paper List for Style Transfer in Text