Skip to content
View yzpang's full-sized avatar
  • New York, NY

Organizations

@nyu-mll

Block or report yzpang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

This is the github to open source benchmark AdvancedIF, see LAMA L1387358RCRO

Python 28 1 Updated Nov 26, 2025

Design and analyze optimal deep learning models.

Jupyter Notebook 29 2 Updated Aug 2, 2025
HTML 2 1 Updated Jan 19, 2025

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 10,319 1,005 Updated Jul 1, 2024
C 2 Updated Feb 7, 2020
HTML 1 Updated Jun 13, 2023

Repository for code and models for the paper "Extrapolative Controlled Sequence Generation via Iterative Refinement"

Python 16 Updated Mar 5, 2024

GPQA: A Graduate-Level Google-Proof Q&A Benchmark

Jupyter Notebook 467 47 Updated Sep 30, 2024

Synthetic question-answering dataset to formally analyze the chain-of-thought output of large language models on a reasoning task.

Python 155 16 Updated Sep 9, 2025

Query-focused summarization data

Python 43 2 Updated Feb 17, 2023

A curated list of radiology report generation (medical report generation) and related areas. :-)

180 20 Updated May 7, 2022
Python 147 10 Updated Jan 17, 2025

Estimating the COVID risk of ordinary activities

TypeScript 267 54 Updated Oct 10, 2024

Analysis of NLU test sets with IRT

Jupyter Notebook 12 8 Updated Jul 23, 2021
Python 8 4 Updated Mar 14, 2023
Python 4 1 Updated Oct 15, 2020

Article-summary entailment annotations for agreement-oriented multidoc summarization

10 2 Updated Jun 7, 2021
Python 61 14 Updated May 26, 2022

Repository for the code associated with the paper: Unsupervised Extractive Summarization using Mutual Information

Python 25 1 Updated Sep 11, 2021

Datasets, SOTA results of every fields of Chinese NLP

HTML 1,813 267 Updated Apr 7, 2022

Stores paper references, outputs to bib/html, does basic sanity checking on bib entries

TeX 1 Updated Nov 30, 2023

ENGINE: Energy-Based Inference Networks for Non-Autoregressive Machine Translation

Python 25 5 Updated Oct 2, 2020

jiant is an nlp toolkit

Python 1,674 297 Updated Jul 6, 2023

An ML framework to accelerate research and its path to production.

Python 269 28 Updated Sep 3, 2024

A framework for training and evaluating AI models on a variety of openly available dialogue datasets.

Python 2 1 Updated Jul 23, 2020

Unsupervised Word Segmentation for Neural Machine Translation and Text Generation

Python 2,263 475 Updated Aug 7, 2024

Paper List for Style Transfer in Text

1,623 194 Updated Mar 16, 2023

Paper coming soon

Python 2 Updated Feb 23, 2020