Skip to content
View jiaaoc's full-sized avatar

Organizations

@SALT-NLP

Block or report jiaaoc

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Python 10,502 1,040 Updated Oct 29, 2024

A family of open-sourced Mixture-of-Experts (MoE) Large Language Models

Python 1,379 71 Updated Mar 8, 2024

Tools for merging pretrained large language models.

Python 4,749 434 Updated Oct 29, 2024
Python 22 1 Updated Sep 19, 2023

Codes for the paper: "Continual Learning for Text Classification with Information Disentanglement Based Regularization"

Python 44 2 Updated Feb 9, 2023

Simple Conversational Data Augmentation for Semi-supervised Abstractive Conversation Summarization

Jupyter Notebook 10 3 Updated Mar 7, 2022
Python 5 3 Updated Aug 29, 2021

Code and dataset fo paper: Understanding the Usage of Online Media for Parenting from Infancy to Preschool At Scale

Jupyter Notebook 5 2 Updated May 13, 2021

Source codes for "Structure-Aware Abstractive Conversation Summarization via Discourse and Action Graphs"

Python 64 8 Updated Oct 27, 2023
Jupyter Notebook 62 13 Updated Apr 25, 2020

Implementation of ICLR 2020 paper "Revisiting Self-Training for Neural Sequence Generation"

Python 47 8 Updated Jun 30, 2022

MediaSum: A Large-scale Media Interview Dataset for Dialogue Summarization

63 7 Updated Jul 20, 2021

The source code for the Cutoff data augmentation approach proposed in this paper: "A Simple but Tough-to-Beat Data Augmentation Approach for Natural Language Understanding and Generation".

Python 62 4 Updated Nov 20, 2020

Source codes for the paper "Examining the Ordering of Rhetorical Strategies in Persuasive Requests"

Jupyter Notebook 17 3 Updated Sep 12, 2021

Source codes for the paper "Local Additivity Based Data Augmentation for Semi-supervised NER"

Python 44 5 Updated Oct 15, 2022

Source codes for the paper "Multi-View Sequence-to-Sequence Models with Conversational Structure for Abstractive Dialogue Summarization"

Python 89 15 Updated Oct 27, 2023

"End-to-End Abstractive Summarization for Meetings" paper - Unofficial PyTorch Implementation

Python 52 13 Updated Dec 8, 2022

Knowledge Graph-Augmented Abstractive Summarization with Semantic-Driven Cloze Reward

Python 76 12 Updated Sep 13, 2020

PyTorch implementation of Contrastive Learning methods

Python 1,938 186 Updated Oct 4, 2023

Codebase for the Summary Loop paper at ACL2020

Python 44 13 Updated Jun 12, 2023
Python 394 83 Updated Nov 1, 2018

Language Model Baselines for PyTorch

Python 42 4 Updated Aug 18, 2020

annotated screenplays for 39 CSI:Crime Scene Investigation episodes for paper "Whodunnit? Crime Drama as a Case for Natural Language Understanding"

46 10 Updated May 19, 2020

Beyond Accuracy: Behavioral Testing of NLP models with CheckList

Jupyter Notebook 2,009 204 Updated Jan 9, 2024

Data and code associated with ICWSM 2020 paper about collective attention and descriptor phrases.

Jupyter Notebook 1 Updated Mar 30, 2020

Code for WWW-20 Paper: HTML: Hierarchical Transformer-based Multi-task Learning for Volatility Prediction

Jupyter Notebook 56 10 Updated Jan 5, 2024

"Let’s Make Your Request More Persuasive: Modeling Persuasive Strategies via Semi-Supervised Neural Nets on Crowdfunding Platforms"

Python 6 1 Updated Oct 28, 2019

MixText: Linguistically-Informed Interpolation of Hidden Space for Semi-Supervised Text Classification

Jupyter Notebook 352 61 Updated Jun 5, 2020

Code for "Semi-Supervised Models via Data Augmentation for Classifying Interactive Affective Responses"

Python 15 4 Updated Jun 26, 2020
Next