Skip to content
View gaotianyu1350's full-sized avatar

Highlights

  • Pro

Organizations

@princeton-nlp @princeton-pli

Block or report gaotianyu1350

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821

Python 3,627 534 Updated Oct 16, 2024

Mirage Persistent Kernel: Compiling LLMs into a MegaKernel

C++ 2,002 161 Updated Dec 20, 2025

A Data Streaming Library for Efficient Neural Network Training

Python 1,433 182 Updated Oct 27, 2025

🚀 Efficient implementations of state-of-the-art linear attention models

Python 4,105 336 Updated Dec 20, 2025

Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, implementing, and testing new search methods. Baguetter support…

Python 201 10 Updated Aug 31, 2024

MiniCPM4 & MiniCPM4.1: Ultra-Efficient LLMs on End Devices, achieving 3+ generation speedup on reasoning tasks

Jupyter Notebook 8,475 526 Updated Oct 8, 2025

Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input"

Python 1,063 78 Updated Mar 7, 2024

Accessible large language models via k-bit quantization for PyTorch.

Python 7,842 803 Updated Dec 12, 2025

A plug-and-play library for parameter-efficient-tuning (Delta Tuning)

Python 1,039 84 Updated Sep 19, 2024

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 30,259 4,030 Updated Jul 17, 2024

PyTorch + HuggingFace code for RetoMaton: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022), including an implementation of kNN-LM and kNN-MT

Python 285 24 Updated Oct 20, 2022

Efficient, Low-Resource, Distributed transformer implementation based on BMTrain

Python 264 31 Updated Nov 27, 2023

Efficient Training (including pre-training and fine-tuning) for Big Models

Python 614 84 Updated Oct 27, 2025

Running large language models on a single GPU for throughput-oriented scenarios.

Python 9,382 588 Updated Oct 28, 2024

[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

Python 1,575 189 Updated Jul 12, 2024

Scalable training for dense retrieval models.

Python 298 32 Updated Jun 10, 2025

Model API for GALACTICA

Jupyter Notebook 2,741 269 Updated Mar 5, 2023

[EMNLP 2022] Training Language Models with Memory Augmentation https://arxiv.org/abs/2205.12674

Python 196 13 Updated Jun 14, 2023

Contriever: Unsupervised Dense Information Retrieval with Contrastive Learning

Python 766 69 Updated Apr 7, 2023

This repository provides details and links to the ACL anthology corpus/collection including .bib, .pdf and grobid extractions of the pdfs

Jupyter Notebook 187 14 Updated Oct 12, 2023

Source code and dataset for EMNLP 2020 paper "MAVEN: A Massive General Domain Event Detection Dataset".

Python 161 38 Updated Jan 5, 2022

A latent text-to-image diffusion model

Jupyter Notebook 72,046 10,553 Updated Jun 18, 2024

Official code and model checkpoints for our EMNLP 2022 paper "RankGen - Improving Text Generation with Large Ranking Models" (https://arxiv.org/abs/2205.09726).

Python 138 11 Updated Aug 2, 2023

A framework for training and evaluating AI models on a variety of openly available dialogue datasets.

Python 10,624 2,086 Updated Nov 3, 2023
Python 99 32 Updated Aug 28, 2018

The source code of our COLING'18 paper "Few-Shot Charge Prediction with Discriminative Legal Attributes".

Python 130 29 Updated Dec 20, 2018

The respository of jec-qa.

Python 59 4 Updated Feb 2, 2020

Open Chinese Language Pre-trained Model Zoo

988 147 Updated Mar 18, 2020
Python 26 2 Updated Apr 11, 2020
80 49 Updated Jun 29, 2020
Next