Skip to content
View gaotianyu1350's full-sized avatar

Highlights

  • Pro

Organizations

@princeton-nlp @princeton-pli

Block or report gaotianyu1350

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821

Python 3,609 531 Updated Oct 16, 2024

Mirage Persistent Kernel: Compiling LLMs into a MegaKernel

C++ 1,934 148 Updated Nov 5, 2025

A Data Streaming Library for Efficient Neural Network Training

Python 1,408 176 Updated Oct 27, 2025

🚀 Efficient implementations of state-of-the-art linear attention models

Python 3,744 292 Updated Nov 5, 2025

Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, implementing, and testing new search methods. Baguetter support…

Python 190 10 Updated Aug 31, 2024

MiniCPM4 & MiniCPM4.1: Ultra-Efficient LLMs on End Devices, achieving 3+ generation speedup on reasoning tasks

Jupyter Notebook 8,408 519 Updated Oct 8, 2025

Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input"

Python 1,063 78 Updated Mar 7, 2024

Accessible large language models via k-bit quantization for PyTorch.

Python 7,719 791 Updated Nov 4, 2025

A plug-and-play library for parameter-efficient-tuning (Delta Tuning)

Python 1,037 83 Updated Sep 19, 2024

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 30,211 4,036 Updated Jul 17, 2024

PyTorch + HuggingFace code for RetoMaton: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022), including an implementation of kNN-LM and kNN-MT

Python 282 25 Updated Oct 20, 2022

Efficient, Low-Resource, Distributed transformer implementation based on BMTrain

Python 266 30 Updated Nov 27, 2023

Efficient Training (including pre-training and fine-tuning) for Big Models

Python 612 82 Updated Oct 27, 2025

Running large language models on a single GPU for throughput-oriented scenarios.

Python 9,372 584 Updated Oct 28, 2024

[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

Python 1,542 187 Updated Jul 12, 2024

Scalable training for dense retrieval models.

Python 297 32 Updated Jun 10, 2025

Model API for GALACTICA

Jupyter Notebook 2,737 271 Updated Mar 5, 2023

[EMNLP 2022] Training Language Models with Memory Augmentation https://arxiv.org/abs/2205.12674

Python 196 13 Updated Jun 14, 2023

Contriever: Unsupervised Dense Information Retrieval with Contrastive Learning

Python 760 66 Updated Apr 7, 2023

This repository provides details and links to the ACL anthology corpus/collection including .bib, .pdf and grobid extractions of the pdfs

Jupyter Notebook 184 15 Updated Oct 12, 2023

Source code and dataset for EMNLP 2020 paper "MAVEN: A Massive General Domain Event Detection Dataset".

Python 160 39 Updated Jan 5, 2022

A latent text-to-image diffusion model

Jupyter Notebook 71,752 10,516 Updated Jun 18, 2024

Official code and model checkpoints for our EMNLP 2022 paper "RankGen - Improving Text Generation with Large Ranking Models" (https://arxiv.org/abs/2205.09726).

Python 138 11 Updated Aug 2, 2023

A framework for training and evaluating AI models on a variety of openly available dialogue datasets.

Python 10,622 2,091 Updated Nov 3, 2023
Python 98 32 Updated Aug 28, 2018

The source code of our COLING'18 paper "Few-Shot Charge Prediction with Discriminative Legal Attributes".

Python 130 29 Updated Dec 20, 2018

The respository of jec-qa.

Python 57 2 Updated Feb 2, 2020

Open Chinese Language Pre-trained Model Zoo

988 147 Updated Mar 18, 2020
Python 26 2 Updated Apr 11, 2020
81 49 Updated Jun 29, 2020
Next