gaotianyu1350

Tianyu Gao gaotianyu1350

PhD student at Princeton University.

1k followers · 10 following

Stars

facebookresearch / faiss

A library for efficient similarity search and clustering of dense vectors.

C++ 37,831 4,100 Updated Nov 8, 2025

iina / iina

The modern video player for macOS.

Swift 42,521 2,707 Updated Nov 8, 2025

huggingface / transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 152,230 31,082 Updated Nov 7, 2025

castorini / pyserini

Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.

Python 1,975 462 Updated Nov 7, 2025

OpenRA / OpenRA

Open Source real-time strategy game engine for early Westwood games such as Command & Conquer: Red Alert written in C# using SDL and OpenGL. Runs on Windows, Linux, *BSD and Mac OS X.

C# 16,099 2,821 Updated Nov 7, 2025

rsta2 / circle

A C++ bare metal environment for Raspberry Pi with USB (32 and 64 bit)

C 2,113 280 Updated Nov 7, 2025

PaddlePaddle / ERNIE

The official repository for ERNIE 4.5 and ERNIEKit – its industrial-grade development toolkit based on PaddlePaddle.

Python 7,527 1,441 Updated Nov 7, 2025

NVIDIA / apex

A PyTorch Extension: Tools for easy mixed precision and distributed training in PyTorch

Python 8,842 1,500 Updated Nov 7, 2025

google-research / t5x

Python 2,907 336 Updated Nov 6, 2025

fla-org / flash-linear-attention

🚀 Efficient implementations of state-of-the-art linear attention models

Python 3,781 296 Updated Nov 6, 2025

mirage-project / mirage

Mirage Persistent Kernel: Compiling LLMs into a MegaKernel

C++ 1,939 148 Updated Nov 5, 2025

bitsandbytes-foundation / bitsandbytes

Accessible large language models via k-bit quantization for PyTorch.

Python 7,729 793 Updated Nov 4, 2025

huggingface / accelerate

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Python 9,272 1,219 Updated Nov 4, 2025

dvanoni / notero

A Zotero plugin for syncing items and notes into Notion

TypeScript 2,958 122 Updated Nov 2, 2025

shidenggui / easytrader

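Quantitative stock trading for the Tonghuashun (同花顺) client, miniqmt, and Xueqiu (雪球); supports following joinquant/ricequant simulated trading and live Xueqiu portfolios.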

Python 9,002 2,840 Updated Nov 2, 2025

google-research / language

Shared repository for open-sourced projects from the Google AI Language team.

Python 1,720 355 Updated Oct 29, 2025

posquit0 / Awesome-CV

📄 Awesome CV is a LaTeX template for your outstanding job application

TeX 25,650 5,105 Updated Oct 27, 2025

mosaicml / streaming

A Data Streaming Library for Efficient Neural Network Training

Python 1,410 176 Updated Oct 27, 2025

OpenBMB / BMTrain

Efficient Training (including pre-training and fine-tuning) for Big Models

Python 613 83 Updated Oct 27, 2025

tuna / thuthesis

LaTeX Thesis Template for Tsinghua University

TeX 5,021 1,127 Updated Oct 19, 2025

beir-cellar / beir

A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.

Python 1,992 222 Updated Oct 16, 2025

deepspeedai / DeepSpeedExamples

Example models using DeepSpeed

Python 6,710 1,108 Updated Oct 15, 2025

OpenBMB / MiniCPM

MiniCPM4 & MiniCPM4.1: Ultra-Efficient LLMs on End Devices, achieving 3+ generation speedup on reasoning tasks

Jupyter Notebook 8,410 520 Updated Oct 8, 2025

facebookresearch / fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 31,924 6,624 Updated Sep 30, 2025

davidsbatista / BREDS

"Bootstrapping Relationship Extractors with Distributional Semantics" (Batista et al., 2015) in EMNLP'15 - Python implementation

Python 143 38 Updated Aug 23, 2025

stanfordnlp / GloVe

Software in C and data files for the popular GloVe model for distributed word representations, a.k.a. word vectors or embeddings

C 7,125 1,548 Updated Jul 27, 2025

facebookresearch / dpr-scale

Scalable training for dense retrieval models.

Python 297 32 Updated Jun 10, 2025

google-research / arxiv-latex-cleaner

arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv

Python 6,546 375 Updated Jun 2, 2025

aws-samples / aws-plugin-for-slurm

A sample integration of AWS services with SLURM

Shell 80 39 Updated Apr 18, 2025

FMInference / FlexLLMGen

Running large language models on a single GPU for throughput-oriented scenarios.

Python 9,375 583 Updated Oct 28, 2024