Out of Memory

Organizations

@RUCAIBox


NanoGPT (124M) in 90 seconds

Python 5,260 767 Updated May 14, 2026

pytorch-tiny-imagenet

Jupyter Notebook 192 45 Updated Feb 11, 2026

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 160,699 33,237 Updated May 18, 2026
Python 6 1 Updated Feb 18, 2025

Low-bit optimizers for PyTorch

Python 138 9 Updated Oct 9, 2023

Deep neural network framework for multiple GPUs

Cuda 34 15 Updated Jun 20, 2015

Accessible large language models via k-bit quantization for PyTorch.

Python 8,208 852 Updated May 15, 2026

Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".

Python 2,305 196 Updated Mar 27, 2024

Official implementation of ICML 2024 paper "ExCP: Extreme LLM Checkpoint Compression via Weight-Momentum Joint Shrinking".

Python 48 2 Updated Jul 12, 2024

Skyformer: Remodel Self-Attention with Gaussian Kernel and Nyström Method (NeurIPS 2021)

Python 61 10 Updated Apr 19, 2022

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 8,963 616 Updated May 3, 2024

KarSein for CTR prediction

Python 6 1 Updated Feb 5, 2025

The official implementation of Ada2Fair (RecSys'24 Short Paper).

Python 6 4 Updated Jan 21, 2025

[KDD 2026] FCN: Fusing Exponential and Linear Cross Network for Click-Through Rate Prediction

Python 62 9 Updated Nov 26, 2025

Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Python 1,368 84 Updated Jul 14, 2024

[KDD 2024] This is the official PyTorch implementation for the paper: "Rotative Factorization Machines"

Python 3 Updated Aug 20, 2024

Long Range Arena for Benchmarking Efficient Transformers

Python 788 86 Updated Dec 16, 2023
Python 1,655 149 Updated Apr 27, 2023

ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.

Python 9,499 691 Updated Feb 11, 2026

Repository hosting code for "Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations" (https://arxiv.org/abs/2402.17152).

Python 1,891 379 Updated May 5, 2026

[KDD 2024] This is the official PyTorch implementation for the paper: "Rotative Factorization Machines"

Python 3 1 Updated Aug 20, 2024
HTML 1 Updated May 17, 2024

Code for ACM RecSys 2023 paper "Turning Dross Into Gold Loss: Is BERT4Rec really better than SASRec?"

Jupyter Notebook 61 15 Updated Feb 24, 2024

A comprehensive library for implementing LLMs, including a unified training pipeline and comprehensive model evaluation.

Python 850 106 Updated Jun 16, 2025

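A practical interactive interface for GPT/GLM and other large language models, specially optimized for reading, polishing, and writing papers. Modular design with support for custom shortcut buttons and function plugins; project analysis and self-explanation for Python, C++, and other codebases; PDF/LaTeX paper translation and summarization; parallel querying of multiple LLMs; and local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, m…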

Python 70,669 8,395 Updated Jan 25, 2026

Scripts for processing the Amazon Reviews 2023 dataset; implementations and checkpoints of BLaIR: "Bridging Language and Items for Retrieval and Recommendation".

Python 276 45 Updated Mar 11, 2025
Python 31 4 Updated Sep 3, 2023

Benchmarks for classification of genomic sequences

Jupyter Notebook 174 24 Updated Aug 14, 2025

Official implementation for HyenaDNA, a long-range genomic foundation model built with Hyena

Assembly 785 107 Updated Apr 22, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 71,341 8,714 Updated May 13, 2026