View tonytan48's full-sized avatar


a distributed deep learning platform

C++ 3,607 1,269 Updated May 15, 2026

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 21,161 1,809 Updated Mar 5, 2026

Data and code for paper "M3Exam: A Multilingual, Multimodal, Multilevel Benchmark for Examining Large Language Models"

Python 104 13 Updated Jun 15, 2023

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

Python 3,145 287 Updated Jun 4, 2024

This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks.

Python 553 45 Updated Mar 10, 2024

LLM Zoo collects information on various open- and closed-source LLMs

271 13 Updated Aug 23, 2023

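Panda is an open-source overseas Chinese large language model project launched in May 2023. It is dedicated to exploring the entire technology stack in the era of large models and aims to promote innovation and collaboration in Chinese natural language processing.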

Python 1,034 76 Updated Oct 19, 2023

Semantic cache for LLMs. Fully integrated with LangChain and llama_index.

Python 8,032 583 Updated Jul 11, 2025

Code and data accompanying our paper on arXiv "Faithful Chain-of-Thought Reasoning".

Python 167 14 Updated May 7, 2024

RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…

Python 14,530 1,009 Updated May 8, 2026

ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.

Python 9,499 691 Updated Feb 11, 2026

Making large AI models cheaper, faster and more accessible

Python 41,382 4,512 Updated May 11, 2026

Code repository supporting the paper "Atlas: Few-shot Learning with Retrieval Augmented Language Models" (https://arxiv.org/abs/2208.03299)

Python 559 73 Updated Apr 8, 2026

Repo for Generating Flashbacks in Stories (NAACL'22)

Python 6 1 Updated Apr 28, 2022

A modular RL library to fine-tune language models to human preferences

Python 2,387 202 Updated Mar 1, 2024

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Python 4,748 485 Updated Jan 8, 2024

This repository implements our EMNLP 2022 research paper "A Dataset for Hyper-Relational Extraction and a Cube-Filling Approach".

Python 28 2 Updated Dec 13, 2022

Code and data for the NeurIPS 2021 paper "A Dataset for Answering Time-Sensitive Questions"

Jupyter Notebook 77 5 Updated Mar 3, 2022
Python 65 3 Updated Aug 21, 2023

Implementation of Document-level Relation Extraction with Knowledge Distillation and Adaptive Focal Loss

Python 116 22 Updated Nov 12, 2022

Winning system (DAMO-NLP) of the SemEval 2022 MultiCoNER shared task, winning 10 of the 13 tracks.

Python 187 21 Updated Jan 10, 2023
Jupyter Notebook 17 Updated May 17, 2022

[ICLR 2020] Contrastive Representation Distillation (CRD), and benchmark of recent knowledge distillation methods

Python 2,432 397 Updated Oct 16, 2023

Code associated with the paper "Two are Better Than One: Joint Entity and Relation Extraction with Table-Sequence Encoders", at EMNLP 2020

Python 194 45 Updated Apr 14, 2022

EasyTransfer is designed to make the development of transfer learning in NLP applications easier.

Python 862 161 Updated Aug 25, 2022

DeLighT: Very Deep and Light-Weight Transformers

Python 469 52 Updated Oct 16, 2020

Language Model Baselines for PyTorch

Python 41 4 Updated Aug 18, 2020

Official implementation of the papers "GECToR – Grammatical Error Correction: Tag, Not Rewrite" (BEA-20) and "Text Simplification by Tagging" (BEA-21)

Python 966 219 Updated May 21, 2024

A single model that parses Universal Dependencies across 75 languages. Given a sentence, jointly predicts part-of-speech tags, morphology tags, lemmas, and dependency trees.

Python 225 57 Updated Dec 20, 2022