Skip to content
View seungyeon-seo's full-sized avatar
🤪
🤪

Block or report seungyeon-seo

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
23 stars written in Python
Clear filter

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 159,243 32,845 Updated Apr 11, 2026

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 69,960 8,543 Updated Apr 12, 2026

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 20,914 2,245 Updated Apr 10, 2026

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 17,073 3,401 Updated Apr 12, 2026

OpenChat: Advancing Open-source Language Models with Imperfect Data

Python 5,478 435 Updated Sep 13, 2024

A Unified Library for Parameter-Efficient and Modular Transfer Learning

Python 2,810 372 Updated Mar 21, 2026

This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.

Python 2,355 220 Updated May 25, 2024

Large datasets for conversational AI

Python 1,392 178 Updated Nov 16, 2019

FAIR Sequence Modeling Toolkit 2

Python 1,128 138 Updated Apr 10, 2026

Evaluate your LLM's response with Prometheus and GPT4 💯

Python 1,066 70 Updated Apr 25, 2025

Source code for end-to-end dialogue model from the MultiWOZ paper (Budzianowski et al. 2018, EMNLP)

Python 943 206 Updated Mar 18, 2026

DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection and Instruction-Aware Models for Conversational AI

Python 521 37 Updated Jan 27, 2025

An evolving list of electronic media data sets used to model mental-health status.

Python 468 79 Updated Sep 3, 2021

Source code and demo for memory bank and SiliconFriend

Python 419 60 Updated May 24, 2023

🥤🧑🏻‍🚀Code and dataset for our EMNLP 2023 paper - "SODA: Million-scale Dialogue Distillation with Social Commonsense Contextualization"

Python 240 14 Updated Jan 23, 2026

train llama on a single A100 80G node using 🤗 transformers and 🚀 Deepspeed Pipeline Parallelism

Python 224 21 Updated Nov 21, 2023

Scripts of LLM pre-training and fine-tuning (w/wo LoRA, DeepSpeed)

Python 87 16 Updated Jan 30, 2024

FusedChat is a dialogue dataset. It contains dialogue sessions fusing task-oriented dialogues and open-domain dialogues.

Python 29 2 Updated Jul 20, 2022
Python 22 4 Updated Jul 5, 2025

Task-Optimized Adapters for an End-to-End Dialogue System Paper Code

Python 21 5 Updated Jul 31, 2023

CIKM 2021: Learning Implicit User Profile for Personalized Retrieval-based Chatbot

Python 11 2 Updated Sep 5, 2021

This is a repository for our paper (PAED: Zero-Shot Persona Attribute Extraction in Dialogues) accepted in ACL'23.

Python 9 4 Updated Oct 2, 2023

Official Code for PicPersonaTOD

Python 7 Updated Sep 9, 2025