seungyeon-seo

Follow

🤪

Seungyeon Seo seungyeon-seo

🤪

Follow

34 followers · 47 following

POSTECH NLP Group
South Korea / Pohang
https://www.linkedin.com/in/seungyeon-seo-188798185/
https://orcid.org/0009-0006-7392-7546

Achievements

Achievements

Lists (2)

Sort

dataset

model

Stars

23 stars written in Python

huggingface / transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 159,243 32,845 Updated Apr 11, 2026

hiyouga / LlamaFactory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 69,960 8,543 Updated Apr 12, 2026

huggingface / peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 20,914 2,245 Updated Apr 10, 2026

NVIDIA-NeMo / NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 17,073 3,401 Updated Apr 12, 2026

imoneoi / openchat

OpenChat: Advancing Open-source Language Models with Imperfect Data

Python 5,478 435 Updated Sep 13, 2024

adapter-hub / adapters

A Unified Library for Parameter-Efficient and Modular Transfer Learning

Python 2,810 372 Updated Mar 21, 2026

AkariAsai / self-rag

This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.

Python 2,355 220 Updated May 25, 2024

PolyAI-LDN / conversational-datasets

Large datasets for conversational AI

Python 1,392 178 Updated Nov 16, 2019

facebookresearch / fairseq2

FAIR Sequence Modeling Toolkit 2

Python 1,128 138 Updated Apr 10, 2026

prometheus-eval / prometheus-eval

Evaluate your LLM's response with Prometheus and GPT4 💯

Python 1,066 70 Updated Apr 25, 2025

budzianowski / multiwoz

Source code for end-to-end dialogue model from the MultiWOZ paper (Budzianowski et al. 2018, EMNLP)

Python 943 206 Updated Mar 18, 2026

salesforce / DialogStudio

DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection and Instruction-Aware Models for Conversational AI

Python 521 37 Updated Jan 27, 2025

kharrigian / mental-health-datasets

An evolving list of electronic media data sets used to model mental-health status.

Python 468 79 Updated Sep 3, 2021

zhongwanjun / MemoryBank-SiliconFriend

Source code and demo for memory bank and SiliconFriend

Python 419 60 Updated May 24, 2023

skywalker023 / sodaverse

🥤🧑🏻‍🚀Code and dataset for our EMNLP 2023 paper - "SODA: Million-scale Dialogue Distillation with Social Commonsense Contextualization"

Python 240 14 Updated Jan 23, 2026

HuangLK / transpeeder

train llama on a single A100 80G node using 🤗 transformers and 🚀 Deepspeed Pipeline Parallelism

Python 224 21 Updated Nov 21, 2023

xyjigsaw / LLM-Pretrain-SFT

Scripts of LLM pre-training and fine-tuning (w/wo LoRA, DeepSpeed)

Python 87 16 Updated Jan 30, 2024

tomyoung903 / FusedChat

FusedChat is a dialogue dataset. It contains dialogue sessions fusing task-oriented dialogues and open-domain dialogues.

Python 29 2 Updated Jul 20, 2022

emphasis10 / AI-paper-digest

Python 22 4 Updated Jul 5, 2025

sogang-isds / TOATOD

Task-Optimized Adapters for an End-to-End Dialogue System Paper Code

Python 21 5 Updated Jul 31, 2023

qhjqhj00 / CIKM2021-IMPChat

CIKM 2021: Learning Implicit User Profile for Personalized Retrieval-based Chatbot

Python 11 2 Updated Sep 5, 2021

Cyn7hia / PAED

This is a repository for our paper (PAED: Zero-Shot Persona Attribute Extraction in Dialogues) accepted in ACL'23.

Python 9 4 Updated Oct 2, 2023

JihyunLee1 / PicPersona

Official Code for PicPersonaTOD

Python 7 Updated Sep 9, 2025