default search action
Chanjun Park
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j17]Hyeonseok Moon, Myunghoon Kang, Jaehyung Seo, Sugyeong Eo, Chanjun Park, Yeongwook Yang, Heuiseok Lim:
Exploiting Hanja-Based Resources in Processing Korean Historic Documents Written by Common Literati. IEEE Access 12: 59909-59919 (2024) - [c25]Jungseob Lee, Hyeonseok Moon, Seungjun Lee, Chanjun Park, Sugyeong Eo, Hyunwoong Ko, Jaehyung Seo, Seungyoon Lee, Heuiseok Lim:
Length-aware Byte Pair Encoding for Mitigating Over-segmentation in Korean Machine Translation. ACL (Findings) 2024: 2287-2303 - [c24]Jaehyung Seo, Jaewook Lee, Chanjun Park, Seongtae Hong, Seungjun Lee, Heuiseok Lim:
KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models. ACL (Findings) 2024: 2390-2415 - [c23]Chanjun Park, Hyeonwoo Kim, Dahyun Kim, Seonghwan Cho, Sanghoon Kim, Sukyung Lee, Yungi Kim, Hwalsuk Lee:
Open Ko-LLM Leaderboard: Evaluating Large Language Models in Korean with Ko-H5 Benchmark. ACL (1) 2024: 3220-3234 - [c22]Sugyeong Eo, Jungwoo Lim, Chanjun Park, Dahyun Jung, Seonmin Koo, Hyeonseok Moon, Jaehyung Seo, Heuiseok Lim:
Detecting Critical Errors Considering Cross-Cultural Factors in English-Korean Translation. LREC/COLING 2024: 4705-4716 - [c21]Seungyoon Lee, Chanjun Park, Dahyun Jung, Hyeonseok Moon, Jaehyung Seo, Sugyeong Eo, Heuiseok Lim:
Leveraging Pre-existing Resources for Data-Efficient Counter-Narrative Generation in Korean. LREC/COLING 2024: 10380-10392 - [c20]Chanjun Park, Jaehyung Seo, Seolhwa Lee, Junyoung Son, Hyeonseok Moon, Sugyeong Eo, Chanhee Lee, Heuiseok Lim:
Hyper-BTS Dataset: Scalability and Enhanced Analysis of Back TranScription (BTS) for ASR Post-Processing. EACL (Findings) 2024: 67-78 - [c19]Hyeonseok Moon, Jaewook Lee, Sugyeong Eo, Chanjun Park, Jaehyung Seo, Heuiseok Lim:
Generative Interpretation: Toward Human-Like Evaluation for Educational Question-Answer Pair Generation. EACL (Findings) 2024: 2185-2196 - [c18]Sanghoon Kim, Dahyun Kim, Chanjun Park, Wonsung Lee, Wonho Song, Yunsu Kim, Hyeonwoo Kim, Yungi Kim, Hyeonju Lee, Jihoo Kim, Changbae Ahn, Seonghoon Yang, Sukyung Lee, Hyunbyung Park, Gyoungjin Gim, Mikyoung Cha, Hwalsuk Lee, Sunghun Kim:
SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling. NAACL (Industry Track) 2024: 23-35 - [c17]Dahyun Jung, Sugyeong Eo, Chanjun Park, Heuiseok Lim:
Explainable CED: A Dataset for Explainable Critical Error Detection in Machine Translation. NAACL (Student Research Workshop) 2024: 25-35 - [c16]Seungyoon Lee, Dong Kim, Dahyun Jung, Chanjun Park, Heuiseok Lim:
Exploring Inherent Biases in LLMs within Korean Social Context: A Comparative Analysis of ChatGPT and GPT-4. NAACL (Student Research Workshop) 2024: 93-104 - [i33]Seungyoon Lee, Dahyun Jung, Chanjun Park, Seolhwa Lee, Heuiseok Lim:
Alternative Speech: Complementary Method to Counter-Narrative for Better Discourse. CoRR abs/2401.14616 (2024) - [i32]Seonmin Koo, Chanjun Park, Jinsung Kim, Jaehyung Seo, Sugyeong Eo, Hyeonseok Moon, Heuiseok Lim:
Toward Practical Automatic Speech Recognition and Post-Processing: a Call for Explainable Error Benchmark Guideline. CoRR abs/2401.14625 (2024) - [i31]Chanjun Park, Minsoo Khang, Dahyun Kim:
Model-Based Data-Centric AI: Bridging the Divide Between Academic Ideals and Industrial Pragmatism. CoRR abs/2403.01832 (2024) - [i30]Dahyun Kim, Yungi Kim, Wonho Song, Hyeonwoo Kim, Yunsu Kim, Sanghoon Kim, Chanjun Park:
sDPO: Don't Use Your Data All at Once. CoRR abs/2403.19270 (2024) - [i29]Hyunbyung Park, Sukyung Lee, Gyoungjin Gim, Yungi Kim, Dahyun Kim, Chanjun Park:
Dataverse: Open-Source ETL (Extract, Transform, Load) Pipeline for Large Language Models. CoRR abs/2403.19340 (2024) - [i28]Jihoo Kim, Wonho Song, Dahyun Kim, Yunsu Kim, Yungi Kim, Chanjun Park:
Evalverse: Unified and Accessible Library for Large Language Model Evaluation. CoRR abs/2404.00943 (2024) - [i27]Hyeonwoo Kim, Gyoungjin Gim, Yungi Kim, Jihoo Kim, Byungju Kim, Wonseok Lee, Chanjun Park:
SAAS: Solving Ability Amplification Strategy for Enhanced Mathematical Reasoning in Large Language Models. CoRR abs/2404.03887 (2024) - [i26]Hyeonseok Moon, Seungyoon Lee, Seongtae Hong, Seungjun Lee, Chanjun Park, Heuiseok Lim:
Translation of Multifaceted Data without Re-Training of Machine Translation Systems. CoRR abs/2404.16257 (2024) - [i25]Jeiyoon Park, Chanjun Park, Heuiseok Lim:
Enhancing Consistency and Role-Specific Knowledge Capturing by Rebuilding Fictional Character's Persona. CoRR abs/2405.19778 (2024) - [i24]Chanjun Park, Hyeonwoo Kim, Dahyun Kim, Seonghwan Cho, Sanghoon Kim, Sukyung Lee, Yungi Kim, Hwalsuk Lee:
Open Ko-LLM Leaderboard: Evaluating Large Language Models in Korean with Ko-H5 Benchmark. CoRR abs/2405.20574 (2024) - [i23]Jeiyoon Park, Chanjun Park, Heuiseok Lim:
ChatLang-8: An LLM-Based Synthetic Data Generation Framework for Grammatical Error Correction. CoRR abs/2406.03202 (2024) - [i22]Chanjun Park, Hyeonwoo Kim:
Understanding LLM Development Through Longitudinal Study: Insights from the Open Ko-LLM Leaderboard. CoRR abs/2409.03257 (2024) - [i21]Yungi Kim, Hyunsoo Ha, Sukyung Lee, Jihoo Kim, Seonghoon Yang, Chanjun Park:
Rethinking KenLM: Good and Bad Model Ensembles for Efficient Text Quality Filtering in Large Web Corpora. CoRR abs/2409.09613 (2024) - [i20]Chanjun Park, Hyunsoo Ha, Jihoo Kim, Yungi Kim, Dahyun Kim, Sukyung Lee, Seonghoon Yang:
1 Trillion Token (1TT) Platform: A Novel Framework for Efficient Data Sharing and Compensation in Large Language Models. CoRR abs/2409.20149 (2024) - 2023
- [j16]Seonmin Koo, Chanjun Park, Seolhwa Lee, Jaehyung Seo, Sugyeong Eo, Hyeonseok Moon, Heuiseok Lim:
Uncovering the Risks and Drawbacks Associated With the Use of Synthetic Data for Grammatical Error Correction. IEEE Access 11: 95747-95756 (2023) - [j15]Hyeonseok Moon, Chanjun Park, Seonmin Koo, Jungseob Lee, Seungjun Lee, Jaehyung Seo, Sugyeong Eo, Yoonna Jang, Hyunjoong Kim, Hyoung-gyu Lee, Heuiseok Lim:
Doubts on the reliability of parallel corpus filtering. Expert Syst. Appl. 233: 120962 (2023) - [c15]Seungjun Lee, Yoonna Jang, Chanjun Park, Jungseob Lee, Jaehyung Seo, Hyeonseok Moon, Sugyeong Eo, Seounghoon Lee, Bernardo Yahya, Heuiseok Lim:
PEEP-Talk: A Situational Dialogue-based Chatbot for English Education. ACL (demo) 2023: 190-207 - [c14]Seonmin Koo, Chanjun Park, Jinsung Kim, Jaehyung Seo, Sugyeong Eo, Hyeonseok Moon, Heuiseok Lim:
KEBAP: Korean Error Explainable Benchmark Dataset for ASR and Post-processing. EMNLP 2023: 4798-4815 - [c13]Jaehyung Seo, Hyeonseok Moon, Jaewook Lee, Sugyeong Eo, Chanjun Park, Heuiseok Lim:
CHEF in the Language Kitchen: A Generative Data Augmentation Leveraging Korean Morpheme Ingredients. EMNLP 2023: 6014-6029 - [c12]Seungyoon Lee, Dahyun Jung, Chanjun Park, Seolhwa Lee, Heuiseok Lim:
Alternative Speech: Complementary Method to Counter-Narrative for Better Discourse. ICDM (Workshops) 2023: 1438-1442 - [c11]Dahyun Jung, Sugyeong Eo, Chanjun Park, Hyeonseok Moon, Jaehyung Seo, Heuiseok Lim:
Informative Evidence-guided Prompt-based Fine-tuning for English-Korean Critical Error Detection. IJCNLP (1) 2023: 344-358 - [c10]Seungjun Lee, Hyeonseok Moon, Chanjun Park, Heuiseok Lim:
Improving Formality-Sensitive Machine Translation Using Data-Centric Approaches and Prompt Engineering. IWSLT@ACL 2023: 420-432 - [i19]Eujeong Choi, Chanjun Park:
DMOps: Data Management Operation and Recipes. CoRR abs/2301.01228 (2023) - [i18]Chanjun Park, Hyeonseok Moon, Seolhwa Lee, Jaehyung Seo, Sugyeong Eo, Heuiseok Lim:
Self-Improving-Leaderboard(SIL): A Call for Real-World Centric Natural Language Processing Leaderboards. CoRR abs/2303.10888 (2023) - [i17]NamHyeok Kim, Chanjun Park:
Inter-Annotator Agreement in the Wild: Uncovering Its Emerging Roles and Considerations in Real-World Scenarios. CoRR abs/2306.14373 (2023) - [i16]Damrin Kim, NamHyeok Kim, Chanjun Park, Harksoo Kim:
Transcending Traditional Boundaries: Leveraging Inter-Annotator Agreement (IAA) for Enhancing Data Management Operations (DMOps). CoRR abs/2306.14374 (2023) - [i15]Chanjun Park, Seonmin Koo, Seolhwa Lee, Jaehyung Seo, Sugyeong Eo, Hyeonseok Moon, Heuiseok Lim:
Synthetic Alone: Exploring the Dark Side of Synthetic Data for Grammatical Error Correction. CoRR abs/2306.14377 (2023) - [i14]Dahyun Jung, Jaehyung Seo, Jaewook Lee, Chanjun Park, Heuiseok Lim:
Knowledge Graph-Augmented Korean Generative Commonsense Reasoning. CoRR abs/2306.14470 (2023) - [i13]Seungjun Lee, Hyeonseok Moon, Chanjun Park, Heuiseok Lim:
Data-Driven Approach for Formality-Sensitive Machine Translation: Language-Specific Handling and Synthetic Data Generation. CoRR abs/2306.14514 (2023) - [i12]Dahyun Kim, Chanjun Park, Sanghoon Kim, Wonsung Lee, Wonho Song, Yunsu Kim, Hyeonwoo Kim, Yungi Kim, Hyeonju Lee, Jihoo Kim, Changbae Ahn, Seonghoon Yang, Sukyung Lee, Hyunbyung Park, Gyoungjin Gim, Mikyoung Cha, Hwalsuk Lee, Sunghun Kim:
SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling. CoRR abs/2312.15166 (2023) - 2022
- [j14]Hyeonseok Moon, Chanjun Park, Jaehyung Seo, Sugyeong Eo, Heuiseok Lim:
An Automatic Post Editing With Efficient and Simple Data Generation Method. IEEE Access 10: 21032-21040 (2022) - [j13]Chanjun Park, Woo-Young Go, Sugyeong Eo, Hyeonseok Moon, Seolhwa Lee, Heuiseok Lim:
Mimicking Infants' Bilingual Language Acquisition for Domain Specialized Neural Machine Translation. IEEE Access 10: 38684-38693 (2022) - [j12]Sugyeong Eo, Chanjun Park, Hyeonseok Moon, Jaehyung Seo, Heuiseok Lim:
Word-Level Quality Estimation for Korean-English Neural Machine Translation. IEEE Access 10: 44964-44973 (2022) - [j11]Junyoung Son, Hyeonseok Moon, Jeongwoo Lee, Seolhwa Lee, Chanjun Park, Wonkyung Jung, Heuiseok Lim:
AI for Patents: A Novel Yet Effective and Efficient Framework for Patent Analysis. IEEE Access 10: 59205-59218 (2022) - [j10]Myunghoon Kang, Jaehyung Seo, Chanjun Park, Heuiseok Lim:
Utilization Strategy of User Engagements in Korean Fake News Detection. IEEE Access 10: 79516-79525 (2022) - [j9]Jaehyung Seo, Hyeonseok Moon, Chanhee Lee, Sugyeong Eo, Chanjun Park, Jihoon Kim, Changwoo Chun, Heuiseok Lim:
Plain Template Insertion: Korean-Prompt-Based Engineering for Few-Shot Learners. IEEE Access 10: 107587-107597 (2022) - [j8]Seonmin Koo, Chanjun Park, Jaehyung Seo, Seungjun Lee, Hyeonseok Moon, Jungseob Lee, Heuiseok Lim:
K-NCT: Korean Neural Grammatical Error Correction Gold-Standard Test Set Using Novel Error Type Classification Criteria. IEEE Access 10: 118167-118175 (2022) - [j7]Jaehyung Seo, Dongsuk Oh, Sugyeong Eo, Chanjun Park, Kisu Yang, Hyeonseok Moon, Kinam Park, Heuiseok Lim:
PU-GEN: Enhancing generative commonsense reasoning for language models with human-centered knowledge. Knowl. Based Syst. 256: 109861 (2022) - [c9]Sugyeong Eo, Chanjun Park, Hyeonseok Moon, Jaehyung Seo, Gyeongmin Kim, Jungseob Lee, Heuiseok Lim:
QUAK: A Synthetic Quality Estimation Dataset for Korean-English Neural Machine Translation. COLING 2022: 5181-5190 - [c8]Chanjun Park, Yoonna Jang, Seolhwa Lee, Jaehyung Seo, Kisu Yang, Heuiseok Lim:
PicTalky: Augmentative and Alternative Communication for Language Developmental Disabilities. AACL/IJCNLP (System Demonstrations) 2022: 17-27 - [c7]Chanjun Park, Seolhwa Lee, Jaehyung Seo, Hyeonseok Moon, Sugyeong Eo, Heuiseok Lim:
Priming Ancient Korean Neural Machine Translation. LREC 2022: 22-28 - [c6]Hyeonseok Moon, Chanjun Park, Seolhwa Lee, Jaehyung Seo, Jungseob Lee, Sugyeong Eo, Heuiseok Lim:
Empirical Analysis of Noising Scheme based Synthetic Data Generation for Automatic Post-editing. LREC 2022: 883-891 - [c5]Chanjun Park, Yoonna Jang, Seolhwa Lee, Sungjin Park, Heuiseok Lim:
FreeTalky: Don't Be Afraid! Conversations Made Easier by a Humanoid Robot using Persona-based Dialogue. LREC 2022: 1242-1248 - [c4]Jaehyung Seo, Seounghoon Lee, Chanjun Park, Yoonna Jang, Hyeonseok Moon, Sugyeong Eo, Seonmin Koo, Heuiseok Lim:
A Dog Is Passing Over The Jet? A Text-Generation Dataset for Korean Commonsense Reasoning and Evaluation. NAACL-HLT (Findings) 2022: 2233-2249 - [c3]Sugyeong Eo, Chanjun Park, Hyeonseok Moon, Jaehyung Seo, Heuiseok Lim:
KU X Upstage's Submission for the WMT22 Quality Estimation: Critical Error Detection Shared Task. WMT 2022: 606-614 - [i11]Jungseob Lee, Midan Shim, Suhyune Son, Yujin Kim, Chanjun Park, Heuiseok Lim:
Empirical study on BlenderBot 2.0 Errors Analysis in terms of Model, Data and User-Centric Approach. CoRR abs/2201.03239 (2022) - [i10]Suhyune Son, Chanjun Park, Jungseob Lee, Midan Shim, Chanhee Lee, Yoonna Jang, Jaehyung Seo, Heuiseok Lim:
Language Chameleon: Transformation analysis between languages using Cross-lingual Post-training based on Pre-trained language models. CoRR abs/2209.06422 (2022) - [i9]Sugyeong Eo, Chanjun Park, Hyeonseok Moon, Jaehyung Seo, Gyeongmin Kim, Jungseob Lee, Heuiseok Lim:
QUAK: A Synthetic Quality Estimation Dataset for Korean-English Neural Machine Translation. CoRR abs/2209.15285 (2022) - 2021
- [j6]Hyeonseok Moon, Chanjun Park, Sugyeong Eo, Jaehyung Seo, Heuiseok Lim:
An Empirical Study on Automatic Post Editing for Neural Machine Translation. IEEE Access 9: 123754-123763 (2021) - [j5]Kuekyeng Kim, Chanjun Park, Jaehyung Seo, Heuiseok Lim:
Grounded Vocabulary for Image Retrieval Using a Modified Multi-Generator Generative Adversarial Network. IEEE Access 9: 144614-144623 (2021) - [j4]Seolhwa Lee, Kisu Yang, Chanjun Park, João Sedoc, Heuiseok Lim:
Who Speaks Like a Style of Vitamin: Towards Syntax-Aware Dialogue Summarization Using Multi-Task Learning. IEEE Access 9: 168889-168898 (2021) - [j3]Chanjun Park, Kuekyeng Kim, YeongWook Yang, Minho Kang, Heuiseok Lim:
Neural spelling correction: translating incorrect sentences to correct sentences for multimedia. Multim. Tools Appl. 80(26): 34591-34608 (2021) - [c2]Chanjun Park, Jaehyung Seo, Seolhwa Lee, Chanhee Lee, Hyeonseok Moon, Sugyeong Eo, Heuiseok Lim:
BTS: Back TranScription for Speech-to-Text Post-Processor using Text-to-Speech-to-Text. WAT@ACL/IJCNLP 2021: 106-116 - [c1]Chanjun Park, Sugyeong Eo, Hyeonseok Moon, Heuiseok Lim:
Should we find another model?: Improving Neural Machine Translation Performance with ONE-Piece Tokenization Method without Model Modification. NAACL-HLT (Industry Papers) 2021: 97-104 - [i8]Chanjun Park, Yoonna Jang, Seolhwa Lee, Jaehyung Seo, Kisu Yang, Heuiseok Lim:
PicTalky: Augmentative and Alternative Communication Software for Language Developmental Disabilities. CoRR abs/2109.12941 (2021) - [i7]Seolhwa Lee, Kisu Yang, Chanjun Park, João Sedoc, Heuiseok Lim:
Who says like a style of Vitamin: Towards Syntax-Aware DialogueSummarization using Multi-task Learning. CoRR abs/2109.14199 (2021) - [i6]Chanjun Park, Midan Shim, Sugyeong Eo, Seolhwa Lee, Jaehyung Seo, Hyeonseok Moon, Heuiseok Lim:
Empirical Analysis of Korean Public AI Hub Parallel Corpora and in-depth Analysis using LIWC. CoRR abs/2110.15023 (2021) - [i5]Chanjun Park, Seolhwa Lee, Hyeonseok Moon, Sugyeong Eo, Jaehyung Seo, Heuiseok Lim:
How should human translation coexist with NMT? Efficient tool for building high quality parallel corpus. CoRR abs/2111.00191 (2021) - [i4]Jaehyung Seo, Chanjun Park, Sugyeong Eo, Hyeonseok Moon, Heuiseok Lim:
Automatic Knowledge Augmentation for Generative Commonsense Reasoning. CoRR abs/2111.00192 (2021) - [i3]Sugyeong Eo, Chanjun Park, Jaehyung Seo, Hyeonseok Moon, Heuiseok Lim:
A New Tool for Efficiently Generating Quality Estimation Datasets. CoRR abs/2111.00767 (2021) - [i2]Hyeonseok Moon, Chanjun Park, Sugyeong Eo, Jaehyung Seo, Seungjun Lee, Heuiseok Lim:
A Self-Supervised Automatic Post-Editing Data Generation Tool. CoRR abs/2111.12284 (2021) - [i1]Chanjun Park, Yoonna Jang, Seolhwa Lee, Sungjin Park, Heuiseok Lim:
FreeTalky: Don't Be Afraid! Conversations Made Easier by a Humanoid Robot using Persona-based Dialogue. CoRR abs/2112.04126 (2021) - 2020
- [j2]Chanjun Park, YeongWook Yang, Chanhee Lee, Heuiseok Lim:
Comparison of the Evaluation Metrics for Neural Grammatical Error Correction With Overcorrection. IEEE Access 8: 106264-106272 (2020) - [j1]Chanjun Park, Chanhee Lee, YeongWook Yang, Heuiseok Lim:
Ancient Korean Neural Machine Translation. IEEE Access 8: 116617-116625 (2020)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-21 20:29 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint