Geon-Hyeong Kim

인용

	전체	2020년 이후
서지정보	417	411
h-index	7	7
i10-index	6	6

160

120

20192020202120222023202420255 8 24 52 113 141 73

공동 저자

Kee-Eung KimKAISTkaist.ac.kr의 이메일 확인됨
Jongmin LeeYonsei Universityyonsei.ac.kr의 이메일 확인됨
HyeongJoo HwangSamsung AI Centersamsung.com의 이메일 확인됨
Youngsoo JangLG AI Researchlgresearch.ai의 이메일 확인됨
Hongseok YangProfessor, School of Computing, KAISTkaist.ac.kr의 이메일 확인됨
Wonseok JeonWaymowaymo.com의 이메일 확인됨
Seunghoon HongAssociate Professor, KAISTkaist.ac.kr의 이메일 확인됨
Seokin SeoKAIST AIai.kaist.ac.kr의 이메일 확인됨
Pascal PoupartUniversity of Waterloouwaterloo.ca의 이메일 확인됨
Kanghoon LeeLG AI Researchlgresearch.ai의 이메일 확인됨
Daniel D. LeeTisch University Professor of ECE, Cornell Universityalum.mit.edu의 이메일 확인됨
Pedro A. OrtegaArtificial Intelligence & Machine Learningadaptiveagents.org의 이메일 확인됨

팔로우

Geon-Hyeong Kim

LG AI Research

lgresearch.ai의 이메일 확인됨 - 홈페이지

Imitation Learning Reinforcement Learning


제목 서지정보순 정렬 연도순 정렬 제목순 정렬	인용 인용	연도
Demodice: Offline imitation learning with supplementary imperfect demonstrations GH Kim, S Seo, J Lee, W Jeon, HJ Hwang, H Yang, KE Kim International Conference on Learning Representations, 2022	113	2022
Monte-Carlo tree search for constrained POMDPs J Lee, GH Kim, P Poupart, KE Kim Advances in Neural Information Processing Systems 31, 2018	90	2018
Variational interaction information maximization for cross-domain disentanglement HJ Hwang, GH Kim, S Hong, KE Kim Advances in Neural Information Processing Systems 33, 22479-22491, 2020	61	2020
Multi-view representation learning via total correlation objective HJ Hwang, GH Kim, S Hong, KE Kim Advances in Neural Information Processing Systems 34, 12194-12207, 2021	56	2021
Monte-carlo tree search in continuous action spaces with value gradients J Lee, W Jeon, GH Kim, KE Kim Proceedings of the AAAI conference on artificial intelligence 34 (04), 4561-4568, 2020	38	2020
Lobsdice: Offline learning from observation via stationary distribution correction estimation GH Kim, J Lee, Y Jang, H Yang, KE Kim Advances in Neural Information Processing Systems 35, 8252-8264, 2022	28	2022
Prospector: Improving LLM agents with self-asking and trajectory ranking B Kim, Y Jang, L Logeswaran, GH Kim, YJ Kim, H Lee, M Lee Findings of the Association for Computational Linguistics: EMNLP 2024, 14958 …, 2024	9	2024
Variational inference for sequential data with future likelihood estimates GH Kim, Y Jang, H Yang, KE Kim International Conference on Machine Learning, 5296-5305, 2020	6	2020
SafeDPO: A simple approach to direct preference optimization with enhanced safety GH Kim, Y Jang, YJ Kim, B Kim, H Lee, K Bae, M Lee arXiv preprint arXiv:2505.20065, 2025	5	2025
Information-theoretic state space model for multi-view reinforcement learning HJ Hwang, S Seo, Y Jang, S Kim, GH Kim, S Hong, KE Kim	5	2023
Safedice: offline safe imitation learning with non-preferred demonstrations Y Jang, GH Kim, J Lee, S Sohn, B Kim, H Lee, M Lee Advances in Neural Information Processing Systems 36, 74921-74951, 2023	2	2023
Trust region sequential variational inference GH Kim, Y Jang, J Lee, W Jeon, H Yang, KE Kim Asian conference on machine learning, 1033-1048, 2019	2	2019
Bayesian optimistic kullback–leibler exploration K Lee, GH Kim, P Ortega, DD Lee, KE Kim Machine Learning 108 (5), 765-783, 2019	2	2019
Online Pre-Training for Offline-to-Online Reinforcement Learning Y Shin, J Kim, W Jung, S Hong, D Yoon, Y Jang, G Kim, J Chae, Y Sung, ... arXiv preprint arXiv:2507.08387, 2025		2025
Degeneration-free policy optimization: RL fine-tuning for language models without degeneration Y Jang, GH Kim, B Kim, YJ Kim, H Lee, M Lee Forty-first International Conference on Machine Learning, 2024		2024
DfPO: Degeneration-free Policy Optimization via Action Masking in Natural Language Action Spaces Y Jang, GH Kim, B Kim, H Lee, M Lee

현재 시스템이 작동되지 않습니다. 나중에 다시 시도해 주세요.

학술자료 1–16

연간 인용횟수

중복된 서지정보

병합된 서지정보

공동 저자 추가공동 저자

팔로우

인용

공동 저자