User profiles for Deheng Ye

Deheng Ye

Director of AI Applications, Tencent
Verified email at e.ntu.edu.sg
Cited by 2667

Mastering complex control in moba games with deep reinforcement learning

D Ye, Z Liu, M Sun, B Shi, P Zhao, H Wu, H Yu… - Proceedings of the …, 2020 - ojs.aaai.org
We study the reinforcement learning problem of complex action control in the Multi-player
Online Battle Arena (MOBA) 1v1 games. This problem involves far more complicated state and …

Towards playing full moba games with deep reinforcement learning

D Ye, G Chen, W Zhang, S Chen… - Advances in …, 2020 - proceedings.neurips.cc
MOBA games, eg, Honor of Kings, League of Legends, and Dota 2, pose grand challenges
to AI systems such as multi-agent, enormous state-action space, complex action control, etc. …

Supervised learning achieves human-level performance in moba games: A case study of honor of kings

D Ye, G Chen, P Zhao, F Qiu, B Yuan… - IEEE transactions on …, 2020 - ieeexplore.ieee.org
We present JueWu-SL, the first supervised-learning-based artificial intelligence (AI)
program that achieves human-level performance in playing multiplayer online battle arena (MOBA) …

Learning diverse policies in moba games via macro-goals

…, Z Lian, F Qiu, G Han, W Wang, D Ye… - Advances in …, 2021 - proceedings.neurips.cc
Recently, many researchers have made successful progress in building the AI systems for
MOBA-game-playing with deep reinforcement learning, such as on Dota 2 and Honor of Kings…

Ensemble application of convolutional and recurrent neural networks for multi-label text categorization

G Chen, D Ye, Z Xing, J Chen… - 2017 International joint …, 2017 - ieeexplore.ieee.org
Text categorization, or text classification, is one of key tasks for representing the semantic
information of documents. Multi-label text categorization is finer-grained approach to text …

More agents is all you need

J Li, Q Zhang, Y Yu, Q Fu, D Ye - arXiv preprint arXiv:2402.05120, 2024 - arxiv.org
We find that, simply via a sampling-and-voting method, the performance of large language
models (LLMs) scales with the number of agents instantiated. Also, this method, termed as …

Minerl diamond 2021 competition: Overview, results, and lessons learned

…, N Topin, Z Lin, J Li, J Shi, D Ye… - NeurIPS 2021 …, 2022 - proceedings.mlr.press
Reinforcement learning competitions advance the field by providing appropriate scope and
support to develop solutions toward a specific problem. To promote the development of more …

A survey on transformers in reinforcement learning

W Li, H Luo, Z Lin, C Zhang, Z Lu, D Ye - arXiv preprint arXiv:2301.03044, 2023 - arxiv.org
Transformer has been considered the dominating neural architecture in NLP and CV, mostly
under supervised settings. Recently, a similar surge of using Transformers has appeared in …

Predicting semantically linkable knowledge in developer online forums via convolutional neural network

B Xu, D Ye, Z Xing, X Xia, G Chen, S Li - Proceedings of the 31st IEEE …, 2016 - dl.acm.org
Consider a question and its answers in Stack Overflow as a knowledge unit. Knowledge
units often contain semantically relevant knowledge, and thus linkable for different purposes, …

Rltf: Reinforcement learning from unit test feedback

…, Y Zhu, K Xiao, Q Fu, X Han, W Yang, D Ye - arXiv preprint arXiv …, 2023 - arxiv.org
The goal of program synthesis, or code generation, is to generate executable code based
on given descriptions. Recently, there has been an increasing number of studies employing …