Google Scholar

User profiles for Deheng Ye

Deheng Ye

Director of AI Applications, Tencent

Verified email at e.ntu.edu.sg

Cited by 2667

[PDF] aaai.org

Mastering complex control in moba games with deep reinforcement learning

D Ye, Z Liu, M Sun, B Shi, P Zhao, H Wu, H Yu… - Proceedings of the …, 2020 - ojs.aaai.org

We study the reinforcement learning problem of complex action control in the Multi-player
Online Battle Arena (MOBA) 1v1 games. This problem involves far more complicated state and …

Save Cite Cited by 392 Related articles All 9 versions View as HTML

[PDF] neurips.cc

Towards playing full moba games with deep reinforcement learning

D Ye, G Chen, W Zhang, S Chen… - Advances in …, 2020 - proceedings.neurips.cc

MOBA games, eg, Honor of Kings, League of Legends, and Dota 2, pose grand challenges
to AI systems such as multi-agent, enormous state-action space, complex action control, etc. …

Save Cite Cited by 235 Related articles All 7 versions View as HTML

[PDF] arxiv.org

Supervised learning achieves human-level performance in moba games: A case study of honor of kings

D Ye, G Chen, P Zhao, F Qiu, B Yuan… - IEEE transactions on …, 2020 - ieeexplore.ieee.org

We present JueWu-SL, the first supervised-learning-based artificial intelligence (AI)
program that achieves human-level performance in playing multiplayer online battle arena (MOBA) …

Save Cite Cited by 73 Related articles All 6 versions

[PDF] neurips.cc

Learning diverse policies in moba games via macro-goals

…, Z Lian, F Qiu, G Han, W Wang, D Ye… - Advances in …, 2021 - proceedings.neurips.cc

Recently, many researchers have made successful progress in building the AI systems for
MOBA-game-playing with deep reinforcement learning, such as on Dota 2 and Honor of Kings…

Save Cite Cited by 13 Related articles All 7 versions View as HTML

[PDF] sentic.net

Ensemble application of convolutional and recurrent neural networks for multi-label text categorization

G Chen, D Ye, Z Xing, J Chen… - 2017 International joint …, 2017 - ieeexplore.ieee.org

Text categorization, or text classification, is one of key tasks for representing the semantic
information of documents. Multi-label text categorization is finer-grained approach to text …

Save Cite Cited by 351 Related articles All 7 versions

More agents is all you need

J Li, Q Zhang, Y Yu, Q Fu, D Ye - arXiv preprint arXiv:2402.05120, 2024 - arxiv.org

We find that, simply via a sampling-and-voting method, the performance of large language
models (LLMs) scales with the number of agents instantiated. Also, this method, termed as …

Save Cite Cited by 118 Related articles All 2 versions Cached

[PDF] mlr.press

Minerl diamond 2021 competition: Overview, results, and lessons learned

…, N Topin, Z Lin, J Li, J Shi, D Ye… - NeurIPS 2021 …, 2022 - proceedings.mlr.press

Reinforcement learning competitions advance the field by providing appropriate scope and
support to develop solutions toward a specific problem. To promote the development of more …

Save Cite Cited by 34 Related articles All 6 versions View as HTML

[PDF] arxiv.org

A survey on transformers in reinforcement learning

W Li, H Luo, Z Lin, C Zhang, Z Lu, D Ye - arXiv preprint arXiv:2301.03044, 2023 - arxiv.org

Transformer has been considered the dominating neural architecture in NLP and CV, mostly
under supervised settings. Recently, a similar surge of using Transformers has appeared in …

Save Cite Cited by 87 Related articles All 6 versions View as HTML

[PDF] github.io

Predicting semantically linkable knowledge in developer online forums via convolutional neural network

B Xu, D Ye, Z Xing, X Xia, G Chen, S Li - Proceedings of the 31st IEEE …, 2016 - dl.acm.org

Consider a question and its answers in Stack Overflow as a knowledge unit. Knowledge
units often contain semantically relevant knowledge, and thus linkable for different purposes, …

Save Cite Cited by 186 Related articles All 7 versions

[PDF] arxiv.org

Rltf: Reinforcement learning from unit test feedback

…, Y Zhu, K Xiao, Q Fu, X Han, W Yang, D Ye - arXiv preprint arXiv …, 2023 - arxiv.org

The goal of program synthesis, or code generation, is to generate executable code based
on given descriptions. Recently, there has been an increasing number of studies employing …

Save Cite Cited by 63 Related articles All 3 versions View as HTML

Create alert

Cite

Advanced search

Saved to My library

User profiles for Deheng Ye

Deheng Ye

Mastering complex control in moba games with deep reinforcement learning

Towards playing full moba games with deep reinforcement learning

Supervised learning achieves human-level performance in moba games: A case study of honor of kings

Learning diverse policies in moba games via macro-goals

Ensemble application of convolutional and recurrent neural networks for multi-label text categorization

More agents is all you need

Minerl diamond 2021 competition: Overview, results, and lessons learned

A survey on transformers in reinforcement learning

Predicting semantically linkable knowledge in developer online forums via convolutional neural network

Rltf: Reinforcement learning from unit test feedback