Yi Ma 0005
Person information
- affiliation: Tianjin University, College of Intelligence and Computing, China
Other persons with the same name
- Yi Ma — disambiguation page
- Yi Ma 0001 — University of California, Berkeley, CA, USA (and 4 more)
- Yi Ma 0002 — University of Surrey, Institute for Communication Systems, UK (and 1 more)
- Yi Ma 0003 — Zhejiang University of Technology, Institute of Process Equipment and Control Engineering, Hangzhou, China
- Yi Ma 0004 — First Institute of Oceanography, Ministry of Natural Resources, Qingdao, China (and 1 more)
2020 – today
- 2024
- [c17] Jinyi Liu, Yi Ma, Jianye Hao, Yujing Hu, Yan Zheng, Tangjie Lv, Changjie Fan:
A Trajectory Perspective on the Role of Data Sampling Techniques in Offline Reinforcement Learning. AAMAS 2024: 1229-1237
- [c16] Kai Zhao, Jianye Hao, Yi Ma, Jinyi Liu, Yan Zheng, Zhaopeng Meng:
ENOTO: Improving Offline-to-Online Reinforcement Learning with Q-Ensembles. AAMAS 2024: 2609-2611
- [c15] Yifu Yuan, Jianye Hao, Yi Ma, Zibin Dong, Hebin Liang, Jinyi Liu, Zhixin Feng, Kai Zhao, Yan Zheng:
Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback. ICLR 2024
- [c14] Jiashun Liu, Jianye Hao, Yi Ma, Shuyin Xia:
Unlock the Cognitive Generalization of Deep Reinforcement Learning via Granular Ball Representation. ICML 2024
- [c13] Yi Ma, Jianye Hao, Hebin Liang, Chenjun Xiao:
Rethinking Decision Transformer via Hierarchical Reinforcement Learning. ICML 2024
- [c12] Kai Zhao, Jianye Hao, Yi Ma, Jinyi Liu, Yan Zheng, Zhaopeng Meng:
ENOTO: Improving Offline-to-Online Reinforcement Learning with Q-Ensembles. IJCAI 2024: 5563-5571
- [i13] Yifu Yuan, Jianye Hao, Yi Ma, Zibin Dong, Hebin Liang, Jinyi Liu, Zhixin Feng, Kai Zhao, Yan Zheng:
Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback. CoRR abs/2402.02423 (2024)
- [i12] Zibin Dong, Yifu Yuan, Jianye Hao, Fei Ni, Yi Ma, Pengyi Li, Yan Zheng:
CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making. CoRR abs/2406.09509 (2024)
- 2023
- [c11] Hebin Liang, Yi Ma, Zilin Cao, Tianyang Liu, Fei Ni, Zhigang Li, Jianye Hao:
SplitNet: A Reinforcement Learning Based Sequence Splitting Method for the MinMax Multiple Travelling Salesman Problem. AAAI 2023: 8720-8727
- [c10] Hebin Liang, Zibin Dong, Yi Ma, Xiaotian Hao, Yan Zheng, Jianye Hao:
A Hierarchical Imitation Learning-based Decision Framework for Autonomous Driving. CIKM 2023: 4695-4701
- [c9] Yi Ma, Hongyao Tang, Dong Li, Zhaopeng Meng:
Reining Generalization in Offline Reinforcement Learning via Representation Distinction. NeurIPS 2023
- [i11] Xiaohan Hu, Yi Ma, Chenjun Xiao, Yan Zheng, Zhaopeng Meng:
In-Sample Policy Iteration for Offline Reinforcement Learning. CoRR abs/2306.05726 (2023)
- [i10] Shixi Lian, Yi Ma, Jinyi Liu, Yan Zheng, Zhaopeng Meng:
HIPODE: Enhancing Offline Reinforcement Learning with High-Quality Synthetic Data from a Policy-Decoupled Approach. CoRR abs/2306.06329 (2023)
- [i9] Kai Zhao, Yi Ma, Jinyi Liu, Yan Zheng, Zhaopeng Meng:
Ensemble-based Offline-to-Online Reinforcement Learning: From Pessimistic Learning to Optimistic Exploration. CoRR abs/2306.06871 (2023)
- [i8] Jinyi Liu, Yi Ma, Jianye Hao, Yujing Hu, Yan Zheng, Tangjie Lv, Changjie Fan:
Prioritized Trajectory Replay: A Replay Memory for Data-driven Reinforcement Learning. CoRR abs/2306.15503 (2023)
- [i7] Yi Ma, Chenjun Xiao, Hebin Liang, Jianye Hao:
Rethinking Decision Transformer via Hierarchical Reinforcement Learning. CoRR abs/2311.00267 (2023)
- 2022
- [c8] Tong Sang, Hongyao Tang, Yi Ma, Jianye Hao, Yan Zheng, Zhaopeng Meng, Boyan Li, Zhen Wang:
PAnDR: Fast Adaptation to New Environments from Offline Experiences via Decoupling Policy and Environment Representations. IJCAI 2022: 3416-3422
- [i6] Tong Sang, Hongyao Tang, Yi Ma, Jianye Hao, Yan Zheng, Zhaopeng Meng, Boyan Li, Zhen Wang:
PAnDR: Fast Adaptation to New Environments from Offline Experiences via Decoupling Policy and Environment Representations. CoRR abs/2204.02877 (2022)
- [i5] Chen Chen, Hongyao Tang, Yi Ma, Chao Wang, Qianli Shen, Dong Li, Jianye Hao:
State-Aware Proximal Pessimistic Algorithms for Offline Reinforcement Learning. CoRR abs/2211.15065 (2022)
- 2021
- [c7] Fei Ni, Jianye Hao, Jiawen Lu, Xialiang Tong, Mingxuan Yuan, Jiahui Duan, Yi Ma, Kun He:
A Multi-Graph Attributed Reinforcement Learning based Optimization Algorithm for Large-scale Hybrid Flow Shop Scheduling Problem. KDD 2021: 3441-3451
- [c6] Yi Ma, Xiaotian Hao, Jianye Hao, Jiawen Lu, Xing Liu, Xialiang Tong, Mingxuan Yuan, Zhigang Li, Jie Tang, Zhaopeng Meng:
A Hierarchical Reinforcement Learning Based Optimization Framework for Large-scale Dynamic Pickup and Delivery Problems. NeurIPS 2021: 23609-23620
- 2020
- [j1] Leilei Liu, Xianglei Zhu, Yi Ma, Haiyin Piao, Yaodong Yang, Xiaotian Hao, Yue Fu, Li Wang, Jiajie Peng:
Combining sequence and network information to enhance protein-protein interaction prediction. BMC Bioinform. 21-S(16): 537 (2020)
- [c5] Hanchao Wang, Hongyao Tang, Jianye Hao, Xiaotian Hao, Yue Fu, Yi Ma:
Large Scale Deep Reinforcement Learning in War-games. BIBM 2020: 1693-1699
- [c4] Xiaotian Hao, Zhaoqing Peng, Yi Ma, Guan Wang, Junqi Jin, Jianye Hao, Shan Chen, Rongquan Bai, Mingzhou Xie, Miao Xu, Zhenzhe Zheng, Chuan Yu, Han Li, Jian Xu, Kun Gai:
Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising. ICML 2020: 4060-4070
- [c3] Peng Zhang, Jianye Hao, Weixun Wang, Hongyao Tang, Yi Ma, Yihai Duan, Yan Zheng:
KoGuN: Accelerating Deep Reinforcement Learning via Integrating Human Suboptimal Knowledge. IJCAI 2020: 2291-2297
- [c2] Xiaotian Hao, Junqi Jin, Jianye Hao, Jin Li, Weixun Wang, Yi Ma, Zhenzhe Zheng, Han Li, Jian Xu, Kun Gai:
Learning to Accelerate Heuristic Searching for Large-Scale Maximum Weighted b-Matching Problems in Online Advertising. IJCAI 2020: 3437-3443
- [i4] Peng Zhang, Jianye Hao, Weixun Wang, Hongyao Tang, Yi Ma, Yihai Duan, Yan Zheng:
KoGuN: Accelerating Deep Reinforcement Learning via Integrating Human Suboptimal Knowledge. CoRR abs/2002.07418 (2020)
- [i3] Xiaotian Hao, Junqi Jin, Jianye Hao, Jin Li, Weixun Wang, Yi Ma, Zhenzhe Zheng, Han Li, Jian Xu, Kun Gai:
Learning to Accelerate Heuristic Searching for Large-Scale Maximum Weighted b-Matching Problems in Online Advertising. CoRR abs/2005.04355 (2020)
- [i2] Xiaotian Hao, Zhaoqing Peng, Yi Ma, Guan Wang, Junqi Jin, Jianye Hao, Shan Chen, Rongquan Bai, Mingzhou Xie, Miao Xu, Zhenzhe Zheng, Chuan Yu, Han Li, Jian Xu, Kun Gai:
Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising. CoRR abs/2006.16312 (2020)
2010 – 2019
- 2019
- [c1] Leilei Liu, Yi Ma, Xianglei Zhu, Yaodong Yang, Xiaotian Hao, Li Wang, Jiajie Peng:
Integrating Sequence and Network Information to Enhance Protein-Protein Interaction Prediction Using Graph Convolutional Networks. BIBM 2019: 1762-1768
- [i1] Yi Ma, Jianye Hao, Yaodong Yang, Han Li, Junqi Jin, Guangyong Chen:
Spectral-based Graph Convolutional Network for Directed Graphs. CoRR abs/1907.08990 (2019)
last updated on 2024-12-22 19:04 CET by the dblp team
all metadata released as open data under CC0 1.0 license