default search action
Chengdong Ma
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [i10]Yifan Zhong, Chengdong Ma, Xiaoyuan Zhang, Ziran Yang, Qingfu Zhang, Siyuan Qi, Yaodong Yang:
Panacea: Pareto Alignment via Preference Adaptation for LLMs. CoRR abs/2402.02030 (2024) - [i9]Zhaowei Zhang, Fengshuo Bai, Mingzhi Wang, Haoyang Ye, Chengdong Ma, Yaodong Yang:
Incentive Compatibility for AI Alignment in Sociotechnical Systems: Positions and Prospects. CoRR abs/2402.12907 (2024) - [i8]Jiesong Lian, Yucong Huang, Mingzhi Wang, Chengdong Ma, Yixue Hao, Ying Wen, Yaodong Yang:
Fusion-PSRO: Nash Policy Fusion for Policy Space Response Oracles. CoRR abs/2405.21027 (2024) - [i7]Ruize Zhang, Zelai Xu, Chengdong Ma, Chao Yu, Wei-Wei Tu, Shiyu Huang, Deheng Ye, Wenbo Ding, Yaodong Yang, Yu Wang:
A Survey on Self-play Methods in Reinforcement Learning. CoRR abs/2408.01072 (2024) - [i6]Wenzhe Fan, Zishun Yu, Chengdong Ma, Changye Li, Yaodong Yang, Xinhua Zhang:
Towards Efficient Collaboration via Graph Modeling in Reinforcement Learning. CoRR abs/2410.15841 (2024) - [i5]Mingzhi Wang, Chengdong Ma, Qizhi Chen, Linjian Meng, Yang Han, Jiancong Xiao, Zhaowei Zhang, Jing Huo, Weijie J. Su, Yaodong Yang:
Magnetic Preference Optimization: Achieving Last-iterate Convergence for Language Models Alignment. CoRR abs/2410.16714 (2024) - [i4]Yucong Huang, Jiesong Lian, Mingzhi Wang, Chengdong Ma, Ying Wen:
Conflux-PSRO: Effectively Leveraging Collective Advantages in Policy Space Response Oracles. CoRR abs/2410.22776 (2024) - [i3]Xiaohang Tang, Chiyuan Wang, Chengdong Ma, Ilija Bogunovic, Stephen McAleer, Yaodong Yang:
Sample-Efficient Regret-Minimizing Double Oracle in Extensive-Form Games. CoRR abs/2411.00954 (2024) - 2023
- [j1]Chengdong Ma, Jianan Liu, Saichao He, Wenjing Hong, Jia Shi:
Confrontation and Obstacle-Avoidance of Unmanned Vehicles Based on Progressive Reinforcement Learning. IEEE Access 11: 50398-50411 (2023) - [i2]Chengdong Ma, Ziran Yang, Minquan Gao, Hai Ci, Jun Gao, Xuehai Pan, Yaodong Yang:
Red Teaming Game: A Game-Theoretic Framework for Red Teaming Language Models. CoRR abs/2310.00322 (2023) - 2022
- [c2]Yali Du, Chengdong Ma, Yuchen Liu, Runji Lin, Hao Dong, Jun Wang, Yaodong Yang:
Scalable Model-based Policy Optimization for Decentralized Networked Systems. IROS 2022: 9019-9026 - [i1]Yali Du, Chengdong Ma, Yuchen Liu, Runji Lin, Hao Dong, Jun Wang, Yaodong Yang:
Fully Decentralized Model-based Policy Optimization for Networked Systems. CoRR abs/2207.06559 (2022)
2010 – 2019
- 2018
- [c1]Heshuai Shao, Ronglin Hu, Chengdong Ma:
Design of a Low-Power Cold Chain Logistics Internet of Things System. EIDWT 2018: 449-461
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-12-13 19:14 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint