default search action
Weichao Mao
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2025
- [j4]Weichao Mao
, Kaiqing Zhang
, Ruihao Zhu
, David Simchi-Levi
, Tamer Basar
:
Model-Free Nonstationary Reinforcement Learning: Near-Optimal Regret and Applications in Multiagent Reinforcement Learning and Inventory Control. Manag. Sci. 71(2): 1564-1580 (2025) - [i11]Zhihui Xie, Jie Chen, Liyu Chen, Weichao Mao, Jingjing Xu, Lingpeng Kong:
Teaching Language Models to Critique via Reinforcement Learning. CoRR abs/2502.03492 (2025) - 2024
- [j3]Zhenzhe Zheng
, Weichao Mao
, Yidan Xing
, Fan Wu
:
On Designing Market Model and Pricing Mechanisms for IoT Data Exchange. IEEE Trans. Mob. Comput. 23(11): 10202-10218 (2024) - [c20]Haoran Qiu, Weichao Mao, Chen Wang, Saurabh Jha, Hubertus Franke, Chandra Narayanaswami, Zbigniew Kalbarczyk, Tamer Basar, Ravishankar K. Iyer:
When Green Computing Meets Performance and Resilience SLOs. DSN-S 2024: 17-22 - [c19]Xiangyuan Zhang, Weichao Mao, Saviz Mowlavi, Mouhacine Benosman, Tamer Basar:
Controlgym: Large-scale control environments for benchmarking reinforcement learning algorithms. L4DC 2024: 181-196 - [c18]Weichao Mao, Haoran Qiu, Chen Wang, Hubertus Franke, Zbigniew Kalbarczyk, Tamer Basar:
$\widetilde{O}(T^{-1})$ {C}onvergence to (coarse) correlated equilibria in full-information general-sum markov games. L4DC 2024: 361-374 - [c17]Haoran Qiu, Weichao Mao, Archit Patke, Shengkun Cui, Chen Wang, Hubertus Franke, Zbigniew Kalbarczyk, Tamer Basar, Ravi K. Iyer:
FLASH: Fast Model Adaptation in ML-Centric Cloud Platforms. MLSys 2024 - [c16]Haoran Qiu, Weichao Mao, Archit Patke, Shengkun Cui, Saurabh Jha, Chen Wang, Hubertus Franke, Zbigniew Kalbarczyk, Tamer Basar, Ravishankar K. Iyer:
Power-aware Deep Learning Model Serving with μ-Serve. USENIX ATC 2024: 75-93 - [i10]Weichao Mao, Haoran Qiu, Chen Wang, Hubertus Franke, Zbigniew Kalbarczyk, Tamer Basar:
Õ(T-1) Convergence to (Coarse) Correlated Equilibria in Full-Information General-Sum Markov Games. CoRR abs/2403.07890 (2024) - [i9]Xiangyuan Zhang, Weichao Mao, Haoran Qiu, Tamer Basar:
Decision Transformer as a Foundation Model for Partially Observable Continuous Control. CoRR abs/2404.02407 (2024) - [i8]Haoran Qiu, Weichao Mao, Archit Patke, Shengkun Cui, Saurabh Jha, Chen Wang, Hubertus Franke, Zbigniew T. Kalbarczyk, Tamer Basar, Ravishankar K. Iyer:
Efficient Interactive LLM Serving with Proxy Model-based Sequence Length Prediction. CoRR abs/2404.08509 (2024) - 2023
- [j2]Weichao Mao
, Tamer Basar:
Provably Efficient Reinforcement Learning in Decentralized General-Sum Markov Games. Dyn. Games Appl. 13(1): 165-186 (2023) - [c15]Weichao Mao, Haoran Qiu, Chen Wang, Hubertus Franke, Zbigniew Kalbarczyk, Ravishankar K. Iyer, Tamer Basar:
Multi-Agent Meta-Reinforcement Learning: Sharper Convergence Rates with Task Similarity. NeurIPS 2023 - [c14]Haoran Qiu, Weichao Mao, Chen Wang, Hubertus Franke, Alaa Youssef, Zbigniew T. Kalbarczyk, Tamer Basar, Ravishankar K. Iyer:
AWARE: Automate Workload Autoscaling with Reinforcement Learning in Production Cloud Systems. USENIX ATC 2023: 387-402 - [i7]Weichao Mao, Ruta Desai, Michael Louis Iuzzolino, Nitin Kamra:
Action Dynamics Task Graphs for Learning Plannable Representations of Procedural Tasks. CoRR abs/2302.05330 (2023) - [i6]Xiangyuan Zhang, Weichao Mao, Saviz Mowlavi, Mouhacine Benosman, Tamer Basar:
Controlgym: Large-Scale Safety-Critical Control Environments for Benchmarking Reinforcement Learning Algorithms. CoRR abs/2311.18736 (2023) - 2022
- [c13]Haoran Qiu
, Weichao Mao, Archit Patke, Chen Wang
, Hubertus Franke, Zbigniew T. Kalbarczyk, Tamer Basar, Ravishankar K. Iyer:
SIMPPO: a scalable and incremental online learning framework for serverless resource management. SoCC 2022: 306-322 - [c12]Haoran Qiu
, Weichao Mao, Archit Patke, Chen Wang
, Hubertus Franke, Zbigniew T. Kalbarczyk, Tamer Basar, Ravishankar K. Iyer:
Reinforcement learning for resource management in multi-tenant serverless platforms. EuroMLSys@EuroSys 2022: 20-28 - [c11]Weichao Mao, Lin Yang
, Kaiqing Zhang, Tamer Basar:
On Improving Model-Free Algorithms for Decentralized Multi-Agent Reinforcement Learning. ICML 2022: 15007-15049 - [c10]Weichao Mao, Haoran Qiu, Chen Wang, Hubertus Franke, Zbigniew Kalbarczyk, Ravishankar K. Iyer, Tamer Basar:
A Mean-Field Game Approach to Cloud Resource Management with Function Approximation. NeurIPS 2022 - 2021
- [c9]Sujay Bhatt, Weichao Mao, Alec Koppel, Tamer Basar:
Semiparametric Information State Embedding for Policy Search under Imperfect Information. CDC 2021: 4501-4506 - [c8]Weichao Mao, Kaiqing Zhang, Ruihao Zhu, David Simchi-Levi, Tamer Basar:
Near-Optimal Model-Free Reinforcement Learning in Non-Stationary Episodic MDPs. ICML 2021: 7447-7458 - [i5]Weichao Mao, Tamer Basar:
Provably Efficient Reinforcement Learning in Decentralized General-Sum Markov Games. CoRR abs/2110.05682 (2021) - [i4]Weichao Mao, Tamer Basar, Lin F. Yang, Kaiqing Zhang:
Decentralized Cooperative Multi-Agent Reinforcement Learning with Exploration. CoRR abs/2110.05707 (2021) - 2020
- [c7]Weichao Mao, Kaiqing Zhang, Erik Miehling, Tamer Basar:
Information State Embedding in Partially Observable Cooperative Multi-Agent Reinforcement Learning. CDC 2020: 6124-6131 - [c6]Weichao Mao, Kaiqing Zhang, Qiaomin Xie, Tamer Basar:
POLY-HOOT: Monte-Carlo Planning in Continuous Space MDPs with Non-Asymptotic Analysis. NeurIPS 2020 - [i3]Weichao Mao, Kaiqing Zhang, Erik Miehling, Tamer Basar:
Information State Embedding in Partially Observable Cooperative Multi-Agent Reinforcement Learning. CoRR abs/2004.01098 (2020) - [i2]Weichao Mao, Kaiqing Zhang, Qiaomin Xie, Tamer Basar:
POLY-HOOT: Monte-Carlo Planning in Continuous Space MDPs with Non-Asymptotic Analysis. CoRR abs/2006.04672 (2020) - [i1]Weichao Mao, Kaiqing Zhang, Ruihao Zhu, David Simchi-Levi, Tamer Basar:
Near-Optimal Regret Bounds for Model-Free RL in Non-Stationary Episodic MDPs. CoRR abs/2010.03161 (2020)
2010 – 2019
- 2019
- [j1]Shiyou Qian
, Jian Cao, Weichao Mao, Yanmin Zhu
, Jiadi Yu, Minglu Li, Jie Wang:
A fast and anti-matchability matching algorithm for content-based publish/subscribe systems. Comput. Networks 149: 213-225 (2019) - [c5]Zhenzhe Zheng, Weichao Mao, Fan Wu, Guihai Chen
:
Challenges and Opportunities in IoT Data Markets. SocialSens@CPSIoTWeek 2019: 1-2 - [c4]Weichao Mao, Zhenzhe Zheng, Fan Wu:
Pricing for Revenue Maximization in IoT Data Markets: An Information Design Perspective. INFOCOM 2019: 1837-1845 - [c3]Shiyou Qian, Weichao Mao, Jian Cao, Frederic Le Mouel, Minglu Li:
Adjusting Matching Algorithm to Adapt to Workload Fluctuations in Content-based Publish/Subscribe Systems. INFOCOM 2019: 1936-1944 - 2018
- [c2]Weichao Mao, Zhenzhe Zheng, Fan Wu, Guihai Chen
:
Online Pricing for Revenue Maximization with Unknown Time Discounting Valuations. IJCAI 2018: 440-446 - [c1]Weichao Mao, Jian Cao, Guangtao Xue, Jiadi Yu, Yanmin Zhu, Minglu Li, Wenjuan Li, Shiyou Qian:
Adjusting Matching Algorithm to Adapt to Dynamic Subscriptions in Content-Based Publish/Subscribe Systems. ISPA/IUCC/BDCloud/SocialCom/SustainCom 2018: 369-376
Coauthor Index
aka: Zbigniew Kalbarczyk
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from ,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2025-04-09 21:22 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint