default search action
Hao Hu 0006
Person information
- affiliation: Tsinghua University, Beijing, China
Other persons with the same name
- Hao Hu — disambiguation page
- Hao Hu 0001 — Nanjing University, State Key Lab for Novel Software Technology, China
- Hao Hu 0002 — Huazhong University of Science and Technology, School of Electronic Information and Communications, Wuhan, China
- Hao Hu 0003 — Shanghai Jiao Tong University, Department of Transportation, Shipping and Logistics, China
- Hao Hu 0004 — University of Macau, State Key Laboratory of Quality Research in Chinese Medicine, Taipa, Macao
- Hao Hu 0005 — Zhengzhou Information Science Technology Institute, China
- Hao Hu 0007 — China Meteorological Administration, Beijing, China (and 2 more)
- Hao Hu 0008 — Institute of Software, Chinese Academy of Sciences, China (and 1 more)
- Hao Hu 0009 — Technical University of Denmark, DTU Fotonik, Lyngby, DK (and 1 more)
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c12]Yihuan Mao, Chengjie Wu, Xi Chen, Hao Hu, Ji Jiang, Tianze Zhou, Tangjie Lv, Changjie Fan, Zhipeng Hu, Yi Wu, Yujing Hu, Chongjie Zhang:
Stylized Offline Reinforcement Learning: Extracting Diverse High-Quality Behaviors from Heterogeneous Datasets. ICLR 2024 - [c11]Hao Hu, Yiqin Yang, Jianing Ye, Chengjie Wu, Ziqing Mai, Yujing Hu, Tangjie Lv, Changjie Fan, Qianchuan Zhao, Chongjie Zhang:
Bayesian Design Principles for Offline-to-Online Reinforcement Learning. ICML 2024 - [c10]Chengjie Wu, Hao Hu, Yiqin Yang, Ning Zhang, Chongjie Zhang:
Planning, Fast and Slow: Online Reinforcement Learning with Action-Free Offline Data via Multiscale Planners. ICML 2024 - [i10]Hao Hu, Yiqin Yang, Jianing Ye, Chengjie Wu, Ziqing Mai, Yujing Hu, Tangjie Lv, Changjie Fan, Qianchuan Zhao, Chongjie Zhang:
Bayesian Design Principles for Offline-to-Online Reinforcement Learning. CoRR abs/2405.20984 (2024) - 2023
- [c9]Yiqin Yang, Hao Hu, Wenzhe Li, Siyuan Li, Jun Yang, Qianchuan Zhao, Chongjie Zhang:
Flow to Control: Offline Reinforcement Learning with Lossless Primitive Discovery. AAAI 2023: 10843-10851 - [c8]Hao Hu, Yiqin Yang, Qianchuan Zhao, Chongjie Zhang:
The Provable Benefit of Unsupervised Data Sharing for Offline Reinforcement Learning. ICLR 2023 - [c7]Rui Yang, Lin Yong, Xiaoteng Ma, Hao Hu, Chongjie Zhang, Tong Zhang:
What is Essential for Unseen Goal Generalization of Offline Goal-conditioned RL? ICML 2023: 39543-39571 - [c6]Hao Hu, Yiqin Yang, Jianing Ye, Ziqing Mai, Chongjie Zhang:
Unsupervised Behavior Extraction via Random Intent Priors. NeurIPS 2023 - [i9]Hao Hu, Yiqin Yang, Qianchuan Zhao, Chongjie Zhang:
The Provable Benefits of Unsupervised Data Sharing for Offline Reinforcement Learning. CoRR abs/2302.13493 (2023) - [i8]Rui Yang, Yong Lin, Xiaoteng Ma, Hao Hu, Chongjie Zhang, Tong Zhang:
What is Essential for Unseen Goal Generalization of Offline Goal-conditioned RL? CoRR abs/2305.18882 (2023) - [i7]Hao Hu, Yiqin Yang, Jianing Ye, Ziqing Mai, Chongjie Zhang:
Unsupervised Behavior Extraction via Random Intent Priors. CoRR abs/2310.18687 (2023) - 2022
- [c5]Xiaoteng Ma, Yiqin Yang, Hao Hu, Jun Yang, Chongjie Zhang, Qianchuan Zhao, Bin Liang, Qihan Liu:
Offline Reinforcement Learning with Value-based Episodic Memory. ICLR 2022 - [c4]Hao Hu, Yiqin Yang, Qianchuan Zhao, Chongjie Zhang:
On the Role of Discount Factor in Offline Reinforcement Learning. ICML 2022: 9072-9098 - [i6]Hao Hu, Yiqin Yang, Qianchuan Zhao, Chongjie Zhang:
On the Role of Discount Factor in Offline Reinforcement Learning. CoRR abs/2206.03383 (2022) - [i5]Yiqin Yang, Hao Hu, Wenzhe Li, Siyuan Li, Jun Yang, Qianchuan Zhao, Chongjie Zhang:
Flow to Control: Offline Reinforcement Learning with Lossless Primitive Discovery. CoRR abs/2212.01105 (2022) - 2021
- [c3]Hao Hu, Jianing Ye, Guangxiang Zhu, Zhizhou Ren, Chongjie Zhang:
Generalizable Episodic Memory for Deep Reinforcement Learning. ICML 2021: 4380-4390 - [c2]Jin Zhang, Jianhao Wang, Hao Hu, Tong Chen, Yingfeng Chen, Changjie Fan, Chongjie Zhang:
MetaCURE: Meta Reinforcement Learning with Empowerment-Driven Exploration. ICML 2021: 12600-12610 - [c1]Zhizhou Ren, Guangxiang Zhu, Hao Hu, Beining Han, Jianglun Chen, Chongjie Zhang:
On the Estimation Bias in Double Q-Learning. NeurIPS 2021: 10246-10259 - [i4]Hao Hu, Jianing Ye, Zhizhou Ren, Guangxiang Zhu, Chongjie Zhang:
Generalizable Episodic Memory for Deep Reinforcement Learning. CoRR abs/2103.06469 (2021) - [i3]Zhizhou Ren, Guangxiang Zhu, Hao Hu, Beining Han, Jianglun Chen, Chongjie Zhang:
On the Estimation Bias in Double Q-Learning. CoRR abs/2109.14419 (2021) - [i2]Xiaoteng Ma, Yiqin Yang, Hao Hu, Qihan Liu, Jun Yang, Chongjie Zhang, Qianchuan Zhao, Bin Liang:
Offline Reinforcement Learning with Value-based Episodic Memory. CoRR abs/2110.09796 (2021) - 2020
- [i1]Jin Zhang, Jianhao Wang, Hao Hu, Yingfeng Chen, Changjie Fan, Chongjie Zhang:
Learn to Effectively Explore in Context-Based Meta-RL. CoRR abs/2006.08170 (2020)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-25 21:15 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint