default search action

combined dblp search
author search
venue search
publication search

ask others

Baihe Huang

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[c10]
- view
  - electronic edition @ aclanthology.org (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/emnlp/HuangSM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/HuangSM24
Baihe Huang, Hiteshi Sharma, Yi Mao:
Enhancing Language Model Alignment: A Confidence-Based Approach to Label Smoothing. EMNLP 2024: 21341-21352
[c9]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iclr/ZhuH024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/ZhuH024
Hanlin Zhu, Baihe Huang, Stuart Russell:
On Representation Complexity of Model-based and Model-free Reinforcement Learning. ICLR 2024
[i16]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2403-13893
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-13893
Charles Lu, Baihe Huang, Sai Praneeth Karimireddy, Praneeth Vepakomma, Michael I. Jordan, Ramesh Raskar:
Data Acquisition via Experimental Design for Decentralized Data Markets. CoRR abs/2403.13893 (2024)
[i15]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-04669
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-04669
Hanlin Zhu, Baihe Huang, Shaolun Zhang, Michael I. Jordan, Jiantao Jiao, Yuandong Tian, Stuart Russell:
Towards a Theoretical Understanding of the 'Reversal Curse' via Training Dynamics. CoRR abs/2405.04669 (2024)
[i14]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-19617
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-19617
Qian Yu, Yining Wang, Baihe Huang, Qi Lei, Jason D. Lee:
Stochastic Zeroth-Order Optimization under Strongly Convexity and Lipschitz Hessian: Minimax Sample Complexity. CoRR abs/2406.19617 (2024)
2023
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/siamjo/ZhanCHCLC23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/siamjo/ZhanCHCLC23
Wenhao Zhan, Shicong Cen, Baihe Huang, Yuxin Chen, Jason D. Lee, Yuejie Chi:
Policy Mirror Descent for Regularized Reinforcement Learning: A Generalized Framework with Linear Convergence. SIAM J. Optim. 33(2): 1061-1091 (2023)
[c8]
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/aistats/YuWHLL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aistats/YuWHLL23
Qian Yu, Yining Wang, Baihe Huang, Qi Lei, Jason D. Lee:
Optimal Sample Complexity Bounds for Non-convex Optimization under Kurdyka-Lojasiewicz Condition. AISTATS 2023: 6806-6821
[c7]
- view
  - electronic edition @ nips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/YuWHLL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/YuWHLL23
Qian Yu, Yining Wang, Baihe Huang, Qi Lei, Jason D. Lee:
Sample Complexity for Quadratic Bandits: Hessian Dependent Bounds and Optimal Algorithms. NeurIPS 2023
[i13]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-05592
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-05592
Baihe Huang, Sai Praneeth Karimireddy, Michael I. Jordan:
Evaluating and Incentivizing Diverse Data Contributions in Collaborative Learning. CoRR abs/2306.05592 (2023)
[i12]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-12383
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-12383
Qian Yu, Yining Wang, Baihe Huang, Qi Lei, Jason D. Lee:
Sample Complexity for Quadratic Bandits: Hessian Dependent Bounds and Optimal Algorithms. CoRR abs/2306.12383 (2023)
[i11]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-01706
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-01706
Hanlin Zhu, Baihe Huang, Stuart Russell:
On Representation Complexity of Model-based and Model-free Reinforcement Learning. CoRR abs/2310.01706 (2023)
[i10]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-07930
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-07930
Baihe Huang, Banghua Zhu, Hanlin Zhu, Jason D. Lee, Jiantao Jiao, Michael I. Jordan:
Towards Optimal Statistical Watermarking. CoRR abs/2312.07930 (2023)
2022
[c6]
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/colt/ZhanHHJL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/colt/ZhanHHJL22
Wenhao Zhan, Baihe Huang, Audrey Huang, Nan Jiang, Jason D. Lee:
Offline Reinforcement Learning with Realizability and Single-policy Concentrability. COLT 2022: 2730-2775
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/focs/HuangJ0T022
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/focs/HuangJ0T022
Baihe Huang, Shunhua Jiang, Zhao Song, Runzhou Tao, Ruizhe Zhang:
Solving SDP Faster: A Robust IPM Framework and Efficient Implementation. FOCS 2022: 233-244
[c4]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iclr/HuangLWY22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/HuangLWY22
Baihe Huang, Jason D. Lee, Zhaoran Wang, Zhuoran Yang:
Towards General Function Approximation in Zero-Sum Markov Games. ICLR 2022
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2202-04634
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-04634
Wenhao Zhan, Baihe Huang, Audrey Huang, Nan Jiang, Jason D. Lee:
Offline Reinforcement Learning with Realizability and Single-policy Concentrability. CoRR abs/2202.04634 (2022)
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2202-12329
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-12329
Baihe Huang, Zhao Song, Omri Weinstein, Hengjie Zhang, Ruizhe Zhang:
A Dynamic Fast Gaussian Transform. CoRR abs/2202.12329 (2022)
2021
[c3]
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/icml/HuangL0021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/HuangL0021
Baihe Huang, Xiaoxiao Li, Zhao Song, Xin Yang:
FL-NTK: A Neural Tangent Kernel-based Framework for Federated Learning Analysis. ICML 2021: 4423-4434
[c2]
- view
  - electronic edition @ neurips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/HuangHKLLWY21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/HuangHKLLWY21
Baihe Huang, Kaixuan Huang, Sham M. Kakade, Jason D. Lee, Qi Lei, Runzhe Wang, Jiaqi Yang:
Going Beyond Linear RL: Sample Efficient Neural Function Approximation. NeurIPS 2021: 8968-8983
[c1]
- view
  - electronic edition @ neurips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/HuangHKLLWY21a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/HuangHKLLWY21a
Baihe Huang, Kaixuan Huang, Sham M. Kakade, Jason D. Lee, Qi Lei, Runzhe Wang, Jiaqi Yang:
Optimal Gradient-based Algorithms for Non-concave Bandit Optimization. NeurIPS 2021: 29101-29115
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2101-08208
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2101-08208
Baihe Huang, Shunhua Jiang, Zhao Song, Runzhou Tao:
Solving Tall Dense SDPs in the Current Matrix Multiplication Time. CoRR abs/2101.08208 (2021)
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2105-05001
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2105-05001
Baihe Huang, Xiaoxiao Li, Zhao Song, Xin Yang:
FL-NTK: A Neural Tangent Kernel-based Framework for Federated Learning Convergence Analysis. CoRR abs/2105.05001 (2021)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2105-11066
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2105-11066
Wenhao Zhan, Shicong Cen, Baihe Huang, Yuxin Chen, Jason D. Lee, Yuejie Chi:
Policy Mirror Descent for Regularized Reinforcement Learning: A Generalized Framework with Linear Convergence. CoRR abs/2105.11066 (2021)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2107-04518
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2107-04518
Baihe Huang, Kaixuan Huang, Sham M. Kakade, Jason D. Lee, Qi Lei, Runzhe Wang, Jiaqi Yang:
Optimal Gradient-based Algorithms for Non-concave Bandit Optimization. CoRR abs/2107.04518 (2021)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2107-06466
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2107-06466
Baihe Huang, Kaixuan Huang, Sham M. Kakade, Jason D. Lee, Qi Lei, Runzhe Wang, Jiaqi Yang:
Going Beyond Linear RL: Sample Efficient Neural Function Approximation. CoRR abs/2107.06466 (2021)
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2107-14702
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2107-14702
Baihe Huang, Jason D. Lee, Zhaoran Wang, Zhuoran Yang:
Towards General Function Approximation in Zero-Sum Markov Games. CoRR abs/2107.14702 (2021)
2020
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2011-11877
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2011-11877
Baihe Huang, Zhao Song, Runzhou Tao, Ruizhe Zhang, Danyang Zhuo:
InstaHide's Sample Complexity When Mixing Two Private Images. CoRR abs/2011.11877 (2020)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.