Esther Derman

2020 – today

2024
[c7]
  persistent URL:
  - https://dblp.org/rec/conf/aaai/GadotDKELM24
Uri Gadot, Esther Derman, Navdeep Kumar, Maxence Mohamed Elfatihi, Kfir Levy, Shie Mannor:
Solving Non-rectangular Reward-Robust MDPs via Frequency Regularization. AAAI 2024: 21090-21098
[c6]
  persistent URL:
  - https://dblp.org/rec/conf/iclr/ValensiDMD24
David Valensi, Esther Derman, Shie Mannor, Gal Dalal:
Tree Search-Based Policy Optimization under Stochastic Execution Delay. ICLR 2024
[i10]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-05440
David Valensi, Esther Derman, Shie Mannor, Gal Dalal:
Tree Search-Based Policy Optimization under Stochastic Execution Delay. CoRR abs/2404.05440 (2024)
[i9]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-24128
Jia Lin Hau, Erick Delage, Esther Derman, Mohammad Ghavamzadeh, Marek Petrik:
Q-learning for Quantile MDPs: A Decomposition, Performance, and Convergence Analysis. CoRR abs/2410.24128 (2024)
2023
[c5]
  persistent URL:
  - https://dblp.org/rec/conf/nips/KumarDGLM23
Navdeep Kumar, Esther Derman, Matthieu Geist, Kfir Y. Levy, Shie Mannor:
Policy Gradient for Rectangular Robust Markov Decision Processes. NeurIPS 2023
[i8]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2301-13589
Navdeep Kumar, Esther Derman, Matthieu Geist, Kfir Levy, Shie Mannor:
Policy Gradient for s-Rectangular Robust Markov Decision Processes. CoRR abs/2301.13589 (2023)
[i7]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-06654
Esther Derman, Yevgeniy Men, Matthieu Geist, Shie Mannor:
Twice Regularized Markov Decision Processes: The Equivalence between Robustness and Regularization. CoRR abs/2303.06654 (2023)
[i6]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-01107
Uri Gadot, Esther Derman, Navdeep Kumar, Maxence Mohamed Elfatihi, Kfir Levy, Shie Mannor:
Solving Non-Rectangular Reward-Robust MDPs via Frequency Regularization. CoRR abs/2309.01107 (2023)
2021
[c4]
  persistent URL:
  - https://dblp.org/rec/conf/iclr/DermanDM21
Esther Derman, Gal Dalal, Shie Mannor:
Acting in Delayed Environments with Non-Stationary Markov Policies. ICLR 2021
[c3]
  persistent URL:
  - https://dblp.org/rec/conf/nips/DermanGM21
Esther Derman, Matthieu Geist, Shie Mannor:
Twice regularized MDPs and the equivalence between robustness and regularization. NeurIPS 2021: 22274-22287
[i5]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2101-11992
Esther Derman, Gal Dalal, Shie Mannor:
Acting in Delayed Environments with Non-Stationary Markov Policies. CoRR abs/2101.11992 (2021)
[i4]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-06267
Esther Derman, Matthieu Geist, Shie Mannor:
Twice regularized MDPs and the equivalence between robustness and regularization. CoRR abs/2110.06267 (2021)
2020
[i3]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2003-02894
Esther Derman, Shie Mannor:
Distributional Robustness and Regularization in Reinforcement Learning. CoRR abs/2003.02894 (2020)

2010 – 2019

2019
[c2]
  persistent URL:
  - https://dblp.org/rec/conf/uai/DermanMMM19
Esther Derman, Daniel J. Mankowitz, Timothy A. Mann, Shie Mannor:
A Bayesian Approach to Robust Reinforcement Learning. UAI 2019: 648-658
[i2]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1905-08188
Esther Derman, Daniel J. Mankowitz, Timothy A. Mann, Shie Mannor:
A Bayesian Approach to Robust Reinforcement Learning. CoRR abs/1905.08188 (2019)
2018
[c1]
  persistent URL:
  - https://dblp.org/rec/conf/uai/DermanMMM18
Esther Derman, Daniel J. Mankowitz, Timothy A. Mann, Shie Mannor:
Soft-Robust Actor-Critic Policy-Gradient. UAI 2018: 208-218
[i1]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1803-04848
Esther Derman, Daniel J. Mankowitz, Timothy A. Mann, Shie Mannor:
Soft-Robust Actor-Critic Policy-Gradient. CoRR abs/1803.04848 (2018)
