default search action
Akifumi Wachi
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c13]Akifumi Wachi, Wataru Hashimoto, Kazumune Hashimoto:
Long-Term Safe Reinforcement Learning with Binary Feedback. AAAI 2024: 21656-21663 - [c12]Xun Shen, Akifumi Wachi, Wataru Hashimoto, Kazumune Hashimoto, Shigemasa Takai:
Safe Reinforcement Learning Using Model Predictive Control with Probabilistic Control Barrier Function. ACC 2024: 74-79 - [c11]Akifumi Wachi, Xun Shen, Yanan Sui:
A Survey of Constraint Formulations in Safe Reinforcement Learning. IJCAI 2024: 8262-8271 - [i14]Akifumi Wachi, Wataru Hashimoto, Kazumune Hashimoto:
Long-term Safe Reinforcement Learning with Binary Feedback. CoRR abs/2401.03786 (2024) - [i13]Akifumi Wachi, Xun Shen, Yanan Sui:
A Survey of Constraint Formulations in Safe Reinforcement Learning. CoRR abs/2402.02025 (2024) - [i12]Akifumi Wachi, Thien Q. Tran, Rei Sato, Takumi Tanabe, Yohei Akimoto:
Stepwise Alignment for Constrained Language Model Policy Optimization. CoRR abs/2404.11049 (2024) - 2023
- [c10]Akifumi Wachi, Wataru Hashimoto, Xun Shen, Kazumune Hashimoto:
Safe Exploration in Reinforcement Learning: A Generalized Formulation and Algorithms. NeurIPS 2023 - [i11]Wataru Hashimoto, Kazumune Hashimoto, Akifumi Wachi, Xun Shen, Masako Kishida, Shigemasa Takai:
Bayesian Meta-Learning on Control Barrier Functions with Data from On-Board Sensors. CoRR abs/2308.05306 (2023) - [i10]Akifumi Wachi, Wataru Hashimoto, Xun Shen, Kazumune Hashimoto:
Safe Exploration in Reinforcement Learning: A Generalized Formulation and Algorithms. CoRR abs/2310.03225 (2023) - [i9]Keita Saito, Akifumi Wachi, Koki Wataoka, Youhei Akimoto:
Verbosity Bias in Preference Labeling by Large Language Models. CoRR abs/2310.10076 (2023) - 2021
- [c9]Daiki Kimura, Subhajit Chaudhury, Masaki Ono, Michiaki Tatsubori, Don Joven Agravante, Asim Munawar, Akifumi Wachi, Ryosuke Kohita, Alexander Gray:
LOA: Logical Optimal Actions for Text-based Interaction Games. ACL (demo) 2021: 227-231 - [c8]Ryosuke Kohita, Akifumi Wachi, Daiki Kimura, Subhajit Chaudhury, Michiaki Tatsubori, Asim Munawar:
Language-based General Action Template for Reinforcement Learning Agents. ACL/IJCNLP (Findings) 2021: 2125-2139 - [c7]Ran Iwamoto, Ryosuke Kohita, Akifumi Wachi:
Polar Embedding. CoNLL 2021: 470-480 - [c6]Daiki Kimura, Masaki Ono, Subhajit Chaudhury, Ryosuke Kohita, Akifumi Wachi, Don Joven Agravante, Michiaki Tatsubori, Asim Munawar, Alexander Gray:
Neuro-Symbolic Reinforcement Learning with First-Order Logic. EMNLP (1) 2021: 3505-3511 - [c5]Akifumi Wachi, Yunyue Wei, Yanan Sui:
Safe Policy Optimization with Local Generalized Linear Function Approximations. NeurIPS 2021: 20759-20771 - [i8]Daiki Kimura, Subhajit Chaudhury, Akifumi Wachi, Ryosuke Kohita, Asim Munawar, Michiaki Tatsubori, Alexander Gray:
Reinforcement Learning with External Knowledge by using Logical Neural Networks. CoRR abs/2103.02363 (2021) - [i7]Daiki Kimura, Masaki Ono, Subhajit Chaudhury, Ryosuke Kohita, Akifumi Wachi, Don Joven Agravante, Michiaki Tatsubori, Asim Munawar, Alexander Gray:
Neuro-Symbolic Reinforcement Learning with First-Order Logic. CoRR abs/2110.10963 (2021) - [i6]Daiki Kimura, Subhajit Chaudhury, Masaki Ono, Michiaki Tatsubori, Don Joven Agravante, Asim Munawar, Akifumi Wachi, Ryosuke Kohita, Alexander Gray:
LOA: Logical Optimal Actions for Text-based Interaction Games. CoRR abs/2110.10973 (2021) - [i5]Akifumi Wachi, Yunyue Wei, Yanan Sui:
Safe Policy Optimization with Local Generalized Linear Function Approximations. CoRR abs/2111.04894 (2021) - 2020
- [c4]Ryosuke Kohita, Akifumi Wachi, Yang Zhao, Ryuki Tachibana:
Q-learning with Language Model for Edit-based Unsupervised Summarization. EMNLP (1) 2020: 470-484 - [c3]Akifumi Wachi, Yanan Sui:
Safe Reinforcement Learning in Constrained Markov Decision Processes. ICML 2020: 9797-9806 - [i4]Akifumi Wachi, Yanan Sui:
Safe Reinforcement Learning in Constrained Markov Decision Processes. CoRR abs/2008.06626 (2020) - [i3]Ryosuke Kohita, Akifumi Wachi, Yang Zhao, Ryuki Tachibana:
Q-learning with Language Model for Edit-based Unsupervised Summarization. CoRR abs/2010.04379 (2020)
2010 – 2019
- 2019
- [c2]Akifumi Wachi:
Failure-Scenario Maker for Rule-Based Agent using Multi-agent Adversarial Reinforcement Learning and its Application to Autonomous Driving. IJCAI 2019: 6006-6012 - [i2]Akifumi Wachi:
Failure-Scenario Maker for Rule-Based Agent using Multi-agent Adversarial Reinforcement Learning and its Application to Autonomous Driving. CoRR abs/1903.10654 (2019) - 2018
- [c1]Akifumi Wachi, Yanan Sui, Yisong Yue, Masahiro Ono:
Safe Exploration and Optimization of Constrained MDPs Using Gaussian Processes. AAAI 2018: 6548-6556 - [i1]Akifumi Wachi, Hiroshi Kajino, Asim Munawar:
Safe Exploration in Markov Decision Processes with Time-Variant Safety using Spatio-Temporal Gaussian Process. CoRR abs/1809.04232 (2018)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-21 20:30 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint