default search action

combined dblp search
author search
venue search
publication search

ask others

Akifumi Wachi

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2025
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/tcyb/HashimotoOWS25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tcyb/HashimotoOWS25
Kazumune Hashimoto, Yuga Onoue, Akifumi Wachi, Xun Shen:
Learning-Based Event-Triggered MPC With Gaussian Processes Under Terminal Constraints. IEEE Trans. Cybern. 55(4): 1512-1525 (2025)
[i17]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2502-02153
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2502-02153
Thien Q. Tran, Akifumi Wachi, Rei Sato, Takumi Tanabe, Youhei Akimoto:
Vulnerability Mitigation for Safety-Aligned Language Models via Debiasing. CoRR abs/2502.02153 (2025)
[i16]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2503-02311
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2503-02311
Kensuke Tatematsu, Akifumi Wachi:
Target Return Optimizer for Multi-Game Decision Transformer. CoRR abs/2503.02311 (2025)
2024
[j1]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ar/HashimotoHWSKT24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ar/HashimotoHWSKT24
Wataru Hashimoto, Kazumune Hashimoto, Akifumi Wachi, Xun Shen, Masako Kishida, Shigemasa Takai:
Data-efficient safe learning and control with on-board sensors: Bayesian meta-learning and barrier function based approach. Adv. Robotics 38(21): 1501-1514 (2024)
[c15]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/WachiHH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/WachiHH24
Akifumi Wachi, Wataru Hashimoto, Kazumune Hashimoto:
Long-Term Safe Reinforcement Learning with Binary Feedback. AAAI 2024: 21656-21663
[c14]
- view
  authority control:
- export record
  dblp key:
  - conf/amcc/ShenWHHT24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/amcc/ShenWHHT24
Xun Shen, Akifumi Wachi, Wataru Hashimoto, Kazumune Hashimoto, Shigemasa Takai:
Safe Reinforcement Learning Using Model Predictive Control with Probabilistic Control Barrier Function. ACC 2024: 74-79
[c13]
- view
  - electronic edition @ ijcai.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/ijcai/WachiSS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/WachiSS24
Akifumi Wachi, Xun Shen, Yanan Sui:
A Survey of Constraint Formulations in Safe Reinforcement Learning. IJCAI 2024: 8262-8271
[c12]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/ShenJWHG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/ShenJWHG24
Xun Shen, Shuo Jiang, Akifumi Wachi, Kazumune Hashimoto, Sebastien Gros:
Flipping-based Policy for Chance-Constrained Markov Decision Processes. NeurIPS 2024
[c11]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/WachiTSTA24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/WachiTSTA24
Akifumi Wachi, Thien Q. Tran, Rei Sato, Takumi Tanabe, Youhei Akimoto:
Stepwise Alignment for Constrained Language Model Policy Optimization. NeurIPS 2024
[i15]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2401-03786
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-03786
Akifumi Wachi, Wataru Hashimoto, Kazumune Hashimoto:
Long-term Safe Reinforcement Learning with Binary Feedback. CoRR abs/2401.03786 (2024)
[i14]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-02025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-02025
Akifumi Wachi, Xun Shen, Yanan Sui:
A Survey of Constraint Formulations in Safe Reinforcement Learning. CoRR abs/2402.02025 (2024)
[i13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-11049
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-11049
Akifumi Wachi, Thien Q. Tran, Rei Sato, Takumi Tanabe, Yohei Akimoto:
Stepwise Alignment for Constrained Language Model Policy Optimization. CoRR abs/2404.11049 (2024)
[i12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-06474
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-06474
Xun Shen, Shuo Jiang, Akifumi Wachi, Kazumune Hashimoto, Sebastien Gros:
Flipping-based Policy for Chance-Constrained Markov Decision Processes. CoRR abs/2410.06474 (2024)
2023
[c10]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/WachiHSH23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/WachiHSH23
Akifumi Wachi, Wataru Hashimoto, Xun Shen, Kazumune Hashimoto:
Safe Exploration in Reinforcement Learning: A Generalized Formulation and Algorithms. NeurIPS 2023
[i11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2308-05306
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-05306
Wataru Hashimoto, Kazumune Hashimoto, Akifumi Wachi, Xun Shen, Masako Kishida, Shigemasa Takai:
Bayesian Meta-Learning on Control Barrier Functions with Data from On-Board Sensors. CoRR abs/2308.05306 (2023)
[i10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-03225
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-03225
Akifumi Wachi, Wataru Hashimoto, Xun Shen, Kazumune Hashimoto:
Safe Exploration in Reinforcement Learning: A Generalized Formulation and Algorithms. CoRR abs/2310.03225 (2023)
[i9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-10076
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-10076
Keita Saito, Akifumi Wachi, Koki Wataoka, Youhei Akimoto:
Verbosity Bias in Preference Labeling by Large Language Models. CoRR abs/2310.10076 (2023)
2021
[c9]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/KimuraCOTAMWKG21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/KimuraCOTAMWKG21
Daiki Kimura, Subhajit Chaudhury, Masaki Ono, Michiaki Tatsubori, Don Joven Agravante, Asim Munawar, Akifumi Wachi, Ryosuke Kohita, Alexander Gray:
LOA: Logical Optimal Actions for Text-based Interaction Games. ACL (demo) 2021: 227-231
[c8]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/KohitaWKCTM21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/KohitaWKCTM21
Ryosuke Kohita, Akifumi Wachi, Daiki Kimura, Subhajit Chaudhury, Michiaki Tatsubori, Asim Munawar:
Language-based General Action Template for Reinforcement Learning Agents. ACL/IJCNLP (Findings) 2021: 2125-2139
[c7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/conll/IwamotoKW21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/conll/IwamotoKW21
Ran Iwamoto, Ryosuke Kohita, Akifumi Wachi:
Polar Embedding. CoNLL 2021: 470-480
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/KimuraOCKWATMG21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/KimuraOCKWATMG21
Daiki Kimura, Masaki Ono, Subhajit Chaudhury, Ryosuke Kohita, Akifumi Wachi, Don Joven Agravante, Michiaki Tatsubori, Asim Munawar, Alexander Gray:
Neuro-Symbolic Reinforcement Learning with First-Order Logic. EMNLP (1) 2021: 3505-3511
[c5]
- view
  - electronic edition @ neurips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/WachiWS21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/WachiWS21
Akifumi Wachi, Yunyue Wei, Yanan Sui:
Safe Policy Optimization with Local Generalized Linear Function Approximations. NeurIPS 2021: 20759-20771
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2103-02363
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2103-02363
Daiki Kimura, Subhajit Chaudhury, Akifumi Wachi, Ryosuke Kohita, Asim Munawar, Michiaki Tatsubori, Alexander Gray:
Reinforcement Learning with External Knowledge by using Logical Neural Networks. CoRR abs/2103.02363 (2021)
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-10963
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-10963
Daiki Kimura, Masaki Ono, Subhajit Chaudhury, Ryosuke Kohita, Akifumi Wachi, Don Joven Agravante, Michiaki Tatsubori, Asim Munawar, Alexander Gray:
Neuro-Symbolic Reinforcement Learning with First-Order Logic. CoRR abs/2110.10963 (2021)
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-10973
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-10973
Daiki Kimura, Subhajit Chaudhury, Masaki Ono, Michiaki Tatsubori, Don Joven Agravante, Asim Munawar, Akifumi Wachi, Ryosuke Kohita, Alexander Gray:
LOA: Logical Optimal Actions for Text-based Interaction Games. CoRR abs/2110.10973 (2021)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2111-04894
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2111-04894
Akifumi Wachi, Yunyue Wei, Yanan Sui:
Safe Policy Optimization with Local Generalized Linear Function Approximations. CoRR abs/2111.04894 (2021)
2020
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/KohitaWZT20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/KohitaWZT20
Ryosuke Kohita, Akifumi Wachi, Yang Zhao, Ryuki Tachibana:
Q-learning with Language Model for Edit-based Unsupervised Summarization. EMNLP (1) 2020: 470-484
[c3]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/WachiS20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/WachiS20
Akifumi Wachi, Yanan Sui:
Safe Reinforcement Learning in Constrained Markov Decision Processes. ICML 2020: 9797-9806
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2008-06626
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2008-06626
Akifumi Wachi, Yanan Sui:
Safe Reinforcement Learning in Constrained Markov Decision Processes. CoRR abs/2008.06626 (2020)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-04379
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-04379
Ryosuke Kohita, Akifumi Wachi, Yang Zhao, Ryuki Tachibana:
Q-learning with Language Model for Edit-based Unsupervised Summarization. CoRR abs/2010.04379 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[c2]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/ijcai/Wachi19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/Wachi19
Akifumi Wachi:
Failure-Scenario Maker for Rule-Based Agent using Multi-agent Adversarial Reinforcement Learning and its Application to Autonomous Driving. IJCAI 2019: 6006-6012
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1903-10654
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1903-10654
Akifumi Wachi:
Failure-Scenario Maker for Rule-Based Agent using Multi-agent Adversarial Reinforcement Learning and its Application to Autonomous Driving. CoRR abs/1903.10654 (2019)
2018
[c1]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/WachiSYO18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/WachiSYO18
Akifumi Wachi, Yanan Sui, Yisong Yue, Masahiro Ono:
Safe Exploration and Optimization of Constrained MDPs Using Gaussian Processes. AAAI 2018: 6548-6556
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1809-04232
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1809-04232
Akifumi Wachi, Hiroshi Kajino, Asim Munawar:
Safe Exploration in Markov Decision Processes with Time-Variant Safety using Spatio-Temporal Gaussian Process. CoRR abs/1809.04232 (2018)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.