default search action
BibTeX record journals/tmlr/CandoganWA0C25
@article{DBLP:journals/tmlr/CandoganWA0C25,
author = {Leyla Naz Candogan and
Yongtao Wu and
El{\'{\i}}as Abad{-}Rocamora and
Grigorios Chrysos and
Volkan Cevher},
title = {Single-pass Detection of Jailbreaking Input in Large Language Models},
journal = {Trans. Mach. Learn. Res.},
volume = {2025},
year = {2025},
url = {https://openreview.net/forum?id=42v6I5Ut9a},
timestamp = {Fri, 20 Jun 2025 14:19:48 +0200},
biburl = {https://dblp.org/rec/journals/tmlr/CandoganWA0C25.bib},
bibsource = {dblp computer science bibliography, https://dblp.org}
}
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.