default search action

combined dblp search
author search
venue search
publication search

ask others

Jacob Steinhardt

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[c63]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/DunlapZWZDSGY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/DunlapZWZDSGY24
Lisa Dunlap, Yuhui Zhang, Xiaohan Wang, Ruiqi Zhong, Trevor Darrell, Jacob Steinhardt, Joseph E. Gonzalez, Serena Yeung-Levy:
Describing Differences in Image Sets with Natural Language. CVPR 2024: 24199-24208
[c62]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/FengS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/FengS24
Jiahai Feng, Jacob Steinhardt:
How do Language Models Bind Entities in Context? ICLR 2024
[c61]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/GandelsmanES24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/GandelsmanES24
Yossi Gandelsman, Alexei A. Efros, Jacob Steinhardt:
Interpreting CLIP's Image Representation via Text-Based Decomposition. ICLR 2024
[c60]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/HalawiDS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/HalawiDS24
Danny Halawi, Jean-Stanislas Denain, Jacob Steinhardt:
Overthinking the Truth: Understanding how Language Models Process False Demonstrations. ICLR 2024
[c59]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/ChenZRZ0SYM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/ChenZRZ0SYM24
Yanda Chen, Ruiqi Zhong, Narutatsu Ri, Chen Zhao, He He, Jacob Steinhardt, Zhou Yu, Kathleen R. McKeown:
Do Models Explain Themselves? Counterfactual Simulatability of Natural Language Explanations. ICML 2024
[c58]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/Halawi0WWHS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/Halawi0WWHS24
Danny Halawi, Alexander Wei, Eric Wallace, Tony Tong Wang, Nika Haghtalab, Jacob Steinhardt:
Covert Malicious Finetuning: Challenges in Safeguarding LLM Adaptation. ICML 2024
[c57]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/PanJJS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/PanJJS24
Alexander Pan, Erik Jones, Meena Jagadeesan, Jacob Steinhardt:
Feedback Loops With Language Models Drive In-Context Reward Hacking. ICML 2024
[i82]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-06627
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-06627
Alexander Pan, Erik Jones, Meena Jagadeesan, Jacob Steinhardt:
Feedback Loops With Language Models Drive In-Context Reward Hacking. CoRR abs/2402.06627 (2024)
[i81]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-18563
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-18563
Danny Halawi, Fred Zhang, Chen Yueh-Han, Jacob Steinhardt:
Approaching Human-Level Forecasting with Language Models. CoRR abs/2402.18563 (2024)
[i80]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-04341
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-04341
Yossi Gandelsman, Alexei A. Efros, Jacob Steinhardt:
Interpreting the Second-Order Effects of Neurons in CLIP. CoRR abs/2406.04341 (2024)
[i79]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-14595
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-14595
Erik Jones, Anca D. Dragan, Jacob Steinhardt:
Adversaries Can Misuse Combinations of Safe Models. CoRR abs/2406.14595 (2024)
[i78]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-19501
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-19501
Jiahai Feng, Stuart Russell, Jacob Steinhardt:
Monitoring Latent World States in Language Models with Propositional Probes. CoRR abs/2406.19501 (2024)
[i77]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-20053
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-20053
Danny Halawi, Alexander Wei, Eric Wallace, Tony T. Wang, Nika Haghtalab, Jacob Steinhardt:
Covert Malicious Finetuning: Challenges in Safeguarding LLM Adaptation. CoRR abs/2406.20053 (2024)
[i76]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-03734
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-03734
Meena Jagadeesan, Michael I. Jordan, Jacob Steinhardt:
Safety vs. Performance: How Multi-Objective Learning Reduces Barriers to Market Entry. CoRR abs/2409.03734 (2024)
[i75]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-08466
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-08466
Ruiqi Zhong, Heng Wang, Dan Klein, Jacob Steinhardt:
Explaining Datasets in Words: Statistical Models with Natural Language Parameters. CoRR abs/2409.08466 (2024)
[i74]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-12822
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-12822
Jiaxin Wen, Ruiqi Zhong, Akbir Khan, Ethan Perez, Jacob Steinhardt, Minlie Huang, Samuel R. Bowman, He He, Shi Feng:
Language Models Learn to Mislead Humans via RLHF. CoRR abs/2409.12822 (2024)
[i73]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-12851
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-12851
Lisa Dunlap, Krishna Mandal, Trevor Darrell, Jacob Steinhardt, Joseph E. Gonzalez:
VibeCheck: Discover and Quantify Qualitative Differences in Large Language Models. CoRR abs/2410.12851 (2024)
2023
[j9]
- view
  authority control:
- export record
  dblp key:
  - journals/jacm/JagadeesanWWJS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jacm/JagadeesanWWJS23
Meena Jagadeesan, Alexander Wei, Yixin Wang, Michael I. Jordan, Jacob Steinhardt:
Learning Equilibria in Matching Markets with Bandit Feedback. J. ACM 70(3): 19:1-19:46 (2023)
[c56]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/aistats/BhatiaGS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aistats/BhatiaGS23
Kush Bhatia, Wenshuo Guo, Jacob Steinhardt:
Reward Learning as Doubly Nonparametric Bandits: Optimal Design and Scaling Laws. AISTATS 2023: 11149-11171
[c55]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/BurnsYKS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/BurnsYKS23
Collin Burns, Haotian Ye, Dan Klein, Jacob Steinhardt:
Discovering Latent Knowledge in Language Models Without Supervision. ICLR 2023
[c54]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/NandaCLSS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/NandaCLSS23
Neel Nanda, Lawrence Chan, Tom Lieberum, Jess Smith, Jacob Steinhardt:
Progress measures for grokking via mechanistic interpretability. ICLR 2023
[c53]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/WangVCSS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/WangVCSS23
Kevin Ro Wang, Alexandre Variengien, Arthur Conmy, Buck Shlegeris, Jacob Steinhardt:
Interpretability in the Wild: a Circuit for Indirect Object Identification in GPT-2 Small. ICLR 2023
[c52]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/JonesDRS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/JonesDRS23
Erik Jones, Anca D. Dragan, Aditi Raghunathan, Jacob Steinhardt:
Automatically Auditing Large Language Models via Discrete Optimization. ICML 2023: 15307-15329
[c51]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/YangSH23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/YangSH23
Yongyi Yang, Jacob Steinhardt, Wei Hu:
Are Neurons Actually Collapsed? On the Fine-Grained Structure in Neural Representations. ICML 2023: 39453-39487
[c50]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/0001HS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/0001HS23
Alexander Wei, Nika Haghtalab, Jacob Steinhardt:
Jailbroken: How Does LLM Safety Training Fail? NeurIPS 2023
[c49]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/Jagadeesan0S23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/Jagadeesan0S23
Meena Jagadeesan, Nikhil Garg, Jacob Steinhardt:
Supply-Side Equilibria in Recommender Systems. NeurIPS 2023
[c48]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/JagadeesanJSH23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/JagadeesanJSH23
Meena Jagadeesan, Michael I. Jordan, Jacob Steinhardt, Nika Haghtalab:
Improved Bayes Risk Can Yield Reduced Social Welfare Under Competition. NeurIPS 2023
[c47]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/TongJS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/TongJS23
Shengbang Tong, Erik Jones, Jacob Steinhardt:
Mass-Producing Failures of Multimodal Systems with Language Models. NeurIPS 2023
[c46]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/ZhongZLAKS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/ZhongZLAKS23
Ruiqi Zhong, Peter Zhang, Steve Li, Jinwoo Ahn, Dan Klein, Jacob Steinhardt:
Goal Driven Discovery of Distributional Differences via Language Descriptions. NeurIPS 2023
[i72]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2301-05217
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2301-05217
Neel Nanda, Lawrence Chan, Tom Lieberum, Jess Smith, Jacob Steinhardt:
Progress measures for grokking via mechanistic interpretability. CoRR abs/2301.05217 (2023)
[i71]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-12349
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-12349
Kush Bhatia, Wenshuo Guo, Jacob Steinhardt:
Reward Learning as Doubly Nonparametric Bandits: Optimal Design and Scaling Laws. CoRR abs/2302.12349 (2023)
[i70]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-14233
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-14233
Ruiqi Zhong, Peter Zhang, Steve Li, Jinwoo Ahn, Dan Klein, Jacob Steinhardt:
Goal Driven Discovery of Distributional Differences via Language Descriptions. CoRR abs/2302.14233 (2023)
[i69]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2303-04381
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-04381
Erik Jones, Anca D. Dragan, Aditi Raghunathan, Jacob Steinhardt:
Automatically Auditing Large Language Models via Discrete Optimization. CoRR abs/2303.04381 (2023)
[i68]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2303-08112
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-08112
Nora Belrose, Zach Furman, Logan Smith, Danny Halawi, Igor Ostrovsky, Lev McKinney, Stella Biderman, Jacob Steinhardt:
Eliciting Latent Predictions from Transformers with the Tuned Lens. CoRR abs/2303.08112 (2023)
[i67]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-07479
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-07479
Xinyan Hu, Meena Jagadeesan, Michael I. Jordan, Jacob Steinhardt:
Incentivizing High-Quality Content in Online Recommender Systems. CoRR abs/2306.07479 (2023)
[i66]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-12105
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-12105
Shengbang Tong, Erik Jones, Jacob Steinhardt:
Mass-Producing Failures of Multimodal Systems with Language Models. CoRR abs/2306.12105 (2023)
[i65]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-14670
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-14670
Meena Jagadeesan, Michael I. Jordan, Jacob Steinhardt, Nika Haghtalab:
Improved Bayes Risk Can Yield Reduced Social Welfare Under Competition. CoRR abs/2306.14670 (2023)
[i64]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-17105
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-17105
Yongyi Yang, Jacob Steinhardt, Wei Hu:
Are Neurons Actually Collapsed? On the Fine-Grained Structure in Neural Representations. CoRR abs/2306.17105 (2023)
[i63]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2307-02483
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2307-02483
Alexander Wei, Nika Haghtalab, Jacob Steinhardt:
Jailbroken: How Does LLM Safety Training Fail? CoRR abs/2307.02483 (2023)
[i62]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2307-08678
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2307-08678
Yanda Chen, Ruiqi Zhong, Narutatsu Ri, Chen Zhao, He He, Jacob Steinhardt, Zhou Yu, Kathleen R. McKeown:
Do Models Explain Themselves? Counterfactual Simulatability of Natural Language Explanations. CoRR abs/2307.08678 (2023)
[i61]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2307-09476
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2307-09476
Danny Halawi, Jean-Stanislas Denain, Jacob Steinhardt:
Overthinking the Truth: Understanding how Language Models Process False Demonstrations. CoRR abs/2307.09476 (2023)
[i60]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-05916
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-05916
Yossi Gandelsman, Alexei A. Efros, Jacob Steinhardt:
Interpreting CLIP's Image Representation via Text-Based Decomposition. CoRR abs/2310.05916 (2023)
[i59]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-17191
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-17191
Jiahai Feng, Jacob Steinhardt:
How do Language Models Bind Entities in Context? CoRR abs/2310.17191 (2023)
[i58]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-02974
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-02974
Lisa Dunlap, Yuhui Zhang, Xiaohan Wang, Ruiqi Zhong, Trevor Darrell, Jacob Steinhardt, Joseph E. Gonzalez, Serena Yeung-Levy:
Describing Differences in Image Sets with Natural Language. CoRR abs/2312.02974 (2023)
2022
[j8]
- view
  authority control:
- export record
  dblp key:
  - journals/ml/KohSL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ml/KohSL22
Pang Wei Koh, Jacob Steinhardt, Percy Liang:
Stronger data poisoning attacks break data sanitization defenses. Mach. Learn. 111(1): 1-47 (2022)
[c45]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/WangMGSS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/WangMGSS22
Ye Wang, Norman Mu, Daniele Grandi, Nicolas Savva, Jacob Steinhardt:
A3D: Studying Pretrained Representations with Programmable Datasets. CVPR Workshops 2022: 4877-4885
[c44]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/HendrycksZMTLSS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/HendrycksZMTLSS22
Dan Hendrycks, Andy Zou, Mantas Mazeika, Leonard Tang, Bo Li, Dawn Song, Jacob Steinhardt:
PixMix: Dreamlike Pictures Comprehensively Improve Safety Measures. CVPR 2022: 16762-16771
[c43]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/PanBS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/PanBS22
Alexander Pan, Kush Bhatia, Jacob Steinhardt:
The Effects of Reward Misspecification: Mapping and Mitigating Misaligned Models. ICLR 2022
[c42]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/HendrycksBMZKMS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/HendrycksBMZKMS22
Dan Hendrycks, Steven Basart, Mantas Mazeika, Andy Zou, Joseph Kwon, Mohammadreza Mostajabi, Jacob Steinhardt, Dawn Song:
Scaling Out-of-Distribution Detection for Real-World Settings. ICML 2022: 8759-8773
[c41]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/0001HS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/0001HS22
Alexander Wei, Wei Hu, Jacob Steinhardt:
More Than a Toy: Random Matrix Models Predict How Real-World Neural Representations Generalize. ICML 2022: 23549-23588
[c40]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/YuY00S22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/YuY00S22
Yaodong Yu, Zitong Yang, Alexander Wei, Yi Ma, Jacob Steinhardt:
Predicting Out-of-Distribution Error with the Projection Norm. ICML 2022: 25721-25746
[c39]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/ZhongSKS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/ZhongSKS22
Ruiqi Zhong, Charlie Snell, Dan Klein, Jacob Steinhardt:
Describing Differences between Text Distributions with Natural Language. ICML 2022: 27099-27116
[c38]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/JonesS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/JonesS22
Erik Jones, Jacob Steinhardt:
Capturing Failures of Large Language Models via Human Cognitive Biases. NeurIPS 2022
[c37]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/MazeikaTZBCSFSH22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/MazeikaTZBCSFSH22
Mantas Mazeika, Eric Tang, Andy Zou, Steven Basart, Jun Shern Chan, Dawn Song, David A. Forsyth, Jacob Steinhardt, Dan Hendrycks:
How Would The Viewer Feel? Estimating Wellbeing From Video Scenarios. NeurIPS 2022
[c36]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/ZouXJKMLSSEH22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/ZouXJKMLSSEH22
Andy Zou, Tristan Xiao, Ryan Jia, Joe Kwon, Mantas Mazeika, Richard Li, Dawn Song, Jacob Steinhardt, Owain Evans, Dan Hendrycks:
Forecasting Future World Events With Neural Networks. NeurIPS 2022
[i57]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2201-03544
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2201-03544
Alexander Pan, Kush Bhatia, Jacob Steinhardt:
The Effects of Reward Misspecification: Mapping and Mitigating Misaligned Models. CoRR abs/2201.03544 (2022)
[i56]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2201-12323
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2201-12323
Ruiqi Zhong, Charlie Snell, Dan Klein, Jacob Steinhardt:
Summarizing Differences between Text Distributions with Natural Language. CoRR abs/2201.12323 (2022)
[i55]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2202-05834
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-05834
Yaodong Yu, Zitong Yang, Alexander Wei, Yi Ma, Jacob Steinhardt:
Predicting Out-of-Distribution Error with the Projection Norm. CoRR abs/2202.05834 (2022)
[i54]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2202-12299
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-12299
Erik Jones, Jacob Steinhardt:
Capturing Failures of Large Language Models via Human Cognitive Biases. CoRR abs/2202.12299 (2022)
[i53]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-06176
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-06176
Alexander Wei, Wei Hu, Jacob Steinhardt:
More Than a Toy: Random Matrix Models Predict How Real-World Neural Representations Generalize. CoRR abs/2203.06176 (2022)
[i52]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2206-13489
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-13489
Meena Jagadeesan, Nikhil Garg, Jacob Steinhardt:
Supply-Side Equilibria in Recommender Systems. CoRR abs/2206.13489 (2022)
[i51]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2206-13498
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-13498
Jean-Stanislas Denain, Jacob Steinhardt:
Auditing Visualizations: Transparency Methods Struggle to Detect Anomalous Behavior. CoRR abs/2206.13498 (2022)
[i50]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2206-15474
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-15474
Andy Zou, Tristan Xiao, Ryan Jia, Joe Kwon, Mantas Mazeika, Richard Li, Dawn Song, Jacob Steinhardt, Owain Evans, Dan Hendrycks:
Forecasting Future World Events with Neural Networks. CoRR abs/2206.15474 (2022)
[i49]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-10039
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-10039
Mantas Mazeika, Eric Tang, Andy Zou, Steven Basart, Jun Shern Chan, Dawn Song, David A. Forsyth, Jacob Steinhardt, Dan Hendrycks:
How Would The Viewer Feel? Estimating Wellbeing From Video Scenarios. CoRR abs/2210.10039 (2022)
[i48]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-00593
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-00593
Kevin Wang, Alexandre Variengien, Arthur Conmy, Buck Shlegeris, Jacob Steinhardt:
Interpretability in the Wild: a Circuit for Indirect Object Identification in GPT-2 small. CoRR abs/2211.00593 (2022)
[i47]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2212-03827
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2212-03827
Collin Burns, Haotian Ye, Dan Klein, Jacob Steinhardt:
Discovering Latent Knowledge in Language Models Without Supervision. CoRR abs/2212.03827 (2022)
2021
[j7]
- view
  authority control:
- export record
  dblp key:
  - journals/cacm/Steinhardt21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/cacm/Steinhardt21
Jacob Steinhardt:
Technical perspective: Robust statistics tackle new problems. Commun. ACM 64(5): 106 (2021)
[c35]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/ZhongGKS21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/ZhongGKS21
Ruiqi Zhong, Dhruba Ghosh, Dan Klein, Jacob Steinhardt:
Are Larger Pretrained Language Models Uniformly Better? Comparing Performance at the Instance Level. ACL/IJCNLP (Findings) 2021: 3813-3827
[c34]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/BurnsS21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/BurnsS21
Collin Burns, Jacob Steinhardt:
Limitations of Post-Hoc Feature Alignment for Robustness. CVPR 2021: 2525-2533
[c33]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/HendrycksZBSS21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/HendrycksZBSS21
Dan Hendrycks, Kevin Zhao, Steven Basart, Jacob Steinhardt, Dawn Song:
Natural Adversarial Examples. CVPR 2021: 15262-15271
[c32]
- view
  authority control:
- export record
  dblp key:
  - conf/iccv/HendrycksBMKWDD21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iccv/HendrycksBMKWDD21
Dan Hendrycks, Steven Basart, Norman Mu, Saurav Kadavath, Frank Wang, Evan Dorundo, Rahul Desai, Tyler Zhu, Samyak Parajuli, Mike Guo, Dawn Song, Jacob Steinhardt, Justin Gilmer:
The Many Faces of Robustness: A Critical Analysis of Out-of-Distribution Generalization. ICCV 2021: 8320-8329
[c31]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/HendrycksBBC0SS21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/HendrycksBBC0SS21
Dan Hendrycks, Collin Burns, Steven Basart, Andrew Critch, Jerry Li, Dawn Song, Jacob Steinhardt:
Aligning AI With Shared Human Values. ICLR 2021
[c30]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/HendrycksBBZMSS21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/HendrycksBBZMSS21
Dan Hendrycks, Collin Burns, Steven Basart, Andy Zou, Mantas Mazeika, Dawn Song, Jacob Steinhardt:
Measuring Massive Multitask Language Understanding. ICLR 2021
[c29]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/innovations/BhatiaBDS21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/innovations/BhatiaBDS21
Kush Bhatia, Peter L. Bartlett, Anca D. Dragan, Jacob Steinhardt:
Agnostic Learning with Unknown Utilities. ITCS 2021: 55:1-55:20
[c28]
- view
  - electronic edition @ neurips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/DingDS21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/DingDS21
Frances Ding, Jean-Stanislas Denain, Jacob Steinhardt:
Grounding Representation Similarity Through Statistical Testing. NeurIPS 2021: 1556-1568
[c27]
- view
  - electronic edition @ neurips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/HendrycksBKABTS21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/HendrycksBKABTS21
Dan Hendrycks, Collin Burns, Saurav Kadavath, Akul Arora, Steven Basart, Eric Tang, Dawn Song, Jacob Steinhardt:
Measuring Mathematical Problem Solving With the MATH Dataset. NeurIPS Datasets and Benchmarks 2021
[c26]
- view
  - electronic edition @ neurips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/HendrycksBKMAGB21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/HendrycksBKMAGB21
Dan Hendrycks, Steven Basart, Saurav Kadavath, Mantas Mazeika, Akul Arora, Ethan Guo, Collin Burns, Samir Puranik, Horace He, Dawn Song, Jacob Steinhardt:
Measuring Coding Challenge Competence With APPS. NeurIPS Datasets and Benchmarks 2021
[c25]
- view
  - electronic edition @ neurips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/HendrycksMZPZNS21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/HendrycksMZPZNS21
Dan Hendrycks, Mantas Mazeika, Andy Zou, Sahil Patel, Christine Zhu, Jesus Navarro, Dawn Song, Bo Li, Jacob Steinhardt:
What Would Jiminy Cricket Do? Towards Agents That Behave Morally. NeurIPS Datasets and Benchmarks 2021
[c24]
- view
  - electronic edition @ neurips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/JagadeesanWWJS21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/JagadeesanWWJS21
Meena Jagadeesan, Alexander Wei, Yixin Wang, Michael I. Jordan, Jacob Steinhardt:
Learning Equilibria in Matching Markets from Bandit Feedback. NeurIPS 2021: 3323-3335
[i46]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2103-03874
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2103-03874
Dan Hendrycks, Collin Burns, Saurav Kadavath, Akul Arora, Steven Basart, Eric Tang, Dawn Song, Jacob Steinhardt:
Measuring Mathematical Problem Solving With the MATH Dataset. CoRR abs/2103.03874 (2021)
[i45]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2103-05898
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2103-05898
Collin Burns, Jacob Steinhardt:
Limitations of Post-Hoc Feature Alignment for Robustness. CoRR abs/2103.05898 (2021)
[i44]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2103-07601
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2103-07601
Charlie Snell, Ruiqi Zhong, Dan Klein, Jacob Steinhardt:
Approximating How Single Head Attention Learns. CoRR abs/2103.07601 (2021)
[i43]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2103-09947
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2103-09947
Yaodong Yu, Zitong Yang, Edgar Dobriban, Jacob Steinhardt, Yi Ma:
Understanding Generalization in Adversarial Training via the Bias-Variance Decomposition. CoRR abs/2103.09947 (2021)
[i42]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2104-08482
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-08482
Kush Bhatia, Peter L. Bartlett, Anca D. Dragan, Jacob Steinhardt:
Agnostic learning with unknown utilities. CoRR abs/2104.08482 (2021)
[i41]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2105-06020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2105-06020
Ruiqi Zhong, Dhruba Ghosh, Dan Klein, Jacob Steinhardt:
Are Larger Pretrained Language Models Uniformly Better? Comparing Performance at the Instance Level. CoRR abs/2105.06020 (2021)
[i40]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2105-09938
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2105-09938
Dan Hendrycks, Steven Basart, Saurav Kadavath, Mantas Mazeika, Akul Arora, Ethan Guo, Collin Burns, Samir Puranik, Horace He, Dawn Song, Jacob Steinhardt:
Measuring Coding Challenge Competence With APPS. CoRR abs/2105.09938 (2021)
[i39]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2108-01661
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2108-01661
Frances Ding, Jean-Stanislas Denain, Jacob Steinhardt:
Grounding Representation Similarity with Statistical Testing. CoRR abs/2108.01661 (2021)
[i38]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2108-08843
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2108-08843
Meena Jagadeesan, Alexander Wei, Yixin Wang, Michael I. Jordan, Jacob Steinhardt:
Learning Equilibria in Matching Markets from Bandit Feedback. CoRR abs/2108.08843 (2021)
[i37]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2109-13916
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2109-13916
Dan Hendrycks, Nicholas Carlini, John Schulman, Jacob Steinhardt:
Unsolved Problems in ML Safety. CoRR abs/2109.13916 (2021)
[i36]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-13136
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-13136
Dan Hendrycks, Mantas Mazeika, Andy Zou, Sahil Patel, Christine Zhu, Jesus Navarro, Dawn Song, Bo Li, Jacob Steinhardt:
What Would Jiminy Cricket Do? Towards Agents That Behave Morally. CoRR abs/2110.13136 (2021)
[i35]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2112-04094
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2112-04094
Alan Pham, Eunice Chan, Vikranth Srivatsa, Dhruba Ghosh, Yaoqing Yang, Yaodong Yu, Ruiqi Zhong, Joseph E. Gonzalez, Jacob Steinhardt:
The Effect of Model Size on Worst-Group Generalization. CoRR abs/2112.04094 (2021)
[i34]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2112-05135
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2112-05135
Dan Hendrycks, Andy Zou, Mantas Mazeika, Leonard Tang, Bo Li, Dawn Song, Jacob Steinhardt:
PixMix: Dreamlike Pictures Comprehensively Improve Safety Measures. CoRR abs/2112.05135 (2021)
2020
[c23]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/EngstromISTSM20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/EngstromISTSM20
Logan Engstrom, Andrew Ilyas, Shibani Santurkar, Dimitris Tsipras, Jacob Steinhardt, Aleksander Madry:
Identifying Statistical Bias in Dataset Replication. ICML 2020: 2922-2932
[c22]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/YangYYSM20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/YangYYSM20
Zitong Yang, Yaodong Yu, Chong You, Jacob Steinhardt, Yi Ma:
Rethinking Bias-Variance Trade-off for Generalization of Neural Networks. ICML 2020: 10767-10777
[c21]
- view
  authority control:
- export record
  dblp key:
  - conf/isit/ZhuJS20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/isit/ZhuJS20
Banghua Zhu, Jiantao Jiao, Jacob Steinhardt:
When does the Tukey Median work? ISIT 2020: 1201-1206
[c20]
- view
  - electronic edition @ neurips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/DathathriDKRUBS20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/DathathriDKRUBS20
Sumanth Dathathri, Krishnamurthy Dvijotham, Alexey Kurakin, Aditi Raghunathan, Jonathan Uesato, Rudy Bunel, Shreya Shankar, Jacob Steinhardt, Ian J. Goodfellow, Percy Liang, Pushmeet Kohli:
Enabling certification of verification-agnostic networks via memory-efficient semidefinite programming. NeurIPS 2020
[i33]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2001-07805
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2001-07805
Banghua Zhu, Jiantao Jiao, Jacob Steinhardt:
When does the Tukey median work? CoRR abs/2001.07805 (2020)
[i32]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2002-11328
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2002-11328
Zitong Yang, Yaodong Yu, Chong You, Jacob Steinhardt, Yi Ma:
Rethinking Bias-Variance Trade-off for Generalization of Neural Networks. CoRR abs/2002.11328 (2020)
[i31]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2005-09619
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2005-09619
Logan Engstrom, Andrew Ilyas, Shibani Santurkar, Dimitris Tsipras, Jacob Steinhardt, Aleksander Madry:
Identifying Statistical Bias in Dataset Replication. CoRR abs/2005.09619 (2020)
[i30]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2005-14073
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2005-14073
Banghua Zhu, Jiantao Jiao, Jacob Steinhardt:
Robust estimation via generalized quasi-gradients. CoRR abs/2005.14073 (2020)
[i29]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2006-16241
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2006-16241
Dan Hendrycks, Steven Basart, Norman Mu, Saurav Kadavath, Frank Wang, Evan Dorundo, Rahul Desai, Tyler Zhu, Samyak Parajuli, Mike Guo, Dawn Song, Jacob Steinhardt, Justin Gilmer:
The Many Faces of Robustness: A Critical Analysis of Out-of-Distribution Generalization. CoRR abs/2006.16241 (2020)
[i28]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2008-02275
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2008-02275
Dan Hendrycks, Collin Burns, Steven Basart, Andrew Critch, Jerry Li, Dawn Song, Jacob Steinhardt:
Aligning AI With Shared Human Values. CoRR abs/2008.02275 (2020)
[i27]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2009-03300
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2009-03300
Dan Hendrycks, Collin Burns, Steven Basart, Andy Zou, Mantas Mazeika, Dawn Song, Jacob Steinhardt:
Measuring Massive Multitask Language Understanding. CoRR abs/2009.03300 (2020)
[i26]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-11645
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-11645
Sumanth Dathathri, Krishnamurthy Dvijotham, Alexey Kurakin, Aditi Raghunathan, Jonathan Uesato, Rudy Bunel, Shreya Shankar, Jacob Steinhardt, Ian J. Goodfellow, Percy Liang, Pushmeet Kohli:
Enabling certification of verification-agnostic networks via memory-efficient semidefinite programming. CoRR abs/2010.11645 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[j6]
- view
  authority control:
- export record
  dblp key:
  - journals/cacm/LiptonS19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/cacm/LiptonS19
Zachary C. Lipton, Jacob Steinhardt:
Research for practice: troubling trends in machine-learning scholarship. Commun. ACM 62(6): 45-53 (2019)
[j5]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/pacmpl/ShiSL19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/pacmpl/ShiSL19
Kensen Shi, Jacob Steinhardt, Percy Liang:
FrAngel: component-based synthesis with control structures. Proc. ACM Program. Lang. 3(POPL): 73:1-73:29 (2019)
[j4]
- view
  authority control:
- export record
  dblp key:
  - journals/queue/LiptonS19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/queue/LiptonS19
Zachary C. Lipton, Jacob Steinhardt:
Troubling Trends in Machine Learning Scholarship. ACM Queue 17(1): 80 (2019)
[c19]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/DiakonikolasKK019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/DiakonikolasKK019
Ilias Diakonikolas, Gautam Kamath, Daniel Kane, Jerry Li, Jacob Steinhardt, Alistair Stewart:
Sever: A Robust Meta-Algorithm for Stochastic Optimization. ICML 2019: 1596-1606
[i25]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1905-01034
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1905-01034
Daniel Kang, Yi Sun, Tom Brown, Dan Hendrycks, Jacob Steinhardt:
Transfer of Adversarial Robustness Between Perturbation Types. CoRR abs/1905.01034 (2019)
[i24]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1907-07174
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1907-07174
Dan Hendrycks, Kevin Zhao, Steven Basart, Jacob Steinhardt, Dawn Song:
Natural Adversarial Examples. CoRR abs/1907.07174 (2019)
[i23]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1908-08016
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1908-08016
Daniel Kang, Yi Sun, Dan Hendrycks, Tom Brown, Jacob Steinhardt:
Testing Robustness Against Unforeseen Adversaries. CoRR abs/1908.08016 (2019)
[i22]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1909-08755
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1909-08755
Banghua Zhu, Jiantao Jiao, Jacob Steinhardt:
Generalized Resilience and Robust Statistics. CoRR abs/1909.08755 (2019)
[i21]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1911-11132
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1911-11132
Dan Hendrycks, Steven Basart, Mantas Mazeika, Mohammadreza Mostajabi, Jacob Steinhardt, Dawn Song:
A Benchmark for Anomaly Segmentation. CoRR abs/1911.11132 (2019)
2018
[b1]
- view
  - electronic edition @ stanford.edu
  - details & citations
- export record
  dblp key:
  - phd/us/Steinhardt18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/phd/us/Steinhardt18
Jacob Steinhardt:
Robust learning: information theory and algorithms. Stanford University, USA, 2018
[c18]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/RaghunathanSL18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/RaghunathanSL18
Aditi Raghunathan, Jacob Steinhardt, Percy Liang:
Certified Defenses against Adversarial Examples. ICLR (Poster) 2018
[c17]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/innovations/SteinhardtCV18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/innovations/SteinhardtCV18
Jacob Steinhardt, Moses Charikar, Gregory Valiant:
Resilience: A Criterion for Learning in the Presence of Arbitrary Outliers. ITCS 2018: 45:1-45:21
[c16]
- view
- export record
  dblp key:
  - conf/nips/RaghunathanSL18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/RaghunathanSL18
Aditi Raghunathan, Jacob Steinhardt, Percy Liang:
Semidefinite relaxations for certifying robustness to adversarial examples. NeurIPS 2018: 10900-10910
[c15]
- view
  authority control:
- export record
  dblp key:
  - conf/stoc/KothariSS18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/stoc/KothariSS18
Pravesh K. Kothari, Jacob Steinhardt, David Steurer:
Robust moment estimation and improved clustering via sum of squares. STOC 2018: 1035-1046
[i20]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1801-09344
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1801-09344
Aditi Raghunathan, Jacob Steinhardt, Percy Liang:
Certified Defenses against Adversarial Examples. CoRR abs/1801.09344 (2018)
[i19]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1802-07228
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1802-07228
Miles Brundage, Shahar Avin, Jack Clark, Helen Toner, Peter Eckersley, Ben Garfinkel, Allan Dafoe, Paul Scharre, Thomas Zeitzoff, Bobby Filar, Hyrum S. Anderson, Heather Roff, Gregory C. Allen, Jacob Steinhardt, Carrick Flynn, Seán Ó hÉigeartaigh, Simon Beard, Haydn Belfield, Sebastian Farquhar, Clare Lyle, Rebecca Crootof, Owain Evans, Michael Page, Joanna Bryson, Roman Yampolskiy, Dario Amodei:
The Malicious Use of Artificial Intelligence: Forecasting, Prevention, and Mitigation. CoRR abs/1802.07228 (2018)
[i18]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1803-02815
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1803-02815
Ilias Diakonikolas, Gautam Kamath, Daniel M. Kane, Jerry Li, Jacob Steinhardt, Alistair Stewart:
Sever: A Robust Meta-Algorithm for Stochastic Optimization. CoRR abs/1803.02815 (2018)
[i17]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1807-03341
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1807-03341
Zachary C. Lipton, Jacob Steinhardt:
Troubling Trends in Machine Learning Scholarship. CoRR abs/1807.03341 (2018)
[i16]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1811-00741
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1811-00741
Pang Wei Koh, Jacob Steinhardt, Percy Liang:
Stronger Data Poisoning Attacks Break Data Sanitization Defenses. CoRR abs/1811.00741 (2018)
[i15]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1811-01057
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1811-01057
Aditi Raghunathan, Jacob Steinhardt, Percy Liang:
Semidefinite relaxations for certifying robustness to adversarial examples. CoRR abs/1811.01057 (2018)
[i14]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1811-05175
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1811-05175
Kensen Shi, Jacob Steinhardt, Percy Liang:
FrAngel: Component-Based Synthesis with Control Structures. CoRR abs/1811.05175 (2018)
2017
[c14]
- view
- export record
  dblp key:
  - conf/nips/SteinhardtKL17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/SteinhardtKL17
Jacob Steinhardt, Pang Wei Koh, Percy Liang:
Certified Defenses for Data Poisoning Attacks. NIPS 2017: 3517-3529
[c13]
- view
  authority control:
- export record
  dblp key:
  - conf/stoc/CharikarSV17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/stoc/CharikarSV17
Moses Charikar, Jacob Steinhardt, Gregory Valiant:
Learning from untrusted data. STOC 2017: 47-60
[i13]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/SteinhardtCV17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/SteinhardtCV17
Jacob Steinhardt, Moses Charikar, Gregory Valiant:
Resilience: A Criterion for Learning in the Presence of Arbitrary Outliers. CoRR abs/1703.04940 (2017)
[i12]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/Steinhardt17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/Steinhardt17
Jacob Steinhardt:
Does robustness imply tractability? A lower bound for planted clique in the semi-random model. CoRR abs/1704.05120 (2017)
[i11]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/SteinhardtKL17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/SteinhardtKL17
Jacob Steinhardt, Pang Wei Koh, Percy Liang:
Certified Defenses for Data Poisoning Attacks. CoRR abs/1706.03691 (2017)
[i10]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1711-07465
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1711-07465
Pravesh K. Kothari, Jacob Steinhardt:
Better Agnostic Clustering Via Relaxed Tensor Norms. CoRR abs/1711.07465 (2017)
[i9]
- view
  - electronic edition @ weizmann.ac.il (open access)
  - details & citations
- export record
  dblp key:
  - journals/eccc/Steinhardt17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/eccc/Steinhardt17
Jacob Steinhardt:
Does robustness imply tractability? A lower bound for planted clique in the semi-random model. Electron. Colloquium Comput. Complex. TR17 (2017)
2016
[c12]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/colt/SteinhardtVW16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/colt/SteinhardtVW16
Jacob Steinhardt, Gregory Valiant, Stefan Wager:
Memory, Communication, and Statistical Queries. COLT 2016: 1490-1516
[c11]
- view
- export record
  dblp key:
  - conf/nips/SteinhardtL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/SteinhardtL16
Jacob Steinhardt, Percy Liang:
Unsupervised Risk Estimation Using Only Conditional Independence Structure. NIPS 2016: 3657-3665
[c10]
- view
- export record
  dblp key:
  - conf/nips/SteinhardtVC16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/SteinhardtVC16
Jacob Steinhardt, Gregory Valiant, Moses Charikar:
Avoiding Imposters and Delinquents: Adversarial Crowdsourcing and Peer Prediction. NIPS 2016: 4439-4447
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/SteinhardtL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/SteinhardtL16
Jacob Steinhardt, Percy Liang:
Unsupervised Risk Estimation Using Only Conditional Independence Structure. CoRR abs/1606.05313 (2016)
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/SteinhardtVC16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/SteinhardtVC16
Jacob Steinhardt, Gregory Valiant, Moses Charikar:
Avoiding Imposters and Delinquents: Adversarial Crowdsourcing and Peer Prediction. CoRR abs/1606.05374 (2016)
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/AmodeiOSCSM16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/AmodeiOSCSM16
Dario Amodei, Chris Olah, Jacob Steinhardt, Paul F. Christiano, John Schulman, Dan Mané:
Concrete Problems in AI Safety. CoRR abs/1606.06565 (2016)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/CharikarSV16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/CharikarSV16
Moses Charikar, Jacob Steinhardt, Gregory Valiant:
Learning from Untrusted Data. CoRR abs/1611.02315 (2016)
2015
[c9]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/aistats/ShiSL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aistats/ShiSL15
Tianlin Shi, Jacob Steinhardt, Percy Liang:
Learning Where to Sample in Structured Prediction. AISTATS 2015
[c8]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/colt/SteinhardtD15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/colt/SteinhardtD15
Jacob Steinhardt, John C. Duchi:
Minimax rates for memory-bounded sparse linear regression. COLT 2015: 1564-1587
[c7]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/SteinhardtL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/SteinhardtL15
Jacob Steinhardt, Percy Liang:
Reified Context Models. ICML 2015: 1043-1052
[c6]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/SteinhardtL15a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/SteinhardtL15a
Jacob Steinhardt, Percy Liang:
Learning Fast-Mixing Models for Structured Prediction. ICML 2015: 1063-1072
[c5]
- view
- export record
  dblp key:
  - conf/nips/SteinhardtL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/SteinhardtL15
Jacob Steinhardt, Percy Liang:
Learning with Relaxed Supervision. NIPS 2015: 2827-2835
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/SteinhardtL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/SteinhardtL15
Jacob Steinhardt, Percy Liang:
Reified Context Models. CoRR abs/1502.06665 (2015)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/SteinhardtL15a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/SteinhardtL15a
Jacob Steinhardt, Percy Liang:
Learning Fast-Mixing Models for Structured Prediction. CoRR abs/1502.06668 (2015)
[i2]
- view
  - electronic edition @ weizmann.ac.il (open access)
  - details & citations
- export record
  dblp key:
  - journals/eccc/SteinhardtVW15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/eccc/SteinhardtVW15
Jacob Steinhardt, Gregory Valiant, Stefan Wager:
Memory, Communication, and Statistical Queries. Electron. Colloquium Comput. Complex. TR15 (2015)
2014
[c4]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/SteinhardtL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/SteinhardtL14
Jacob Steinhardt, Percy Liang:
Filtering with Abstract Particles. ICML 2014: 727-735
[c3]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/SteinhardtL14a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/SteinhardtL14a
Jacob Steinhardt, Percy Liang:
Adaptivity and Optimism: An Improved Exponentiated Gradient Algorithm. ICML 2014: 1593-1601
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/SteinhardtWL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/SteinhardtWL14
Jacob Steinhardt, Stefan Wager, Percy Liang:
The Statistics of Streaming Sparse Regression. CoRR abs/1412.4182 (2014)
2012
[j3]
- view
  authority control:
- export record
  dblp key:
  - journals/ijrr/SteinhardtT12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ijrr/SteinhardtT12
Jacob Steinhardt, Russ Tedrake:
Finite-time regional verification of stochastic non-linear systems. Int. J. Robotics Res. 31(7): 901-923 (2012)
[c2]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - journals/jmlr/SteinhardtG12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jmlr/SteinhardtG12
Jacob Steinhardt, Zoubin Ghahramani:
Flexible Martingale Priors for Deep Hierarchies. AISTATS 2012: 1108-1116
2011
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/rss/SteinhardtT11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/rss/SteinhardtT11
Jacob Steinhardt, Russ Tedrake:
Finite-Time Regional Verification of Stochastic Nonlinear Systems. Robotics: Science and Systems 2011
2010
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/combinatorics/Steinhardt10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/combinatorics/Steinhardt10
Jacob Steinhardt:
Permutations with Ascending and Descending Blocks. Electron. J. Comb. 17(1) (2010)

2000 – 2009

see FAQ

What is the meaning of the colors in the publication lists?

2009
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/combinatorics/Steinhardt09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/combinatorics/Steinhardt09
Jacob Steinhardt:
On Coloring the Odd-Distance Graph. Electron. J. Comb. 16(1) (2009)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.