Learning Theory · LLM Verifiers · Reliable AI

Dravyansh Sharma

I study the theoretical foundations of learning systems that guide LLM reasoning, design algorithms from data, and adapt to adversarial and strategic environments.

I am currently an IDEAL Postdoctoral Researcher in Chicago (joint appointment at TTIC and Northwestern). I completed my PhD in the Computer Science Department at Carnegie Mellon University, where I was fortunate to be advised by Maria-Florina “Nina” Balcan. My recent work focuses especially on post-training verification for LLM Chain-of-Thought reasoning, including soundness-completeness trade-offs, process supervision, and the learnability of verifier classes. I also work on data-driven algorithm design, principled hyperparameter tuning, reliable learning, and learning with strategic agents.

Current focus: theoretical foundations for LLM verifiers and reasoning systems, with an emphasis on online and PAC learnability; open problems in the growing field of machine learning for algorithms; learning in agentic settings.
Dravyansh Sharma

Recent News and Highlights

Research

My work spans across learning theory, algorithms, and reliable AI, with a particular recent emphasis on verifier-based reasoning systems and learning in adaptive environments.

LLM verifiers and reasoning systems

I study the learnability of verifiers for LLM Chain-of-Thought reasoning, including online and PAC guarantees, soundness-completeness trade-offs, and how learned verifiers can reliably guide step-by-step reasoning post-training via process supervision.

Data-driven algorithm design

I study how to learn algorithmic choices, heuristics, hyperparameters, and configurations across problem instances, with guarantees for generalization, online learning, and performance across tasks.

Reliable learning

I work on learning algorithms with formal reliability guarantees under distribution shift, test-time attacks, poisoning, and other adversarial settings.

Learning with agents

I study learning in environments where data is produced by agents who may adapt, improve, or strategically respond to deployed models and algorithms.

Publications

2026

2025

2024

2023

2022

2021

  • [C7] Data driven semi-supervised learning, NeurIPS 2021 Oral, <1%
    with Maria-Florina Balcan
  • with Maria-Florina Balcan, Mikhail Khodak, Ameet Talwalkar
  • [W2] Improved pronunciation prediction accuracy using morphology, ACL SIG on Computational Morphology and Phonology, ACL 2021
    with Saumya Sahai, Neha Chaudhari, Antoine Bruguier
  • [W1] Predicting and Explaining French Grammatical Gender, ACL Special Interest Group on Typology, NAACL 2021
    with Saumya Sahai

2020 and before

Tutorials and Teaching

Workshops and Service

LAW at COLT 2026

Learning in an Agentic World: Foundations and Challenges. June 29, 2026, San Diego.

LAMP at TTIC

Learning-driven Algorithms and Machine-aided Proofs. August 6–7, 2026, TTIC.

Area chair

2026: ICML 2026, NeurIPS 2026
2025: NeurIPS 2025.

Reviewer

2026: COLT, JMLR, TMLR, UAI, AISTATS, SODA, ITCS, FOCS.
Before 2026 (other venues): ICML, NeurIPS, AAAI, ALT, TPAMI, SIMODS, Machine Learning.

Reach Me

Email: dravy [AT] ttic [DOT] edu
Office: 434, Toyota Technological Institute at Chicago, 6045 S Kenwood Ave, Chicago, IL 60637