Pavel Izmailov

Contact: pi390@nyu.edu, Twitter
I am a researcher at Anthropic, primarily interested in reinforcement learning, reasoning, AI for science, and AI alignment. Previously, I worked on reasoning and superintelligent AI alignment at OpenAI.
Starting in Fall 2025, I will be joining NYU as an Assistant Professor in the Tandon CSE department and, by courtesy, the Courant CS department. I am also a member of the NYU CILVR Group.
My research interests center on understanding how deep neural networks work. I am excited about a wide range of topics in core machine learning, including:
- Problem-solving and reasoning in AI
- Reinforcement learning, planning and search
- Interpretability of deep learning models
- AI for scientific discovery and math
- Generalization and robustness of AI models
- Technical AI alignment
- Probabilistic deep learning, uncertainty estimation and Bayesian methods
Highlights
- I contributed to Anthropic's Claude 3.7 Sonnet, a state-of-the-art reasoning and coding model.
- I contributed to OpenAI o1, which set a new state of the art in LLM reasoning.
- Our work on weak-to-strong generalization was covered by WIRED, MIT Technology Review, and others.
- Our work on Bayesian model selection was recognized with an Outstanding Paper Award 🏆 at ICML 2022!
Links
- [CV, GitHub, Google Scholar, Semantic Scholar]
Publications
*Equal first authorship.
- OpenAI o1 System Card
  2024
  [arXiv]
- Learning to Reason with LLMs
  2024
  [OpenAI blog]
- Can a Confident Prior Replace a Cold Posterior?
  arXiv preprint, 2024
  [PDF, ArXiv, Code]
- Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision
  2023
  [PDF, ArXiv, OpenAI blog, Code] [WIRED, TechCrunch, MIT Technology Review, IEEE Spectrum]
- Simple and Fast Group Robustness by Automatic Feature Reweighting
  International Conference on Machine Learning (ICML), 2023
  [PDF, ArXiv, Code]
- FlexiViT: One Model for All Patch Sizes
  Conference on Computer Vision and Pattern Recognition (CVPR), 2023
  [PDF, ArXiv, Code]
- Last Layer Re-Training is Sufficient for Robustness to Spurious Correlations
  International Conference on Learning Representations (ICLR), 2023
  🌟 Spotlight Presentation
  [PDF, ArXiv, Code]
- On Feature Learning in the Presence of Spurious Correlations
  Neural Information Processing Systems (NeurIPS), 2022
  [PDF, ArXiv, Code]
- On Uncertainty, Tempering, and Data Augmentation in Bayesian Classification
  Neural Information Processing Systems (NeurIPS), 2022
  [PDF, ArXiv, Code]
- Bayesian Model Selection, the Marginal Likelihood, and Generalization
  International Conference on Machine Learning (ICML), 2022
  🏆 Outstanding Paper Award, 📢 Long Talk (Oral)
  [PDF, ArXiv, Code]
- Unsupervised learning of two-component nematicity from STM data on magic angle bilayer graphene
  arXiv preprint, 2022
  [PDF, ArXiv]
- Dangers of Bayesian Model Averaging under Covariate Shift
  Neural Information Processing Systems (NeurIPS), 2021
  [PDF, ArXiv, Poster, Code]
- Does Knowledge Distillation Really Work?
  Neural Information Processing Systems (NeurIPS), 2021
  [PDF, ArXiv, Poster, Code]
- What Are Bayesian Neural Network Posteriors Really Like?
  International Conference on Machine Learning (ICML), 2021
  📢 Long Talk (Oral)
  [PDF, ArXiv, Code, HMC samples, Poster, NeurIPS competition]
- Learning Invariances in Neural Networks from Training Data
  Neural Information Processing Systems (NeurIPS), 2020
  [PDF, ArXiv, Code]
- Why Normalizing Flows Fail to Detect Out-of-Distribution Data
  Neural Information Processing Systems (NeurIPS), 2020
  [PDF, ArXiv, Code]
- Bayesian Deep Learning and a Probabilistic Perspective of Generalization
  Neural Information Processing Systems (NeurIPS), 2020
  [PDF, ArXiv, Code]
- Generalizing Convolutional Neural Networks for Equivariance to Lie Groups on Arbitrary Continuous Data
  International Conference on Machine Learning (ICML), 2020
  [PDF, ArXiv, Code]
- Semi-Supervised Learning with Normalizing Flows
  International Conference on Machine Learning (ICML), 2020
  [PDF, ArXiv, Code]
- Subspace Inference for Bayesian Deep Learning
  Uncertainty in Artificial Intelligence (UAI), 2019
  [PDF, ArXiv, Code, Poster]
- A Simple Baseline for Bayesian Uncertainty in Deep Learning
  Neural Information Processing Systems (NeurIPS), 2019
  [PDF, ArXiv, Code, Poster, Video]
- There Are Many Consistent Explanations of Unlabeled Data: Why You Should Average
  International Conference on Learning Representations (ICLR), 2019
  [PDF, ArXiv, Code, Poster]
- Averaging Weights Leads to Wider Optima and Better Generalization
  Uncertainty in Artificial Intelligence (UAI), 2018
  📢 Oral Presentation
  [PDF, ArXiv, Code, Poster, Slides, PyTorch Blogpost, Towards Data Science Blogpost, fast.ai Blogpost]
- Loss Surfaces, Mode Connectivity, and Fast Ensembling of DNNs
  Neural Information Processing Systems (NeurIPS), 2018
  🌟 Spotlight Presentation
  [PDF, ArXiv, Code, Poster, Slides, Video, Blogpost]
- Tensor Train Decomposition on TensorFlow (T3F)
  Journal of Machine Learning Research (JMLR), 2020
  [PDF, ArXiv, Code]
- Scalable Gaussian Processes with Billions of Inducing Inputs via Tensor Train Decomposition
  Artificial Intelligence and Statistics (AISTATS), 2018
  📢 Oral Presentation
  [PDF, ArXiv, Code, Poster, Slides]
- Faster variational inducing input Gaussian process classification
  Journal of Machine Learning and Data Analysis, 2017
  [PDF, ArXiv]
Workshop Papers
- On Feature Learning in the Presence of Spurious Correlations
  ICML Workshop on Principles of Distribution Shift (PODS), 2022
- Last Layer Re-Training is Sufficient for Robustness to Spurious Correlations
  ICML Workshop on Spurious Correlations, Invariance, and Stability, 2022
  📢 Oral Presentation
  [PDF, ArXiv, Code]
- Semi-Supervised Learning with Normalizing Flows
  ICML Workshop on Invertible Neural Nets and Normalizing Flows, 2019
  [PDF, Poster]
- Invertible Convolutional Networks
  ICML Workshop on Invertible Neural Nets and Normalizing Flows, 2019
  🌟 Spotlight Presentation
  [PDF, Poster, Slides]
- Subspace Inference for Bayesian Deep Learning
  ICML Workshop on Uncertainty & Robustness in Deep Learning, 2019
  📢 Oral Presentation
  [PDF, ArXiv, Code, Poster, Slides, Polina's Talk]
- Fast Uncertainty Estimates and Bayesian Model Averaging of DNNs
  UAI Workshop: Uncertainty in Deep Learning, 2018
  📢 Oral Presentation
  [PDF, Code, Poster, Slides]
- Improving Stability in Deep Reinforcement Learning with Weight Averaging
  UAI Workshop: Uncertainty in Deep Learning, 2018
  [PDF, Poster]