Publications

Check my Google Scholar profile for more information! (* stands for equal contribution.)

Economics and Computer Science

Bandits and Online Learning

  • Working2026

    Policy Regret for Embedding Model Routing: Contextual Bandits with Low-Rank Experts
    Yan Dai, Negin Golrezaei, and Patrick Jaillet.

  • SIGMETRICS2025

    Adversarial Network Optimization under Bandit Feedback: Maximizing Utility in Non-Stationary Multi-Hop Networks
    Yan Dai and Longbo Huang.
    In Proceedings of the ACM on Measurement and Analysis of Computing Systems, 8(3):31, 2024.
    Best Paper Award of ACM SIGMETRICS 2025.

  • ICLR2025

    uniINF: Best-of-Both-Worlds Algorithm for Parameter-Free Heavy-Tailed MABs
    Yu Chen*, Jiatai Huang*, Yan Dai*, and Longbo Huang.

  • ICML2023

    Banker Online Mirror Descent: A Universal Approach for Delayed Online Bandit Learning
    Jiatai Huang*, Yan Dai*, and Longbo Huang.

  • ICLR2023

    Variance-Aware Sparse Linear Bandits
    Yan Dai, Ruosong Wang, and Simon S. Du.

  • ICML2022

    Adaptive Best-of-Both-Worlds Algorithm for Heavy-Tailed Multi-Armed Bandits
    Jiatai Huang*, Yan Dai*, and Longbo Huang.

Reinforcement Learning Theory

  • Working2026

    Learning Adversarial Continuous MDPs with Bandit Feedback and Unknown Transitions
    Aarush Kulkarni, Khang Nguyen, Ricardo Parada, Kenny Guo, William Chang, and Yan Dai.

  • COLT2024

    Refined Sample Complexity for Markov Games with Independent Linear Function Approximation
    Yan Dai, Qiwen Cui, and Simon S. Du.

  • ICML2023

    Refined Regret for Adversarial MDPs with Linear Function Approximation
    Yan Dai, Haipeng Luo, Chen-Yu Wei, and Julian Zimmert.

  • NeurIPS2022

    Follow-the-Perturbed-Leader for Adversarial Markov Decision Processes with Bandit Feedback
    Yan Dai, Haipeng Luo, and Liyu Chen.

Deep Learning Theory

  • ICML2024

    Understanding Adam Optimizer via Online Learning of Updates: Adam is FTRL in Disguise
    Kwangjun Ahn, Zhiyu Zhang, Yunbum Kook, and Yan Dai.

  • NeurIPS2023

    The Crucial Role of Normalization in Sharpness-Aware Minimization
    Yan Dai*, Kwangjun Ahn*, and Suvrit Sra.