Accepted Papers

  1. Planning with Consistency Models for Model-Based Offline Reinforcement Learning
    Guanquan Wang, Takuya Hiraoka, Yoshimasa Tsuruoka

  2. Meta SAC-Lag: Towards Deployable Safe Reinforcement Learning via MetaGradient-based Hyperparameter Tuning
    Homayoun Honari, Amir Mehdi Soufi Enayati, Mehran Ghafarian Tamizi, Homayoun Najjaran

  3. An Adaptation of RLSVI with Explicit Action Sampling Probabilities
    Ziping Xu, Iris Yan, Susan Murphy

  4. Handling Delay in Reinforcement Learning Caused by Parallel Computations of Neurons
    Ivan Anokhin, Rishav, Stephen Chung, Irina Rish, Samira Ebrahimi Kahou

  5. On Shallow Planning under Partial Observability
    Randy Lefebvre, Audrey Durand

  6. A Practical Approach for Safe Exploration
    Yarden As, Bhavya Sukhija, Andreas Krause

  7. Settling Statistical Barriers for the Deployment of a Meta-Trained Agent
    Mirco Mutti, Aviv Tamar

  8. Personalized Federated Reinforcement Learning with Shared Representations
    Guojun Xiong, Shufan Wang, Daniel Jiang, Jian Li

  9. Uncertainty of Joint Neural Contextual Bandit
    Hongbo Guo, Zheqing (Bill) Zhu

  10. Multimodal Model-Based Reinforcement Learning for Autonomous Racing
    Elena Shrestha, Hanxi Wan, Chetan Reddy, Yulun Zhuang, Ram Vasudevan

  11. Towards Faster Matrix Diagonalization with Graph Isomorphism Networks and the AlphaZero Framework
    Geigh Zollicoffer, Kshitij Bhatta, Manish Bhattarai, Phil Romero, Christian Negre, Anders M. N. Niklasson, Adetokunbo Adedoyin

  12. Navigating Safe Campus Operations during Epidemics with Reinforcement Learning
    Elizabeth Ondula, Bhaskar Krishnamachari

  13. A Crystal Ball for Comfort: Financial Autonomy in Thermostats via Reinforcement Learning and Prediction
    Narjes Nourzad, Bhaskar Krishnamachari, Matthew Kahn

  14. Hierarchical Multi-Armed Bandits for the Concurrent Intelligent Tutoring of Concepts and Problems of Varying Difficulty Levels
    Blake Castleman, Uzay Macar, Ansaf Salleb-Aouissi

  15. Simulator-Based Reinforcement Learning for Data Center Cooling Optimization
    Chi Zhou, Doris Gao, Lisa Rivalin, Andrew Grier, Gerson Arteaga Ramirez, John Fabian

  16. Sequential Decision-Making for Inline Text Autocomplete
    Rohan Chitnis, Shentao Yang, Alborz Geramifard