Accepted Papers
- Planning with Consistency Models for Model-Based Offline Reinforcement Learning
  Guanquan Wang, Takuya Hiraoka, Yoshimasa Tsuruoka
- Meta SAC-Lag: Towards Deployable Safe Reinforcement Learning via MetaGradient-based Hyperparameter Tuning
  Homayoun Honari, Amir Mehdi Soufi Enayati, Mehran Ghafarian Tamizi, Homayoun Najjaran
- An Adaptation of RLSVI with Explicit Action Sampling Probabilities
  Ziping Xu, Iris Yan, Susan Murphy
- Handling Delay in Reinforcement Learning Caused by Parallel Computations of Neurons
  Ivan Anokhin, Rishav, Stephen Chung, Irina Rish, Samira Ebrahimi Kahou
- On Shallow Planning under Partial Observability
  Randy Lefebvre, Audrey Durand
- A Practical Approach for Safe Exploration
  Yarden As, Bhavya Sukhija, Andreas Krause
- Settling Statistical Barriers for the Deployment of a Meta-Trained Agent
  Mirco Mutti, Aviv Tamar
- Personalized Federated Reinforcement Learning with Shared Representations
  Guojun Xiong, Shufan Wang, Daniel Jiang, Jian Li
- Uncertainty of Joint Neural Contextual Bandit
  Hongbo Guo, Zheqing (Bill) Zhu
- Multimodal Model-Based Reinforcement Learning for Autonomous Racing
  Elena Shrestha, Hanxi Wan, Chetan Reddy, Yulun Zhuang, Ram Vasudevan
- Towards Faster Matrix Diagonalization with Graph Isomorphism Networks and the AlphaZero Framework
  Geigh Zollicoffer, Kshitij Bhatta, Manish Bhattarai, Phil Romero, Christian Negre, Anders M. N. Niklasson, Adetokunbo Adedoyin
- Navigating Safe Campus Operations during Epidemics with Reinforcement Learning
  Elizabeth Ondula, Bhaskar Krishnamachari
- A Crystal Ball for Comfort: Financial Autonomy in Thermostats via Reinforcement Learning and Prediction
  Narjes Nourzad, Bhaskar Krishnamachari, Matthew Kahn
- Hierarchical Multi-Armed Bandits for the Concurrent Intelligent Tutoring of Concepts and Problems of Varying Difficulty Levels
  Blake Castleman, Uzay Macar, Ansaf Salleb-Aouissi
- Simulator-Based Reinforcement Learning for Data Center Cooling Optimization
  Chi Zhou, Doris Gao, Lisa Rivalin, Andrew Grier, Gerson Arteaga Ramirez, John Fabian
- Sequential Decision-Making for Inline Text Autocomplete
  Rohan Chitnis, Shentao Yang, Alborz Geramifard