Mar 5 – 8, 2024
Lahan Select Gyeongju, South Korea
Asia/Seoul timezone

Learning to Do or Learning While Doing: Reinforcement Learning and Bayesian Optimisation for Online Continuous Tuning

Mar 5, 2024, 1:30 PM
20m
Lahan Select Gyeongju, South Korea

Lahan Select Gyeongju, South Korea

Lahan Select Gyeongju, South Korea
Oral (16mins + 4 mins) Optimization & Control Tutorials

Speaker

Chenran Xu (Karlsruhe Institut für Technologie (KIT))

Description

Online tuning of particle accelerators is a complex optimisation problem that continues to require manual intervention by experienced human operators. Autonomous tuning is a rapidly expanding field of research, where learning-based methods like Bayesian optimisation (BO) hold great promise in improving plant performance and reducing tuning times. At the same time, Reinforcement Learning (RL) is a capable method of learning intelligent controllers, while recent work shows that RL can also be used to train domain-specialised optimisers in so-called Reinforcement Learning-
trained Optimisation (RLO). In parallel efforts, both algorithms have found successful adoption in particle accelerator tuning. Here we present a comparative case study, analysing the behaviours of both algorithms and outlining their strengths and weaknesses. The results of our study help provide criteria for choosing a suitable learning-based tuning algorithm for a given task and will accelerate research and adoption of these methods with particle accelerators and other complex real-
world facilities, ultimately improving their availability and pushing their operational limits, thereby enabling scientific and engineering advancements.

Primary Keyword ML-based optimization
Secondary Keyword reinforcement learning
Tertiary Keyword bayesian optimization

Primary authors

Chenran Xu (Karlsruhe Institut für Technologie (KIT)) Jan Kaiser (Deutsches Elektronen-Synchrotron DESY)

Co-authors

Andrea Santamaria Garcia (Karlsruhe Institute of Technology) Annika Eichler (DESY) Dr Erik Bründermann (Karlsruhe Institute of Technology (KIT)) Florian Burkart (DESY) Frank Mayet (DESY) Hannes Dinter (DESY) Holger Schlarb (DESY) Oliver Stein (DESY) Thomas Vinatier (DESY) Willi Kuropka (DESY)

Presentation materials