
This work presents the application of reinforcement learning for the optimal resistive control of a point absorber. The model-free Q-learning algorithm is selected in order to maximise energy absorption in each sea state. Step changes are made to the controller damping, observing the associated penalty, for excessive motions, or reward, i.e. gain in associated power. Due to the general periodicity of gravity waves, the absorbed power is averaged over a time horizon lasting several wave periods. The performance of the algorithm is assessed through the numerical simulation of a point absorber subject to motions in heave in both regular and irregular waves. The algorithm is found to converge towards the optimal controller damping in each sea state. Additionally, the model-free approach ensures the algorithm can adapt to changes to the device hydrodynamics over time and is unbiased by modelling errors.
Technology, Science & Technology, SEA, Energy & Fuels, reinforcement learning (RL), Environmental engineering, DEVICE, CONTROL STRATEGIES, 620, POWER TAKE-OFF, Engineering, Q-learning, Electrical & Electronic, Science & Technology - Other Topics, LATCHING CONTROL, OPERATION, TA170, WAVE-ENERGY CONVERTERS, GREEN & SUSTAINABLE SCIENCE & TECHNOLOGY, Wave energy converter (WEC), SYSTEM, power take-off (PTO) system
Technology, Science & Technology, SEA, Energy & Fuels, reinforcement learning (RL), Environmental engineering, DEVICE, CONTROL STRATEGIES, 620, POWER TAKE-OFF, Engineering, Q-learning, Electrical & Electronic, Science & Technology - Other Topics, LATCHING CONTROL, OPERATION, TA170, WAVE-ENERGY CONVERTERS, GREEN & SUSTAINABLE SCIENCE & TECHNOLOGY, Wave energy converter (WEC), SYSTEM, power take-off (PTO) system
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 60 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
