Non-stationary wind velocity simulation using deep reinforcement learning-based regulation and control

doi:10.12066/j.issn.1007-2861.2569

Journal of Shanghai University (Natural Science Edition), 2024, Vol. 30, Issue (3): 451-465. doi: 10.12066/j.issn.1007-2861.2569

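CAO Liyuan, ZHANG Zhenyu, LI Chunxiang

School of Mechanics and Engineering Sciences, Shanghai University, Shanghai 200444, China

Online: 2024-06-30 Published: 2024-07-09
Corresponding author: CAO Liyuan (born 1991), female, Ph.D.; research interests: structural vibration control and structural wind engineering. E-mail: caoly@shu.edu.cn
Funding: National Natural Science Foundation of China (52108460)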

Abstract: A novel hybrid simulation method that combines the deep deterministic policy gradient (DDPG) algorithm with the generalized S-transform (GST), referred to as DDPG-GST, is proposed. First, empirical mode decomposition (EMD) is used to decompose the original data into a non-stationary fluctuating wind speed component and a trend component. The GST is then used to extract the time-frequency characteristics of the non-stationary fluctuating component, from which the GST time-frequency power spectrum matrix is constructed. Next, Cholesky decomposition is applied to this matrix to generate simulated non-stationary fluctuating wind speeds. These simulated values are then fed into the DDPG network for regulation and control, which yields the optimal simulated values. Finally, the simulated total wind speed time history is obtained by superposing the simulated non-stationary fluctuating wind speeds on the trend component. The results show that, compared with the GST simulation method, DDPG-GST retains the energy characteristics of the non-stationary fluctuating wind speed in the time domain more accurately; the energy distribution of the GST coefficient amplitudes obtained by DDPG-GST in the time-frequency domain is closer to the target; and the average power spectrum of the DDPG-GST method is also closer to the target. Non-stationary wind speed simulation regulated by deep reinforcement learning is therefore a high-precision, data-driven simulation method.

Key words: non-stationary wind speed simulation, deep reinforcement learning, S-transform, regulation and control
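
The GST step named in the abstract can be illustrated with a short NumPy sketch. The abstract does not give the paper's GST definition, so the code below assumes one common generalization of the S-transform in which the Gaussian window width is sigma(f) = 1 / (lam * f**p). The function name `generalized_s_transform` and the parameters `lam` and `p` are hypothetical and are not taken from the paper; with `lam = 1` and `p = 1` the sketch reduces to the standard S-transform.

```python
import numpy as np


def generalized_s_transform(x, dt=1.0, lam=1.0, p=1.0):
    """Frequency-domain S-transform with an adjustable Gaussian window.

    Assumed window width (not from the paper): sigma(f) = 1 / (lam * f**p).
    Returns (freqs, coeffs), where coeffs[k, j] is the complex coefficient
    at frequency freqs[k] and time j * dt.
    """
    x = np.asarray(x, dtype=float)
    n = x.size
    spectrum = np.fft.fft(x)
    alpha = np.fft.fftfreq(n, d=dt)        # signed frequency offsets (Hz)
    n_freq = n // 2 + 1                    # non-negative frequencies only
    freqs = np.arange(n_freq) / (n * dt)

    coeffs = np.empty((n_freq, n), dtype=complex)
    coeffs[0] = x.mean()                   # zero-frequency row: signal mean
    for k in range(1, n_freq):
        sigma = 1.0 / (lam * freqs[k] ** p)
        # Fourier transform of a unit-area Gaussian with std sigma
        window = np.exp(-2.0 * np.pi**2 * alpha**2 * sigma**2)
        shifted = np.roll(spectrum, -k)    # shifted[m] = X[m + k]
        coeffs[k] = np.fft.ifft(shifted * window)
    return freqs, coeffs
```

Calling `generalized_s_transform(u, dt)` on a fluctuating wind speed record `u` sampled every `dt` seconds returns a frequency axis and a complex time-frequency matrix; `np.abs(coeffs)` gives the coefficient amplitudes that the abstract compares against the target. Summing each row of `coeffs` over time recovers the corresponding FFT coefficient of the input, which is a convenient sanity check for any choice of `lam` and `p`.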

CLC number: TU 311

Cite this article: CAO Liyuan, ZHANG Zhenyu, LI Chunxiang. Non-stationary wind velocity simulation using deep reinforcement learning-based regulation and control[J]. Journal of Shanghai University (Natural Science Edition), 2024, 30(3): 451-465.
