上海大学学报(自然科学版) ›› 2022, Vol. 28 ›› Issue (3): 427-439.doi: 10.12066/j.issn.1007-2861.2379

• 数据采集、数据库和数据处理 • 上一篇    下一篇

基于卡尔曼滤波的含噪声小样本数据处理方法

刘芬1, 范洪强2, 吕涛3, 李谦2,4, 钱权1,3,5()   

  1. 1.上海大学 计算机工程与科学学院, 上海 200444
    2.上海大学 材料科学与工程学院, 上海 200444
    3.上海大学 材料基因组工程研究院 材料信息与数据科学中心, 上海 200444
    4.重庆大学 材料科学与工程学院 国家镁合金材料工程技术研究中心, 重庆 400044
    5.之江实验室, 浙江 杭州 311100
  • 收稿日期:2022-03-15 出版日期:2022-06-30 发布日期:2022-05-27
  • 通讯作者: 钱权 E-mail:qqian@shu.edu.cn
  • 作者简介:钱权(1972—), 男, 研究员, 博士生导师, 博士, 研究方向为材料信息学、机器学习、网络安全等. E-mail: qqian@shu.edu.cn
  • 基金资助:
    国家重点研发计划资助项目(2018YFB0704400);云南省重大科技专项资助项目(202102AB080019-3);云南省重大科技专项资助项目(202002AB080001-2);之江实验室科研攻关资助项目(2021PE0AC02);上海张江国家自主创新示范区专项发展资金重大资助项目(ZJ2021-ZD-006)

Kalman filter based method for processing small noisy sample data

LIU Fen1, FAN Hongqiang2, LÜ Tao3, LI Qian2,4, QIAN Quan1,3,5()   

  1. 1. School of Computer Engineering and Science, Shanghai University, Shanghai 200444, China
    2. School of Materials Science and Engineering, Shanghai University, Shanghai 200444, China
    3. Center of Materials Informatics and Data Science, Materials Genome Institute, Shanghai University, Shanghai 200444, China
    4. National Engineering Research Center for Magnesium Alloys, School of Materials Science and Engineering, Chongqing University, Chongqing 400044, China
    5. Zhejiang Laboratory, Hangzhou 311100, Zhejiang, China
  • Received:2022-03-15 Online:2022-06-30 Published:2022-05-27
  • Contact: QIAN Quan E-mail:qqian@shu.edu.cn

摘要:

提出一种基于卡尔曼滤波和扩展卡尔曼滤波的小样本噪声数据处理方法. 首先, 通过物理模型或经验公式建立系统模型. 然后, 利用系统模型预测模型数据. 最后, 采用观测数据修正模型数据, 达到平滑数据噪声的效果. 实验结果表明, 对于BC500耐候钢腐蚀增重数据, 用差分整合移动平均自回归(autoregressive integrated moving average, ARIMA)模型和随机森林(random forest, RF)模型进行腐蚀增重预测时, 经卡尔曼滤波降噪后, 决定系数$R^2$平均提升6.4%, 而经扩展卡尔曼滤波降噪后, $R^2$平均提升4.9%, 验证了本方法的有效性.

关键词: 卡尔曼滤波, 扩展卡尔曼滤波, 数据降噪, 小样本, 腐蚀数据

Abstract:

A small sample noisy data processing method based on Kalman filter and extended Kalman filter has been proposed. The core idea was to establish a system model using physical models or empirical formula, then used the system model to predict the model data, and finally used the observation data to correct the model data and achieve the effect of smoothing data noise. Experimental results showed that when using the autoregressive integrated moving average (ARIMA) model and random forest (RF) model to predict the corrosion weight gain of weather steel BC500, the coefficient of determination $R^{2}$ was increased by an average of 6.4% after Kalman filter denoising, while the $R^{2}$ was increased by an average of 4.9% after extended Kalman filter. These results verified the effectiveness of the proposed methods.

Key words: Kalman filter, extended Kalman filter, data denoising, small samples, corrosion data

中图分类号: