Journal of Shanghai University(Natural Science Edition) ›› 2022, Vol. 28 ›› Issue (3): 492-503.doi: 10.12066/j.issn.1007-2861.2383

• Machine Learning • Previous Articles     Next Articles

Multi-modal data representation learning for ceramic coating materials

WU Xing1,2,3(), HU Mingtao1, DING Peng3,4   

  1. 1. School of Computer Engineering and Science, Shanghai University, Shanghai 200444, China
    2. Zhejiang Laboratory, Hangzhou 311100, Zhejiang, China
    3. Center of Materials Informatics and Data Science, Materials Genome Institute, Shanghai University, Shanghai 200444, China
    4. College of Sciences, Shanghai University, Shanghai 200444, China
  • Received:2022-03-28 Online:2022-06-30 Published:2022-05-27
  • Contact: WU Xing E-mail:xingwu@shu.edu.cn

Abstract:

Ceramic coatings have excellent temperature resistance, corrosion resistance, and wear resistance, among other advantages. Their thermal expansion coefficient and thermal conductivity are two properties directly related to their performance. To address the issues of high experimental costs and challenging test conditions, we propose a method to predict the performance of ceramic coating materials based on multimodal data representation learning. To enlarge the data set, this method uses the Gaussian mixture model virtual sample generation (GMMVSG) algorithm to generate samples that match the real ceramic-coating data distribution. The method extracts micro-structural image data's features using the very deep convolutional neural network VGG16, extracts structured data's features using TabNet, and fuses the features of the extracted image data with those of the structured data. the final prediction models based on three machine learning algorithms-K-nearest neighbor (KNN), support-vector-machine regression (SVR), and multi-layer perceptron(MLP)—are established by using multimodal data representation to predict the thermal expansion coefficient and thermal conductivity of the performance index of ceramic coatings. The experimental results show that the proposed multimodal-data representation-learning model has a better prediction performance than that of the single-modal-data machine-learning model, and that the former model based on the MLP can most accurately predict ceramic coating performance. In the test set, the mean absolute and mean square errors for the prediction of the thermal expansion coefficient are 0.026 6 and 0.001 7, respectively, and the mean absolute and mean square errors for the prediction of thermal conductivity are 0.017 9 and 0.000 7, respectively. Our proposed learning method for multimodal data representation of ceramic coating materials effectively combines structured and unstructured data to learn both types of modal data with potentially shared information and successfully improves the pred.

Key words: ceramic coatings, Gaussian mixture models, multimodal data representation, machine learning algorithm

CLC Number: