上海大学学报(自然科学版) ›› 2026, Vol. 32 ›› Issue (2): 261-269.doi: 10.12066/j.issn.1007-2861.2553

• 通信与信息工程 • 上一篇    

基于局部语义的遥感图像不完备多标签分类

欧寒芝, 黄睿   

  1. 上海大学 通信与信息工程学院, 上海 200444
  • 收稿日期:2023-12-10 发布日期:2026-05-11
  • 通讯作者: 黄睿(1976-), 女, 副教授, 博士, 研究方向为模式识别与智能信息处理. E-mail:huangr@shu.edu.cn

Incomplete multilabel classification of remote sensing images based on local semantics

OU Hanzhi, HUANG Rui   

  1. School of Communication and Information Engineering, Shanghai University, Shanghai 200444, China
  • Received:2023-12-10 Published:2026-05-11

摘要: 遥感图像包含丰富的地物信息,采用多标签学习方法更有利于对遥感图像进行准确分类.近年来,基于深度网络的遥感图像多标签分类已成为研究热点.深度学习方法通常需要在大型数据集上训练模型以获得更好的泛化能力,然而大规模数据集的类别标注费时费力,标注质量无法得到保证,可能会出现缺失标签的情况.标签信息的不完备会误导模型训练过程,影响深度模型分类性能.针对遥感图像的不完备多标签分类问题,提出一种基于局部语义(local semantics,LS)学习的分类方法.首先,通过语义表征学习(semanticrepresentation learning,SRL)模块提取输入图像的深度特征进行标签预测,获得伪标签;接着,通过局部语义学习模块捕获特征中的语义局部相关性,利用局部语义信息对伪标签进行加权增强;最后,以伪标签和增强伪标签的加权组合作为最终的预测标签,采用交叉熵损失函数对网络进行训练.遥感图像不完备多标签分类实验结果表明,与其他多标签分类方法相比,所提方法在不同标签缺失率下具有更好的标签预测性能.

关键词: 遥感图像, 多标签分类, 不完备多标签, 局部语义

Abstract: Multilabel learning is highly conducive to the accurate classification of remote sensing (RS) images that contain rich feature information, and the deep-learning-based multilabel classification of RS images has recently gained popularity owing to its exceptional performance. However, the categorical annotation of training samples in large-scale datasets is a time-consuming and labor-intensive task whose quality cannot be guaranteed, leading to the potential occurrence of missing labels. Incomplete label information can mislead the model training process, thereby negatively impacting classification performance. To address this problem, an incomplete multilabel classification method for RS images based on local semantics (LS) is proposed in this paper. First, a semantic representation learning (SRL) module is used to extract deep features from input images, which are then used to generate pseudo-labels. Next, a local semantic learning module is deployed to capture local semantic correlations within the features. These correlations are weighted and subsequently used to enhance the pseudo-labels. Finally, a weighted combination of pseudo-labels and enhanced pseudo-labels is used as the final predicted label, and the network is optimizes using the cross-entropy loss function. The experimental results demonstrate that the proposed method outperforms existing multilabel classification methods with different missing label rates.

Key words: remote sensing images, multilabel classification, incomplete multilabel, local semantics (LS)

中图分类号: