上海大学学报(自然科学版) ›› 2025, Vol. 31 ›› Issue (6): 915-930.doi: 10.12066/j.issn.1007-2861.2483

• 通信工程 •    下一篇

基于增强标签相关性矩阵的不完备多标签学习

许智磊, 黄睿   

  1. 上海大学 通信与信息工程学院, 上海 200444
  • 收稿日期:2022-10-14 出版日期:2025-12-31 发布日期:2025-12-31
  • 通讯作者: 黄睿(1976-),女,副教授,博士,研究方向为模式识别与智能信息处理. E-mail:huangr@shu.edu.cn

Multi-label learning with incomplete labels via augmented label correlation matrix

XU Zhilei, HUANG Rui   

  1. School of Communication and Information Engineering, Shanghai University, Shanghai 200444, China
  • Received:2022-10-14 Online:2025-12-31 Published:2025-12-31

摘要: 在多标签学习中,人工标注标签的主观性和不稳定性往往造成标签缺失,无法形成完备的标签空间,从而对监督学习算法的训练产生误导.标签相关性可在一定程度上弥补缺失标签对算法分类性能造成的不利影响.但缺失标签也会导致对标签相关性的估计不准确.针对该问题,提出一种增强标签相关性矩阵的不完备多标签学习(multi-label learning with incomplete labels via augmented label correlation matrix,ML-ALC)方法.首先,通过拉普拉斯映射构造数据的低维流形;然后,使用标签向量计算原始标签相关矩阵;接着,构造一个校正矩阵对原始标签相关矩阵进行增强,并通过回归系数矩阵和增强标签相关性矩阵将原始特征空间和标签空间分别映射到低维流形;最后,经过迭代学习获得优化的回归系数矩阵和增强标签相关性矩阵,并应用于多标签分类.实验结果表明,ML-ALC方法的分类性能优于其他针对缺失标签的多标签分类方法.

关键词: 不完备多标签学习, 缺失标签, 标签相关性, 低维流形

Abstract: In multi-label learning, the subjectivity and instability of manual labeling often result in the absence of partial class labels and an incomplete label space. Missing labels are likely to be misleading in the training of supervised learning algorithms. The use of label correlation can alleviate the negative effect of missing labels on the algorithm classification performance to a certain extent. However, missing labels can lead to inaccurate estimations of label correlations. To address this problem, a method called multi-label learning with incomplete labels using an augmented label correlation matrix (ML-ALC) was proposed. First, a low-dimensional manifold of the data is constructed using Laplace mapping, and the original label correlation matrix is calculated using the label vectors. Subsequently, a correction matrix is constructed to augment the original label correlation matrix, and the original feature space and label space are mapped to the low-dimensional manifold through a regression coefficient matrix and augmented label correlation matrix, respectively. Finally, the optimized regression coefficient matrix and augmented label correlation matrix are obtained through iterative learning and applied to multi-label classification. The experimental results demonstrate that the proposed algorithm performs better than other state-of-the-art methods in multi-label learning with incomplete labels.

Key words: multi-label learning with incomplete labels, missing labels, label correlation, low-dimensional manifold

中图分类号: