Research Articles

Multi-label label-specific feature selection based on graph Laplacian

Expand
  • School of Communication and Information Engineering, Shanghai University, Shanghai 200444, China

Received date: 2020-03-07

  Online published: 2020-06-19

Abstract

Multi-label feature selection, which can effectively removeredundant features and improve classification performance, has become an effective solution for the problem of "curse of dimensionality". However, existing multi-label feature selection methods select the same features for all labels without considering the intrinsic relation between labels and features. In fact, each label has label-specific features that reflect the specific attributes of the label. A feature selection method called multi-label label-specific feature selectionbased on graph Laplacian (LSGL) is proposed in this study. LSGL first obtains alow-dimensional embedding of instances for each class label based on Laplacianeigenmaps. Next, it obtains a projection matrix that can project samples from adata space to manifold embedding space through sparse regularization. It thendetermines the label-specific features of the corresponding class label bycoefficient analysis of the matrix. Finally, the label-specific featuresare used for classification. Experimental results of multi-label featureselection andclassification on five public multi-label datasets showed the effectiveness of the proposed algorithm.

Cite this article

WU Zhejun, HUANG Rui . Multi-label label-specific feature selection based on graph Laplacian[J]. Journal of Shanghai University, 2022 , 28(2) : 281 -290 . DOI: 10.12066/j.issn.1007-2861.2243

References

[1] Zhang M L, Zhou Z H. A review on multi-label learning algorithms[J]. IEEE Transactions on Knowledge and Data Engineering, 2014, 26(8): 1819-1837.
[2] 余鹰. 多标记学习研究综述[J]. 计算机工程与应用, 2015, 51(17): 20-27.
[3] Wang C, Yan S C, Zhang L, et al. Multi-label sparse coding for automatic image annotation[C]// IEEE Conference on Computer Vision and Pattern Recognition. 2009: 1643-1650.
[4] Li L, Wang M X, Zhang L K, et al. Learning semantic similarity for multi-label text categorization[C]// Chinese Lexical Semantics Lecture Notes in Computer Science. 2014: 260-269.
[5] Li T, Ogihara M. Toward intelligent music information retrieval[J]. IEEE Transactions on Multimedia, 2006, 8(3): 564-574.
[6] Alalga A, Benabdeslem K, Taleb N. Soft-constrained Laplacian score for semi-supervised multi-label feature selection[J]. Knowledge and Information Systems, 2016, 47(1): 75-98.
[7] Yu Y, Wang Y L. Feature selection for multi-label learning using mutual information and GA[C]// International Conference on Rough Sets and Knowledge Technology. 2014: 454-463.
[8] Ma Z G, Nie F P, Yang Y, et al. Web image annotation via subspace-sparsity collaborated feature selection[J]. IEEE Transactions on Multimedia, 2012, 14(4): 1021-1030.
[9] Chang X J, Nie F P, Yang Y, et al. A convex formulation for semi-supervised multi-label feature selection[C]// Twenty-Eighth AAAI Conference on Artificial Intelligence. 2014: 1171-1177.
[10] Zhang J, Luo Z M, Li C D, et al. Manifold regularized discriminative feature selection for multi-label learning[J]. Pattern Recognition, 2019, 95: 136-150.
[11] Tibshirani R. Regression shrinkage and selection via the LASSO[J]. Journal of the Royal Statistical Society Series B (Methodological), 1996, 58(1): 267-288.
[12] Nie F P, Huang H, Cai X, et al. Efficient and robust feature selection via joint $\ell$2, 1-norms minimization[C]// Advances in Neural Information Processing Systems 23: Conference on Neural Information Processing Systems. 2010: 1813-1821.
[13] Zhang M L, Wu L, LIFT: multi-label learning with label-specific features[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2015, 37(1): 1609-1614.
[14] Huang J, Li G R, Huang Q M, et al. Learning label specific features for multi-label classification[C]// IEEE International Conference on Data Mining. 2015: 181-190.
[15] 牟甲鹏, 蔡剑, 余孟池, 等. 一种基于标签相关性的类属属性多标签分类算法[J]. 计算机应用研究, 2020, 37(9): 2656-2658, 2673.
[16] Zhan W, Zhang M L. Multi-label learning with label-specific features via clustering ensemble[C]// IEEE International Conference on Data Science and Advanced Analytics (DSAA). 2017: 129-136.
[17] Huang J, Li G R, Huang Q M, et al. Joint feature selection and classification for multilabel learning[J]. IEEE Transactions on Cybernetics, 2018, 48(3): 876-899.
[18] Cai D, Zhang C Y, He X F. Unsupervised feature selection for multi-cluster data[C]// Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2010: 333-342.
[19] Chung F R K. Spectral graph theory[C]// Number 92 in Regional Conference Series in Mathematics. 1997: 10-20.
[20] Belkin M, Niyogi P. Laplacian eigenmaps and spectral techniques for embedding and clustering[J]. Advances in Neural Information Processing Systems, 2002, 14(6): 585-591.
[21] Ng A Y, Jordan M I, Weiss Y. On spectral clustering: analysis and an algorithm[C]// Advances in Neural Information Processing System 14: Conference on Neural Information Processing Systems. 2001: 849-856.
[22] Efron B, Hastie T, Tibshirani J R. Least angle regression[J]. The Annals of Statistics, 2004, 32(2): 407-451.
[23] Boutell M R, Luo J B, Shen X P, et al. Learning multi-label scene classification[J]. Pattern Recognition, 2004, 37(9): 1757-1771.
[24] Chang C C, Lin C J. LIBSVM: a library for support vector machines[J]. ACM Transactions on Intelligent Systems and Technology, 2011, 2(3): 1-27.
Outlines

/