Journal of Shanghai University(Natural Science Edition)

• Articles • Previous Articles     Next Articles

Database Cluster Preprocessing with Principal Component Extraction

XU Jun,XIA Jiao-xiong,LI Qing   

  1. School of Computer Engineering and Science, Shanghai University,
    Shanghai 200072, China
  • Received:2007-04-07 Revised:1900-01-01 Online:2007-12-20 Published:2007-12-20
  • Contact: XU Jun

Abstract:

According to the principle of least relativity of the data object, database cluster preprocessing with principal component extraction (DCP-PCE) is proposed to reduce dimension of a high dimensional system. Cluster extraction is carried out with hierarchical principal component analysis. The projection on the most differentiation of the data object is defined as principal component, which can be proved to include all the original information of the data object sets. With the DCP-PCE, comprehensive coverage of variables and lower dimension of principal component are solved synchronously, dissimilarity and dimension of the data object sets are decreased, and clustering reduction of the data object sets are reached. By leading the clustering analysis into the preprocessing of data resource on the colleges and universities, the application example is given to illustrate the effectiveness for exploration model.

Key words: cluster preprocessing, data resource, database principal
component extraction,
principal component analysis