Journal of Shanghai University(Natural Science Edition)

• Articles • Previous Articles     Next Articles

Language Identification Based on Minimum Distortion of Cepstrum Distance Segmentation

MIAO Wei, HOU Li-min

  

  1. School of Communication and Information Engineering, Shanghai University, Shanghai 200072, China
  • Received:2006-06-22 Revised:1900-01-01 Online:2007-04-30 Published:2007-04-30
  • Contact: HOU Li-min

Abstract:

We propose a novel approach to language identification. Generally speaking, an ideal language identification system needs a large number of speech transcriptions at the phoneme level for training the phone model, involving a huge amount of work and cost. In this project, we use a rough segmentation instead of transcription to produce sub-words, and a front-end sub-words recognizer for individual languages to be identified. This is followed by clustering the sub-words and creating an HMM for each cluster. Preliminary results on language identification are provided to demonstrate simplicity and effectiveness of this approach.

Key words: language identification, sub-words segmentation
,
hidden markov model (HMM)

CLC Number: