Abstract
The performance of speaker identification systems has improved due to recent advances in speech processing techniques but there is still need of improvement in term of text-independent speaker identification and suitable modelling techniques for voice feature vectors. It becomes difficult for person to recognize a voice when an uncontrollable noise adds in to it. In this paper, feature vectors from speech are extracted by using mel-frequency cepstral coefficients and vector quantization technique is implemented through Linde-Buzo-Gray algorithm. Two purposeful speech databases with added noise, recorded at sampling frequencies 8000 Hz and 11025 Hz, are used to check the accuracy of the developed speaker identification system in non-ideal conditions. An analysis is also provided by performing different experiments on the databases that number of vectors in VQ codebook and sampling frequency influence the identification accuracy significantly.
Original language | English |
---|---|
Title of host publication | 2009 Third International Symposium on Intelligent Information Technology Application |
Publisher | IEEE Xplore |
Pages | 115-118 |
Number of pages | 4 |
ISBN (Print) | 978-0-7695-3859-4 |
Publication status | Published (in print/issue) - 31 Dec 2009 |
Keywords
- cepstral analysis
- signal sampling
- speaker recognition
- vector quantisation
- speaker identification
- MFCC feature
- speech processing
- voice feature vector
- mel-frequency cepstral coefficient
- vector quantization
- Linde-Buzo-Gray algorithm
- speech database
- VQ codebook
- sampling frequency
- frequency 8000 Hz
- frequency 11025 Hz
- Mel frequency cepstral coefficient
- Cepstral analysis
- Vector quantization
- Noise figure
- Sampling methods
- Hidden Markov models
- Working environment noise
- Speech recognition
- Speech enhancement
- Spatial databases
- Speaker identification
- MFCC
- VQ