Abstract
The proposed technique measures the voice intensity of an utterance by calculating the area under a curve. The utterance is divided into 20-millisecond frames, and peaks are extracted from each frame. The curve is obtained by normalizing the cubic polynomial fitted through these peaks. Simpson's rule is used to calculate the area under the curve, and a support vector machine (SVM) uses this area to classify the speaker's gender. Because the technique relies on a single one-dimensional feature, the area of the utterance, it is efficient in both time and computation. To validate the technique, the paper examines whether it works across different natural languages, whether it is independent of the recording equipment, whether any text can be used for classification, and how biased it becomes when the system is trained on unequal numbers of male and female speakers. A promising classification rate of 98.27% is achieved.
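
The feature extraction described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation, and several details are assumptions the abstract does not specify: one peak (the maximum absolute amplitude) is taken per frame, the time axis is rescaled to [0, 1], and the fitted curve is normalized by its maximum absolute value. The function names `frame_peaks`, `simpson_area`, and `utterance_area` are hypothetical.

```python
import numpy as np


def frame_peaks(signal, sample_rate, frame_ms=20):
    """Split the signal into fixed-length frames and return one peak per frame.

    Assumption: the peak is the maximum absolute amplitude in the frame.
    Trailing samples that do not fill a whole frame are discarded.
    """
    frame_len = int(sample_rate * frame_ms / 1000)
    n_frames = len(signal) // frame_len
    frames = np.abs(np.asarray(signal[: n_frames * frame_len], dtype=float))
    return frames.reshape(n_frames, frame_len).max(axis=1)


def simpson_area(y, dx):
    """Composite Simpson's rule for equally spaced samples.

    With an even number of samples, Simpson's rule covers all but the
    last interval, which is handled by a single trapezoid.
    """
    y = np.asarray(y, dtype=float)
    n = len(y)
    if n < 3:
        raise ValueError("need at least three samples")
    if n % 2 == 0:
        return simpson_area(y[:-1], dx) + dx * (y[-2] + y[-1]) / 2.0
    odd = y[1:-1:2].sum()   # interior samples weighted by 4
    even = y[2:-1:2].sum()  # interior samples weighted by 2
    return dx / 3.0 * (y[0] + y[-1] + 4.0 * odd + 2.0 * even)


def utterance_area(signal, sample_rate, frame_ms=20):
    """Return the one-dimensional feature: area under the normalized cubic."""
    peaks = frame_peaks(signal, sample_rate, frame_ms)
    if len(peaks) < 4:
        raise ValueError("need at least four frames to fit a cubic")
    # Assumption: rescale time to [0, 1] so utterance length does not
    # dominate the area.
    x = np.linspace(0.0, 1.0, len(peaks))
    coeffs = np.polyfit(x, peaks, deg=3)      # cubic fitted through the peaks
    curve = np.polyval(coeffs, x)
    curve = curve / np.max(np.abs(curve))     # assumed normalization
    return simpson_area(curve, dx=x[1] - x[0])
```

In such a setup, the scalar returned by `utterance_area` would be computed for each labelled utterance and passed as the only feature to an SVM classifier. The choice of kernel and training parameters is not given in the abstract.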
Original language | English |
---|---|
Pages | 552-555 |
Number of pages | 4 |
Publication status | Published (in print/issue) - 1 Jun 2012 |
Event | 2012 19th International Conference on Systems, Signals and Image Processing, IWSSIP 2012 - Vienna, Austria. Duration: 11 Apr 2012 → 13 Apr 2012 |
Conference
Conference | 2012 19th International Conference on Systems, Signals and Image Processing, IWSSIP 2012 |
---|---|
Country/Territory | Austria |
City | Vienna |
Period | 11/04/12 → 13/04/12 |
Keywords
- Area under the curve
- Simpson's Rule
- SVM
- TIMIT
- Voice Intensity