Total views : 250
Hindi Vowel Classification using QCN-PNCC Features
This paper present a novel hybridized QCN-PNCC features. These features are obtained by processing Power Normalized Cepstral Coefficients (PNCC) with Quantile based Dynamic Cepstral Normalization Technique (QCN). The robustness of the QCN-PNCC features is compared with PNCC features for the task of Hindi Vowel classification with HMM classifier for Context-Dependent and Context- Independent cases in clean as well as in noisy environment. It is observed that the recognition accuracy of QCN-PNCC features with Hidden Markov Model (HMM) as classifier exhibit an improvement of approximately 8% as compared to PNCC features for Hindi vowel classification task.
Power normalized Cepstral Coefficient (PNCC), QCN, QCN-PNCC, Speech Recognition.
- Harvilla MJ, Stern RM. Histogram-based sub band power warping and spectral averaging for robust speech recognition under matched and multistyle training. IEEE International Conference on Acoustics, Speech Signal Processing; 2012 May.
- Boˇril H. Robust speech recognition: Analysis and equalization of lombard effect in czech corpora, Ph.D. Thesis, Czech Technical University in Prague, Czech Republic; 2008.
- Kim C, Stern RM. Feature extraction for robust speech recognition using a power-law nonlinearity and power-bias subtraction. INTERSPEECH-2009; 2009 Sep; p. 28–31.
- Kim C, Stern RM. Feature extraction for robust speech recognition based on maximizing the sharpness of the power distribution and on power ﬂooring. IEEE International Conference on Acoustics, Speech, and Signal Processing; 2010 Mar. p. 4574–7.
- Kelly F, Harte N. A comparison of auditory features for robust speech recognition. EUSIPCO-2010; 2010 Aug. p. 1968–72.
- Kelly F, Harte N. Auditory features revisited for robust speech recognition. International Conference on Pattern Recognition. 2010 Aug; p. 4456– 9.
- Shipra, Chandra M. Hindi vowel classification using QCN-MFCC features. Perspectives in Science. 2016 Sep; 8:28–31. DOI: dx.doi.org/10.1016/j.pisc.2016.01.010.
- Rabiner LR. A Tutorial on hidden Markov models and selected applications in speech recognition. Proceedings of the IEEE. 1989; 77(2):257–85.
- Samudravijaya K, Rao PVS, Agrawal SS. Hindi speech database. International Conference on Spoken Language Processing (ICSLP00). Beijing; 2002. p. 456–9.
- Biswas A, Sahu P, Chandra M. Admissible wavelet packet features based on human inner ear frequency response for Hindi consonant recognition. Computers and Electrical Engineering. 2014; 40(4):1111–22.
- Biswas A, Sahu P, Bhowmick A, Chandra M. Feature extraction technique using ERB like wavelet sub-band periodic and aperiodic decomposition for TIMIT phoneme recognition. International Journal of Speech Technology. 2014; 17:389–99.
- There are currently no refbacks.
This work is licensed under a Creative Commons Attribution 3.0 License.