Total views : 104
Dialect Identification of Assamese Language using Spectral Features
Objective: Accurate dialect identification technique helps in improving the speech recognition systems that exist in most of the present day electronic devices and is also expected to help in providing new services in the field of e-health and telemedicine which is especially important for older and homebound people. Methods: In this paper we have developed the speech corpora needed for the dialect identification purpose and described a method to identify Assamese language (an Indian language) and two of its dialects, viz., Kamrupi and Goalparia. Findings: Research work done on dialect identification is relatively much less than that on language identification for which one of the reasons being dearth of sufficient database on dialects. As mentioned, we have developed the database and then Mel-Frequency Cepstral Coefficient has been used to extract the spectral features of the collected speech data. Two modeling techniques, namely, Gaussian Mixture Model and Gaussian Mixture Model with Universal Background Model have been used as the modeling techniques to identify the dialects and language. Novelty: So far, standard speech database for Assamese dialects that can be used for speech processing research has not been made. In this paper, we not only describe a method to identify dialects of the Assamese language, but have also developed the speech corpora needed for the dialect identification purpose.
Assamese, Gaussian Mixture Model, Gaussian Mixture Model with Universal Background Model, Goalparia, Kamrupi, Mel-Frequency Cepstral Coefficient.
- Li H, Ma B, Lee AK. Spoken language recognition: From fundamentals to practice. Proceedings of the IEEE. 2013; 101(5):1136–59. Crossref
- Etman A, Louis BAA. Language and dialect identification: A survey. Proceedings of SAI Intelligent Systems Conference; 2015. p. 220–31. Crossref
- Torres-Carraquillo PA, Gleason TP, Reynolds DA. Dialect identification using Gaussian Mixture Models Proceedings of the Speaker and Language Recognition Workshop; 2004. p. 41–4.
- Chan FN, Shan W, Campbell JP. A linguisticallyinformative approach to dialect recognition using Dialect-Discriminating Context-Dependent Phonetic Models. Proceedings of the IEEE International Conference on Acoustic Speech Signal Processing; 2010. p. 5014–7.
- Torres-Carrasquillo PA, Sturim D, Reynolds D, McCree A. Eigen-Channel Compensation and Discriminatively Trained Gaussian Mixture Models for dialect and accent recognition. Proceedings of the INTERSPEECH; 2008. p. 723–6.
- Sengupta D, Saha G. Identification of the major language families of India and evaluation of their mutual influence. Current Science. 2016; 110(4):667–81. Crossref
- Jothilakshmi S, Ramalingam V, Palanivil S. A hierarchical language identification system for Indian languages. Digital Signal Processing. 2012; 22(3):544–53. Crossref
- Reddy VR, Maity S, Rao KS. Identification of Indian languages using Multi-Level Spectral and Prosodic Features. International Journal of Speech Technology. 2013; 16(4):489–511. Crossref
- Balleda J, Murthy HA, Nagarajan T. Language identification from short segments of speech Proceedings of the INTERSPEECH; 2000. p. 1033–6.
- Manwani N, Mitra SK, Joshi MV. Spoken language identification for Indian languages using split and merge EM Algorithm. Ghosh A, De RK, Pal SK. Pattern Recognition and Machine Intelligence. Editions. PReMI 2007. Lecture Notes in Computer Science. 2007; 4815:463–8. Crossref
- Mohanty S. Phonotactic model for spoken language identification in Indian language perspective. International Journal of Computer Applications. 2011; 19(9):18–24. Crossref
- Roy P. Language recognition of three Indian languages based on clustering and supervised learning. Proceedings of the International Conference on Computer ApplicationsTelecommunications; 2010. p. 77–82. Crossref
- Rao KS, Koolagudi SG. Identification of Hindi dialects and emotions using spectral and prosodic features of speech. Journal of Systemics Cybernetics and Informatics. 2011; 9:24–33.
- Dutta M, Patgiri C, Sarma M, Sarma K. Closed-set text-independent speaker identification system using multiple ANN classifiers. Proceedings of the 3rd International Conference on Frontiers of Intelligent Computing: Theory and Applications; 2014. p. 377–85. PMid:25471364
- Language Information Service (LIS) – India.
- Goswami U. A study on Kamrupi: A dialect of Assamese. 1st edition. Department of Historical Antiquarian Studies. Government of Assam: Gauhati; 1970.
- Goswami U. An introduction to Assamese. Mani-Manik Prakash: Gauhati; 1978.
- Medhi K. Assamese grammar and origin of the Assamese language. 1st edition. Publication Board. Government of Assam: Guwahati; 1988.
- Reynolds DA. An overview of automatic speaker recognition technology. Proceedings of International Conference on Acoustics Speech and Signal Processing. 2002; 4:4072–5. Crossref, Crossref
- Patil HA, Basu TK. Development of speech corpora for speaker recognition research and evaluation in Indian languages. International Journal of Speech Technology. 2008; 11:17–32. Crossref
- Kersta LG. Voiceprint classification. Journal of the Acoustical Society of America. 1965; 37:1217. Crossref
- Junqua JC, Fincke SC, Field K. The Lombard effect: A reflex to better communicate with other in noise. Proceedings of the IEEE International Conference on Acoustic Speech Signal Processing; 1999. p. 2083–6. Crossref
- Gish H, Schmidt M. Text-independent speaker identification. IEEE Signal Processing Magazine. 1994; 11(4):18–32. Crossref
- Colleen Richey. Speech Corpora; 2000.
- Shrawanka U, Thakare V. Techniques of feature extraction in speech recognition system: A comparative study. International Journal of Computer Applications in Engineering Technology and Sciences. 2010; 2:412–8.
- Nour-Eddine L, Abdelkader A. GMM-based Maghreb dialect identification system. Journal of Information Processing Systems. 2015; 11(1):22–38.
- Muthusamy YK, Jain N, Cole RA. Perceptual benchmarks for automatic language identification. Proceedings of the IEEE International Conference on Acoustic Speech Signal Processing; 1994. p. 333–6. Crossref
- There are currently no refbacks.
This work is licensed under a Creative Commons Attribution 3.0 License.