Total views : 195

Effect of Statistical POS Tagger on Syntactic Analysis of Punjabi Sentences

Affiliations

  • Department of Computer Science and Applications, DAV University, Jalandhar – 144012, Punjab, India

Abstract


Objectives: In this research article, author has explored the effect of statistics based part of speech tagger on the syntactic analysis of Punjabi sentences. Methods/Statistical Analysis: To study the effect of statistical POS tagger on the syntactic analysis of Punjabi sentence, author performed two experiments; first a rule based POS tagger is used for syntactic analysis and second this rule based POS tagger is replaced with HMM based statistical POS tagger. An annotated corpus of 20,000 words has been used to train the HMM based POS tagger. Findings: The system is tested on three types of errors; first subject/object and verb agreement error second noun and modifier agreement (in attributed form) error and third modifier and noun agreement error. On using HMM based POS tagger, the system shows a precision of 80.67 for subject/object and verb agreement error whereas on using rule based POS tagger the system shows a precision of 72.81. Similarly for noun and modifier agreement (in attributed form) error, author claims a precision of 82.45 on using HMM based tagger whereas on using rule based tagger, the precision is 76.00. And in case of modifier and noun agreement error, a precision of 97.56 is claimed by the author by using HMM based tagger which was 95.45 when rule based POS tagger is used. Application/Improvements: The result indicates that the grammar checker performs better when rule based POS tagger is replaced with statistics based POS tagger.

Keywords

Punjabi Sentences, POS Tagger, Syntactic Analysis.

Full Text:

 |  (PDF views: 169)

References


  • Martin JH, Jurafsky D. Speech and Language Processing. International Edition. 2000.
  • Allen J. Natural Language Understanding. 1995.
  • Gill MS, Singh M. Development of a Punjabi grammar checker [PhD thesis]. Patiala: Punjabi University; 2008.
  • Gill MS, Lehal GS, Joshi SS. A punjabi grammar checker. IJCNLP. 2008. p. 940-4.
  • Gill MS, Lehal GS. A grammar checking system for Punjabi. In 22nd International Conference on Computational Linguistics: Demonstration Papers; 2008. p. 149-52.
  • Gill MS, Lehal GS, Joshi SS. Part of speech tagging for grammar checking of Punjabi. The Linguistic Journal. 2009 May; 4(1):6-21.
  • Blunsom P. Hidden Markov Models. Technical Report. 2000 8. Sharma SK, Lehal GS. Using hidden Markov model to improve the accuracy of Punjabi POS tagger. IEEE International Conference Computer Science and Automation Engineering (CSAE); China. 2011. p. 697-701.

Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.