Total views : 170

Utility of Corpus based Approach in the Recognition of Opinionated Text


  • Department of Information Technology, Amity University, Noida - 201313, Uttar Pradesh, India


Objectives: The proposed work focuses on mining opinion word catalogue by using corpus based approach. The motive is to use different parts of speech to improve the classification of the sentiments. Methods/Statistical Analysis: The methodology involved in the proposed work incorporates both the sentiment orientation approach and machine learning approach. The various features like content – specific, content-free and other sentiment features have been used to classify the sentiments. The previous works in the field involved only some specific parts of speech, which have been replaced by the usage of nouns, adjectives, verbs and adverbs. In this approach an algorithm for calculation of sentiment feature has been proposed. Findings: The algorithm proposed in this work is more efficient in comparison to other existing work. In this work, since we have developed a corpus based approach amalgamating both the machine learning and semantic orientation approaches into a common skeleton, it improvises the classification method. Our projected method also incorporates the content-specific and content free features involved in the existing approaches. It also utilizes the infrequent and sentiment features in the semantic orientation approach. The proposed technique can be classified into three main modules: Acquiring of data, generation of features, followed by classification and evaluation. Application/ Improvements: The researches to be done in future can deal with other feature generation methods. Moreover the method can also be improved by making the modifications so that the feature classification can be done on quite large data sets. The method can further be implemented for multilingual languages to build a multilingual sentiment-based lexicon.


Corpus–based Approach, Opinionated Text, Sentiment Analysis, Sentiment Classification, Sentiment Features.

Full Text:

 |  (PDF views: 131)


  • Dash M, Liu H. Feature selection for classification. Intelligent Data Analysis. 1997 Mar; 1:131–56.
  • Kim SM, Hovy E. Determining the sentiment of opinions. Proceedings of the 20th International Conference on Computational Linguistics (COLING); USA. 2004 Jan. p. 1–8.
  • Pang B, Lee L, Vaithyanathan S. Thumbs up? Sentiment classification using machine learning techniques. Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), USA. 2002 Jul; 10:79–86.
  • Turney PD. Thumbs Up or Thumbs Down? Semantic orientation applied to unsupervised classification of reviews. Proceedings of the 40th Annual Meetings of the Association of Computational Linguistics; USA. 2002 Dec. p. 417–24.
  • Riloff E, Wiebe J. Learning extraction patterns for subjective expressions. Proceedings of the Conference on Empirical Methods in Natural Language Processing; 2003 Jul. p. 105–12.
  • Turney PD, Littman LM. Measuring praise and criticism: Inference of semantic orientation from association. ACM Transaction Information Systems. 2003 Oct; 21(4):315–46.
  • Razavi AH, Inkpen D, Matwin S, Uritsky S. Offensive language detection using multi-level classification. Springer Berlin Heidelberg; 2010 May-Jun. p. 16–27.
  • Wiebe J, Wilson T, Bruce R, Bell M, Martin M. Learning subjective language. Computational Linguistics. 2004 Sep; 30(3):277–308.
  • Abbasi A, Chen H. Applying authorship analysis to extremist-group web forum messages. IEEE Intelligent Systems. 2005 Sep; 20(5):67–75.
  • Using SentiWord Net for Multilingual Sentiment Analysis 2008. Available from:
  • Esuli A, Sebastiani F. Senti-WordNet: A publicly available lexical resource for opinion mining. Proceedings of the 5th Conference on Language Resources and Evaluation (LREC-06); 2006. p. 417–22.
  • Devitt A, Ahmad K. Sentiment polarity identification in financial news: A cohesion-based approach. Proceedings of 45th Annual Meeting of the Association of Computational Linguistics; 2007 Jun. p. 984–91.
  • Hatzivassiloglou V, Wiebe J. Effects of adjective orientation and gradability on sentence subjectivity. Proceedings of the 18th International Conference on Computational Linguistics, USA. 2000; 1:299–305.
  • Hu M, Liu B. Mining and summarizing customer reviews. Proceedings of the 10th ACM SIGKDD International Conference on Knowledge discovery and Data Mining; USA. 2004 Aug. p. 168–77.


  • There are currently no refbacks.

Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.