Total views : 406

Classification Algorithms of Data Mining

Affiliations

  • Department of Computer Science and Engineering, SRM University, Kattankulathur, Chennai - 603203, Tamil Nadu, India

Abstract


Objectives: To make a comparative study about different classification techniques of data mining. Methods: In this paper some data mining techniques like Decision tree algorithm, Bayesian network model, Naive Bayes method, Support Vector Machine and K-Nearest neighbour classifier were discussed. Findings: Each algorithm has its own advantages and disadvantages. Decision tree technique do not perform well if the data have smooth boundaries. The Naive Bayesian classifier works with both continuous and discrete attributes and works well for real time problems. This method is very fast and highly scalable. The drawback of this technique is when a data set which has strong dependency among the attribute is considered then this method gives poor performance. KNN can perform well in many situations and it particularly suits well for multi-model classes as well as applications in which an object can have many labels. The drawback of KNN is it involves lot of computation and when the size of training set taken is large then the process will become slow. Support vector machine suites well when the data need to be classified into two groups. Application: This paper is to provide a wide range of idea about different classification algorithms..

Keywords

Bayesian Network, Data Mining, Decision Tree, K-Nearest Neighbour Classifier, Naive Bayes, Support Vector Machine.

Full Text:

 |  (PDF views: 699)

References


  • Tan P-N, Steinbach M, Kumar V. Introduction to data mining. Pearson New International Edition; 2006 Mar.
  • Han J, Kamber M, Pei J. Data mining: Concepts and techniques. 3rd ed. Morgan Kaufmann Publishers; 2011 Jul.
  • Gao Y-Y, Ren N-P. Data mining and analysis of our agriculture based on the decision tree. ISECS International Colloquium on Computing, Communication, Control, and Management. 2009; 2:134–8.
  • Goltz E, Arcoverde GFB, de Aguiar DA, Rudorff BFT, Maeda EE. Data mining by decision tree for object oriented classification of the sugar cane cut kinds. IEEE International Geoscience and Remote Sensing Symposium. 2009; 5:v405–8.
  • Zhang L, Zhang J, Sun Y. The construction and application of Bayesian network in data mining. 6th International Conference on Information Management, Innovation Management and Industrial Engineering IEEE. 2013; 3:501–3.
  • Patankar B, Chavda V. A comparative study of decision tree, Naive Bayesian and KNN classifier in data mining. International Journal of Advanced Research in Computer Science and Software Engineering. 2014 Dec; 4(12).
  • Jiang L, Zhang H, Su J. Learning K-Nearest Neighbour Naive Bayes for ranking. Proceedings of First International Conference ADMA 2005, Wuhan: China; 2005 Jul 22–24. p. 175–85
  • Su J, Zhang H. Full Bayesian network classifiers. Proceedings of the 23rd International Conference on Machine Learning (ICML) ACM; 2006. p. 897–904.
  • Gao S, Li H. Breast cancer diagnosis based on Support vector machine. 2012 International Conference on Uncertainity Reasoning and Knowledge Engineering IEEE; 2012. p. 240–3.
  • Rajeswari V, Arunesh K. Analysing soil data using data mining classification techniques. Indian Journal of Science and Technology. 2016 May; 9(19).
  • Purusothaman G, Krishnakumari P. A survey of data mining techniques on risk prediction: Heart disease. Indian Journal of Science and Technology. 2015 Jun; 8(12).
  • Vedanayaki M. A study of data mining and social network analysis. Indian Journal of Science and Technology. 2014 Nov; 7(S7):185–7.

Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.