Total views : 158

Data Stream Classification using Random Forest and Very Fast Decision Tree

Affiliations

  • School of Computing, SASTRA University, Tirumalaisamudram, Thanjavur – 613401, Tamilnadu, India

Abstract


Objective: Data Stream Classification is a big problem. Ensemble based learning methods are used to tackle the data Stream classification problem. Method: Methods such as Random Forest and Very Fast Decision Tree (VFDT) for classification are used in the system. Findings: This hybrid approach is an Effective method for calculating hidden data and maintains accuracy in the large data set. It also calculates the neighbors between each pair of cases that can be used in clusters. It is used for finding unknown or (by scaling) gives informational views over the data. This hybrid approach achieves 85 % accuracy and result proves that the hybrid approach performs well when compared to other algorithms in terms of accuracy. This is the application where we can download data streams of any application at faster rate. Many methods are available to process data streams. But the proposed algorithm performs well when compared to other algorithms. Applications: Many real time data streams are downloaded and uploaded to test and train various Applications. Data streams are used in many applications such as medical applications and educational applications.

Keywords

Classification, Data Stream, Random Forest, Very Fast Decision Tree (VFDT).

Full Text:

 |  (PDF views: 200)

References


  • Zhang P, Zhu X, Shi Y, Guo L, Wu X. Robust ensemble learning for mining noisy data streams. Decision Support System. 2011 Jan; 50(2):469–79.
  • Lia P, Wua X, Hua X, Wanga H. Learning concept-drifting data streams with random ensemble decision trees. Neurocomputing. 2015 Oct; 166:68–83.
  • Aggarwal CC. Data streams: models and algorithms. Springer-Verlag New York; 2006.
  • Brzezinski D, Stefanowski J. Reacting to different types of concept drift: The accuracy updated ensemble algorithm. IEEE Transactions on Neural Networks and Learning Systems. 2014 Jan; 25(1):81–94.
  • Karthika S, Sairam N. A naive bayesian classifier for educational qualification. Indian Journal of Science and Technology. 2015 Jul; 8(16):1–5.
  • Farid D, Zhang L, Hossain A, Rahman CM, Strachan R, Sexton G, Dahal K. An adaptive ensemble classifier for mining concept drifting data streams. Expert Systems with Applications. 2013 Nov; 40(15):5895–906.
  • Shaker A, Hullermeier E. Instance Based Learning (IBL) streams: a system for instance-based classification and regression on data streams. Evolving Systems. 2013 Dec; 3(4):235–49.

Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.