Total views : 329

Quantitative Evaluation of Web user Session Dissimilarity measures using Medoids based Relational Fuzzy clustering

Affiliations

  • Department of Computer Science and Engineering, National Institute of Technology Raipur – 492010, Chhattisgarh, India
  • Department of Electronics and Telecommunication, National Institute of Technology, Raipur - 492010, Chhattisgarh,, India
  • Department of Information Technology, Indian Institute of Information Technology, Allahabad - 211011, Uttar Pradesh, India

Abstract


Background/Objectives: Proficient relational clustering of web users’ sessions not only depends on clustering algorithm’s character but also profoundly influenced by the used dissimilarity measures. Therefore, determining the right dissimilarity measure to capture the actual access behaviour of the web user is imperative for the significant clustering.Methods: In this paper, the concept of an augmented session is used to derive different augmented session dissimilarity measures. The quantitative performance evaluation of different session dissimilarity measures are performed using a relational fuzzy c-medoid clustering approach. The intra-cluster and inter-cluster distance based cluster quality ratio is used for performance evaluation. Findings: The experimental results demonstrated that augmented web user session dissimilarity in general, and intuitive augmented session dissimilarity, in particular, performed better than the other dissimilarity measures. Improvements: It is argued that augmented session similarity measures are more realistic and represent session similarities based on the web user’s habits, interest, and expectations as compared to simple binary session similarity measures.

Keywords

Augmented user Sessions, Cluster Evaluation, Dissimilarity Measures, Fuzzy Clustering, Page Relevance, Web User Sessions.

Full Text:

 |  (PDF views: 257)

References


  • Lim M, Byun H, Kim J. A web usage mining for modeling buying behavior at a web store using network analysis. Indian Journal of Science and Technology. 2015; 8(25):1–7.
  • Guerbas A, Addam O, Zaarour O, Nagi M, Elhaj A, Ridley M, et al. Effective web log mining and online navigational pattern prediction. Knowledge-Based Systems. Elsevier B.V.; 2013; 49(12):50–62.
  • Mobasher B, Cooley R. Automatic Personalization Based on Web Usage Mining. Communications of the ACM. 2000; 43(8):142–51.
  • Krishnapuram R, Joshi A, Liyu Yi. A fuzzy relative of the k-medoids algorithm with application to web document and snippet clustering. IEEE International Fuzzy Systems Conference Proceedings (FUZZ-IEEE’99). IEEE; 1999. p. 1281–6.
  • Nasraoui O, Frigui H, Krishnapuram R, Joshi A. Extracting web user profiles using relational competitive fuzzy clustering. International Journal on Artificial Intelligence tools. 2000; 9(4):509–26.
  • Nasraoui O, Krishnapuram R, Anupam Joshi TK. Automatic web user profiling and personalization using robust fuzzy relational clustering. E-Commerce and Intelligent Methods. Physica-Verlag HD. 2002; 233–61.
  • Krishnapuram R, Joshi A, Nasraoui O, Yi L. Low-complexity fuzzy relational clustering algorithms for Web mining. IEEE Transactions on Fuzzy Systems. 2001; 9(4):595–607.
  • Nasraoui O, Cardona C. Mining evolving user profiles in noisy web clickstream data with a scalable immune system clustering algorithm. Proceedings of WebKDD. 2003; 71–81.
  • Yan TW, Jacobsen M, Garcia-Molina H, Dayal U. From user access patterns to dynamic hypertext linking. Computer Networks and ISDN Systems. 1996;28(7):1007–14.
  • Forsati R, Moayedikia A, Shamsfard M. An effective Web page recommender using binary data clustering. Information Retrieval Journal. Springer Netherlands. 2015; 18(3):167–214.
  • Chan PK. A non-invasive learning approach to building web user profiles. Proceedings of Workshop on Web Usage Analysis (KDD-99). 1999; 7–12.
  • Xiao J, Zhang Y. Clustering of web users using sessionbased similarity measures. In: International Conference on Computer Networks and Mobile Computing. 2001. p. 223–8.
  • Liu H, Keselj V. Combined mining of Web server logs and web contents for classifying user navigation patterns and predicting users’ future requests. Data and Knowledge Engineering. 2007; 61(2):304–30.
  • Vakali A, Pokorny J, Dalamagas T. An overview of web data clustering practices. Current Trends in Database WebKdd. Springer Berlin Heidelberg. 2004; 597–606.
  • Hay B, Wets G, Vanhoof K. Clustering navigation patterns on a website using a Sequence Alignment Method. Intelligent Techniques for Web Personalization, IJCAI. 2001; 1–6.
  • Li C, Lu Y. Similarity measurement of web sessions by sequence alignment. In: IFIP International Conference on Network and Parallel Computing Workshops, NPC 2007. IEEE Computer Society. 2007. p. 716–20.
  • Bose A, Beemanapalli K, Srivastava J, Sahar S. Incorporating concept hierarchies into usage mining based recommendations. Proceedings of the 8th Knowledge Discovery on the Web International Conference on Advances in Web Mining and Web usage Analysis. 2006. p. 110–26.
  • Yang Q, Kou J, Chen F, Li M. A new similarity measure for generalized web session clustering. Proceedings - Fourth International Conference on Fuzzy Systems and Knowledge Discovery(FSKD 2007). 2007. p. 278–82.
  • Xie Y, Phoha V V. Web user clustering from access log using belief function. Proceedings of the International Conference on Knowledge Capture - K-CAP 2001. 2001. p. 202.
  • Sisodia D, Verma S. Web usage pattern analysis through web logs: A review. International Joint Conference on Computer Science and Software Engineering (JCSSE). IEEE. 2012. p. 49–53.
  • Sisodia DS, Verma S, Vyas OP. Agglomerative approach for identification and elimination of web robots from web server logs to extract knowledge about actual Visitors. Journal of Data Analysis and Information Processing. 2015; 3(2):1–10.
  • Fernandez FMH, Ponnusamy R. Data preprocessing and cleansing in web log on ontology for enhanced decision making. Indian Journal of Science and Technology. 2016; 9(10):1–10.
  • Spiliopoulou M, Mobasher B, Berendt B, Nakagawa M. A framework for the evaluation of session reconstruction heuristics in web-usage analysis. INFORMS Journal on Computing. 2003; 15(2):171–90.
  • Sisodia DS, Verma S, Vyas OP. A comparative analysis of browsing behavior of human visitors and automatic software agents. American Journal of Systems and Software. 2015; 3(2):31–5.
  • Sisodia DS, Verma S, Vyas OP. Augmented intuitive dissimilarity metric for clustering of web user sessions. Journal of Information Science. 2016; 1–12. Doi: 101177/0165551516648259.
  • Huang A. Similarity measures for text document clustering. Proceedings of the Sixth New Zealand. 2008 Apr; 49–56.
  • Revathy S, Parvaathavarthini B, Rajathi S. Futuristic validation method for rough fuzzy clustering. Indian Journal of Science and Technology. 2015; 8(2):120–7.
  • Halkidi M, Batistakis Y, Vazirgiannis M. On clustering validation techniques. Journal of Intelligent Information Systems. 2001; 17(2-3):107–45.
  • Halkidi M, Batistakis Y, Vazirgiannis M. Cluster Validity Methods : Part I. ACM SIGMOD Record. 2002; 31(2):40–5.
  • Brun M, Sima C, Hua J, Lowey J, Carroll B, Suh E, et al. Model-based evaluation of clustering validation measures. Pattern Recognition. 2007; 40(3):807–24.
  • NASA_Aug95. NASA Kennedy space centre’s www server log data, Available at [Internet]. Available from: http://ita. ee.lbl.gov/html/contrib/NASA-HTTP.html
  • MATLAB(2012a). Software package [Internet]. Available from: http://www.mathworks.com.
  • Havens TC, Member S, Bezdek JC. An Efficient Formulation of the Improved Visual Assessment of Cluster Tendency ( iVAT ) Algorithm. IEEE Transactions on Knowledge and Data Engineering. 2012; 24(5):813–22.
  • Wang LA, Geng X, Bezdek J, Leckie C, Ramamohanarao K. iVAT and aVAT:Enhanced Visual Analysis for Cluster Tendency Assessment and Data Partitioning. Advances in

Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.