XML Indexing Techniques for Handling Large Amounts of Data

Affiliations

  • Korea Information and Communication Industry Institute, Seoul, Republic of Korea
  • Republic of Korea Army, Republic of Korea
  • Department of Computer, Dong Shin University, Republic of Korea

Abstract


Objectives: This paper presents an effective XML indexing technique for the input/output processing and retrieval of large amounts of data, with the aim of exchanging structured documents over the Internet.

Methods/Statistical Analysis: Today, large volumes of data are produced in various forms by Web media, the Internet of Things, and social media. However, these data are too varied in type and too large to be analyzed, searched, and rapidly read and written with a traditional relational database system. In this paper, we measure document search speed and document write speed in order to assess the performance of an XML information retrieval and management system that uses an index model. In many cases an XML document contains hundreds of millions of distinct elements. In the keyword-based approach, every element in the document must be tagged to enable quick search, because a keyword query carries no structural information about the XML document.

Findings: XML is a widely used markup language on the Internet, and a large amount of data is produced in XML format. In this paper, we propose two search methods for searching large amounts of data efficiently with XML indexing. First, existing studies show that keyword search over large XML documents has a problem: its actual performance is lopsided. A Content Model is proposed to solve this for both unstructured and structured data processing. For unstructured data, a NoSQL View is coupled to the data so that it can be processed and used properly, and so that user interfaces on a variety of devices can process it rapidly. The data can be used on N-Screen and similar platforms from various mobile devices without a viewer, which is also advantageous for the processing tasks used in a wide variety of services.

Improvements: This work can also be extended with other search algorithms for handling large amounts of data.
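The keyword-based approach mentioned in the abstract can be illustrated with a minimal sketch. It is not the index model evaluated in the paper, whose details are not given here; it only assumes a simple inverted index that maps each keyword to the positional paths of the elements containing it. The sample document and the names `build_index` and `search` are hypothetical.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# Hypothetical sample document, used only for illustration.
SAMPLE = """<library>
  <book id="b1"><title>XML Indexing</title><author>Kim</author></book>
  <book id="b2"><title>Big Data Storage</title><author>Lee</author></book>
</library>"""


def build_index(xml_text):
    """Map each lowercase keyword to the set of element paths whose own text contains it."""
    root = ET.fromstring(xml_text)
    index = defaultdict(set)

    def visit(elem, path):
        # Index only the text directly inside this element (not descendant or tail text).
        for word in (elem.text or "").lower().split():
            index[word].add(path)
        # Number same-named siblings so that every path identifies one element.
        counts = defaultdict(int)
        for child in elem:
            counts[child.tag] += 1
            visit(child, f"{path}/{child.tag}[{counts[child.tag]}]")

    visit(root, f"/{root.tag}")
    return index


def search(index, keyword):
    """Return the sorted element paths that contain the keyword (case-insensitive)."""
    return sorted(index.get(keyword.lower(), ()))


index = build_index(SAMPLE)
print(search(index, "xml"))  # ['/library/book[1]/title[1]']
```

Because the query is a bare keyword with no structural hint, every element has to be visited once at indexing time; the lookup itself is then a single dictionary access. A production index over hundreds of millions of elements would need on-disk storage and a more compact element labeling scheme, which this sketch does not attempt.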

Keywords

Big-Data, Indexing, I/O processing, XML.

Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.