This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License
|
||||||||
|
Paper Details
Paper Title
A Survey on Information Retrieval Using Various Techniques
Authors
  Florence Vijila. S,  Nirmala. K
Abstract
Structured data, typically, is predefined data. Semi-structured and unstructured data are not predefined data that includes documents, emails, social media posts, images, videos, etc. Text extraction is a critical stage of analyzing Journal papers. Journal papers generally are in PDF format which is semi structured data. Journal papers are presented into different sections like Introduction, Methodology, Experimental, Result, Conclusion etc. It makes easy to analyze based on readers interested topic. The main importance on section extraction is to find a representative subset of the data, which contains the information of the entire set. To extract research papers, we can approach machine learning, NLP, etc. In this paper we present review of various extraction techniques from a PDF document. Data consolidation is used to combine the extracted data to obtain structured data from papers. This will make the knowledge extraction process easy to manage and analyze.
Keywords- Information extraction, Text Mining, NLP, Machine Learning Methods
Publication Details
Unique Identification Number - IJEDR1803062Page Number(s) - 347-351Pubished in - Volume 6 | Issue 3 | August 2018DOI (Digital Object Identifier) -    Publisher - IJEDR (ISSN - 2321-9939)
Cite this Article
  Florence Vijila. S,  Nirmala. K,   "A Survey on Information Retrieval Using Various Techniques", International Journal of Engineering Development and Research (IJEDR), ISSN:2321-9939, Volume.6, Issue 3, pp.347-351, August 2018, Available at :http://www.ijedr.org/papers/IJEDR1803062.pdf
Article Preview
|
|
||||||
|