Generation Of Extractive Summary Based On Document Semantics
Main Article Content
Article Sidebar
Abstract
In the recent years, significant research contribution and progress observed in developing methods for machines to understand concepts within documents. For machines a document represents language based information which consist of meaningful units known as data patterns or document units. These document units are the language’s verbs, adverbs, nouns, prepositions, etc. that contributes towards building the document. The current research activities in this field, is not just limited to picking some keywords to understand the document concepts but aims to gain a precise understanding of the concepts through correlation  of words and extracting sentences to obtain summaries. This would help in retrieving meaningful information and reducing the effort of going through the whole document to get its main insight.In our application, we use the Latent Semantic Analysis (LSA) algorithm for text summarization. The dataset is trained using the algorithm and a matrix is generated. This matrix gives us the correlation of words within documents. LSA uses the SVD to capture all correlations latent within a document by modelling relationships among words and sentences within the text.
Article Details
COPYRIGHT AGREEMENT AND AUTHORSHIP RESPONSIBILITY
 All paper submissions must carry the following duly signed by all the authors:
“I certify that I have participated sufficiently in the conception and design of this work and the analysis of the data (wherever applicable), as well as the writing of the manuscript, to take public responsibility for it. I believe the manuscript represents valid work. I have reviewed the final version of the manuscript and approve it for publication. Neither has the manuscript nor one with substantially similar content under my authorship been published nor is being considered for publication elsewhere, except as described in an attachment. Furthermore I attest that I shall produce the data upon which the manuscript is based for examination by the editors or their assignees, if requested.â€