A new preprocessing phase for LSA-based Turkish text summarization
|Author(s)||Guran A., Bayazit N.G.|
|Published in||Lecture Notes in Electrical Engineering|
|Keyword(s)||Latent Semantic Analysis, Turkish Text Summarization, Turkish Wikipedia (Extra: Performance analysis, Pre-processing method, Preprocessing phase, Text summarization, Turkish texts, Wikipedia, Computer science, Semantics, Websites, Text processing)|
Text summarization is the process of identifying the most salient information in a document or a set of related documents. This paper analyzes the performance of a Turkish text summarization system that applies two algorithms based on Latent Semantic Analysis (LSA), each combined with different preprocessing phases. One of these phases, "Consecutive Words Detection", is a new method that uses Turkish Wikipedia links to represent related consecutive words as a single term; it improves summarization performance for Turkish text.
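The abstract does not spell out how "Consecutive Words Detection" is implemented, so the following is only a minimal sketch of the general idea: consecutive tokens that match a known multi-word phrase are merged into a single term before the term-sentence matrix for LSA is built. The function name `merge_consecutive_words`, the `max_len` parameter, the greedy longest-match strategy, and the hard-coded phrase list (standing in for phrases harvested from Turkish Wikipedia links) are all assumptions made for illustration, not details taken from the paper.

```python
def merge_consecutive_words(tokens, link_phrases, max_len=3):
    """Merge runs of consecutive tokens that match a known phrase.

    tokens       -- list of word tokens from one sentence
    link_phrases -- iterable of multi-word phrases (hypothetical stand-in
                    for phrases collected from Turkish Wikipedia links)
    max_len      -- longest phrase length, in tokens, that is tried
    """
    # Note: str.lower() is not Turkish-aware (dotted/dotless I); a real
    # system would need locale-specific case folding.
    phrases = {tuple(p.lower().split()) for p in link_phrases}
    merged = []
    i = 0
    while i < len(tokens):
        # Try the longest candidate first, down to two tokens.
        for n in range(min(max_len, len(tokens) - i), 1, -1):
            candidate = tuple(t.lower() for t in tokens[i:i + n])
            if candidate in phrases:
                merged.append("_".join(candidate))
                i += n
                break
        else:
            # No phrase starts here: keep the single token.
            merged.append(tokens[i].lower())
            i += 1
    return merged


# Illustrative phrase list; not data from the paper.
example_phrases = ["yapay zeka", "Mustafa Kemal Atatürk"]
example_tokens = ["Mustafa", "Kemal", "Atatürk", "yapay", "zeka", "konusu"]
example_terms = merge_consecutive_words(example_tokens, example_phrases)
```

With this kind of merging, a phrase such as "yapay zeka" would occupy one row of the term-sentence matrix instead of two unrelated rows, which is the effect the abstract attributes to treating related consecutive words as a single term.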