Symbolic representation of text documents
|Symbolic representation of text documents|
|Author(s)||Guru D.S., Harish B.S., Manjunath S.|
|Published in||COMPUTE 2010 - The 3rd Annual ACM Bangalore Conference|
|Keyword(s)||Classification, Pattern recognition, Symbolic representation, Text classification, Text document (Extra: Data sets, Feature vectors, Interval-valued, Novel methods, Symbolic features, Symbolic representation, Text classification, Text document, Wikipedia, Character recognition, Information retrieval systems, Text processing)|
|Article||BASE, CiteSeerX, Google Scholar|
|Web||Ask, Bing, Google (PDF), Yahoo!|
|Download and mirrors|
|Local copy||Not available|
|Remote mirror(s)||Not available|
|Export and share|
|BibTeX, CSV, RDF, JSON|
|Browse properties · List of conference papers|
Symbolic representation of text documents is a 2010 conference paper written in English by Guru D.S., Harish B.S., Manjunath S. and published in COMPUTE 2010 - The 3rd Annual ACM Bangalore Conference.
This paper presents a novel method of representing a text document by the use of interval valued symbolic features. A method of classification of text documents based on the proposed representation is also presented. The newly proposed model significantly reduces the dimension of feature vectors and also the time taken to classify a given document. Further, extensive experimentations are conducted on vehicles-wikipedia datasets to evaluate the performance of the proposed model. The experimental results reveal that the obtained results are on par with the existing results for vehicles-wikipedia dataset. However, the advantage of the proposed model is that it takes relatively a less time for classification as it is based on a simple matching strategy.
- This section requires expansion. Please, help!
Probably, this publication is cited by others, but there are no articles available for them in WikiPapers.