Segmentation of review texts by using thesaurus and corpus-based word similarity
|Segmentation of review texts by using thesaurus and corpus-based word similarity|
|Author(s)||Suzuki Y., Fukumoto F.|
|Published in||KEOD 2012 - Proceedings of the International Conference on Knowledge Engineering and Ontology Development|
|Keyword(s)||Reviews, Text segmentation, Thesaurus, Wikipedia (Extra: Corpus-based, Exponential growth, Short texts, Text segmentation, User reviews, Wikipedia, Word similarity, Word-pairs, Growth (materials), Hotels, Ontology, Reviews, Semantic Web, Thesauri, Knowledge engineering)|
|Article||BASE, CiteSeerX, Google Scholar|
|Web||Ask, Bing, Google (PDF), Yahoo!|
|Download and mirrors|
|Local copy||Not available|
|Remote mirror(s)||Not available|
|Export and share|
|BibTeX, CSV, RDF, JSON|
|Browse properties · List of conference papers|
Segmentation of review texts by using thesaurus and corpus-based word similarity is a 2012 conference paper written in English by Suzuki Y., Fukumoto F. and published in KEOD 2012 - Proceedings of the International Conference on Knowledge Engineering and Ontology Development.
Recently, we can refer to user reviews in the shopping or hotel reservation sites. However, with the exponential growth of information of the Internet, it is becoming increasingly difficult for a user to read and understand all the materials from a large-scale reviews that is potentially of interest. In this paper, we propose a method for review texts segmentation by guest's criteria, such as service, location and facilities. Our system firstly extracts words which represent criteria from hotel review texts. We focused on topic markers such as "ha" in Japanese to extract guest's criteria. The extracted words are classified into classes with similar words. The classification is proceeded by using Japanese WordNet. Then, for each hotel, each text with all of the guest reviews is segmented into word sequence by using criteria classes. Review text segmentation is difficult because of short text. We thus used Japanese WordNet, extracted similar word pairs, and indexes of Wikipedia. We performed text segmentation of hotel review. The results showed the effectiveness of our method and indicated that it can be used for review summarization by guest's criteria.
- This section requires expansion. Please, help!
Probably, this publication is cited by others, but there are no articles available for them in WikiPapers.