LDA-based topic modeling in labeling blog posts with wikipedia entries
|LDA-based topic modeling in labeling blog posts with wikipedia entries|
|Author(s)||Yokomoto D., Makita K., Suzuki H., Koike D., Utsuro T., Kawada Y., Fukuhara T.|
|Published in||Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)|
|Keyword(s)||Blog, LDA, Topic Analysis, Topic Model, Wikipedia (Extra: Blog, LDA, Topic analysis, Topic model, Wikipedia, Search engines, Statistics, Websites, Blogs)|
|Article||BASE, CiteSeerX, Google Scholar|
|Web||Ask, Bing, Google (PDF), Yahoo!|
|Download and mirrors|
|Local copy||Not available|
|Remote mirror(s)||Not available|
|Export and share|
|BibTeX, CSV, RDF, JSON|
|Browse properties · List of conference papers|
LDA-based topic modeling in labeling blog posts with wikipedia entries is a 2012 conference paper written in English by Yokomoto D., Makita K., Suzuki H., Koike D., Utsuro T., Kawada Y., Fukuhara T. and published in Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics).
Given a search query, most existing search engines simply return a ranked list of search results. However, it is often the case that those search result documents consist of a mixture of documents that are closely related to various contents. In order to address the issue of quickly overviewing the distribution of contents, this paper proposes a framework of labeling blog posts with Wikipedia entries through LDA (latent Dirichlet allocation) based topic modeling. More specifically, this paper applies an LDA-based document model to the task of labelling blog posts with Wikipedia entries. One of the most important advantages of this LDA-based document model is that the collected Wikipedia entries and their LDA parameters heavily depend on the distribution of keywords across all the search result of blog posts. This tendency actually contributes to quickly overviewing the search result of blog posts through the LDA-based topic distribution. In the evaluation of the paper, we also show that the LDA-based document retrieval scheme outperforms our previous approach.
- This section requires expansion. Please, help!
Probably, this publication is cited by others, but there are no articles available for them in WikiPapers. Cited 1 time(s)