Temporal latent semantic analysis for collaboratively generated content: Preliminary results
|Temporal latent semantic analysis for collaboratively generated content: Preliminary results|
|Author(s)||Wang Y., Agichtein E.|
|Published in||SIGIR'11 - Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval|
|Keyword(s)||Algorithm, Experimentation (Extra: Benchmark data, Document generation, Experimentation, Latent Semantic Analysis, NAtural language processing, Online forums, Question Answering, Temporal information, Tensor decomposition, Wikipedia, Algorithms, Benchmarking, Computational linguistics, Experiments, Information retrieval, Natural language processing systems, Semantics)|
|Article||BASE, CiteSeerX, Google Scholar|
|Web||Ask, Bing, Google (PDF), Yahoo!|
|Download and mirrors|
|Local copy||Not available|
|Remote mirror(s)||Not available|
|Export and share|
|BibTeX, CSV, RDF, JSON|
|Browse properties · List of conference papers|
Temporal latent semantic analysis for collaboratively generated content: Preliminary results is a 2011 conference paper written in English by Wang Y., Agichtein E. and published in SIGIR'11 - Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval.
Latent semantic analysis (LSA) has been intensively studied because of its wide application to Information Retrieval and Natural Language Processing. Yet, traditional models such as LSA only examine one (current) version of the document. However, due to the recent proliferation of collaboratively generated content such as threads in online forums, Collaborative Question Answering archives, Wikipedia, and other versioned content, the document generation process is now directly observable. In this study, we explore how this additional temporal information about the document evolution could be used to enhance the identification of latent document topics. Specifically, we propose a novel hidden-topic modeling algorithm, temporal Latent Semantic Analysis (tLSA), which elegantly extends LSA to modeling document revision history using tensor decomposition. Our experiments show that tLSA outperforms LSA on word relatedness estimation using benchmark data, and explore applications of tLSA for other tasks.
- This section requires expansion. Please, help!
Probably, this publication is cited by others, but there are no articles available for them in WikiPapers. Cited 1 time(s)