Automatically weighting tags in XML collection
|Author(s)||Liu D., Wan C., Chen L., Liu X.|
|Published in||International Conference on Information and Knowledge Management, Proceedings|
|Keyword(s)||Tag weighting model, Topic generalization, XML retrieval (Extra: Automatic method, Highly-correlated, Real data sets, Relevance ranking, Retrieval effectiveness, Retrieval performance, Weight assignment, Weighting model, Wikipedia, Feature extraction, Knowledge management, Quality control, XML, Information retrieval)|
Automatically weighting tags in XML collection is a 2010 conference paper written in English by Liu D., Wan C., Chen L., and Liu X., published in the Proceedings of the International Conference on Information and Knowledge Management.
In XML retrieval, nodes with different tags play different roles in XML documents, so tags should be reflected in the relevance ranking. This paper proposes an automatic method for inferring tag weights. We first investigate 15 features of tags, then select five of them based on their correlations with manually assigned tag weights. Using these features, we design a tag weight assignment model, ATG. We evaluate ATG on two real data sets, IEEECS and Wikipedia, from two perspectives. The first measures the quality of the model through the correlation between the weights it generates and those given by experts. The second tests how effectively the model improves retrieval performance. Experimental results show that the tag weights generated by ATG are highly correlated with the manually assigned weights and that the ATG model improves retrieval effectiveness significantly.
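The correlation-based feature selection described in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: the abstract does not say which correlation measure is used, so Pearson correlation is assumed here, and the function names (`pearson`, `select_features`), feature names, tags, and weights are all hypothetical.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences (assumed measure)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant column carries no correlation signal
    return cov / sqrt(var_x * var_y)


def select_features(feature_table, manual_weights, k=5):
    """Return the k feature names most correlated with the manual weights.

    feature_table maps feature name -> {tag: value};
    manual_weights maps tag -> expert-assigned weight.
    """
    tags = sorted(manual_weights)
    target = [manual_weights[t] for t in tags]
    scored = []
    for name, values in feature_table.items():
        column = [values[t] for t in tags]
        scored.append((abs(pearson(column, target)), name))
    # Strongest absolute correlation first; ties broken by feature name.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [name for _, name in scored[:k]]


# Hypothetical toy data, not taken from the paper.
manual_weights = {"title": 1.0, "abstract": 0.8, "p": 0.5, "fn": 0.1}
feature_table = {
    "avg_text_length": {"title": 8, "abstract": 150, "p": 90, "fn": 20},
    "depth": {"title": 2, "abstract": 2, "p": 4, "fn": 5},
    "constant": {"title": 1, "abstract": 1, "p": 1, "fn": 1},
}
print(select_features(feature_table, manual_weights, k=2))
```

The same `pearson` helper could also serve for the first evaluation perspective, comparing model-generated weights against expert weights, again assuming Pearson correlation is an acceptable stand-in for whatever measure the paper uses.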
This publication has been cited 4 times, but none of the citing articles are available in WikiPapers.