A multiple-stage framework for related entity finding: FDWIM at TREC 2010 entity track

A multiple-stage framework for related entity finding: FDWIM at TREC 2010 entity track is a 2010 conference paper written in English by Wang D., Wu Q., Chen H., and Niu J., published in NIST Special Publication.

Abstract

This paper describes a multiple-stage retrieval framework for the related entity finding task of the TREC 2010 Entity Track. In the document retrieval stage, a search engine is used to improve retrieval accuracy. In the entity extraction and filtering stage, entities are extracted with NER tools, Wikipedia, and text pattern recognition; a stoplist and other rules are then applied to filter the extracted entities. Deep mining of the authority pages proves to be effective in this stage. In the entity ranking stage, several factors are considered, including keywords from the narrative, page rank, and the combined results of corpus-based association rules and the search engine. In the final stage, an improved feature-based algorithm is proposed for entity homepage detection.
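
The filtering and ranking stages described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the stoplist contents, the `Candidate` fields, the length rule, the weights, and the example entities are all hypothetical, chosen only to show how a stoplist filter and a weighted combination of ranking factors might fit together.

```python
from dataclasses import dataclass

# Hypothetical stoplist; the abstract does not give the actual entries.
STOPLIST = {"home", "contact", "news"}


@dataclass
class Candidate:
    name: str
    keyword_score: float  # assumed: overlap with narrative keywords, in [0, 1]
    page_rank: float      # assumed: authority of the source page, in [0, 1]
    association: float    # assumed: combined corpus/search-engine association, in [0, 1]


def filter_candidates(candidates):
    """Drop candidates that are in the stoplist or have very short names."""
    return [
        c for c in candidates
        if c.name.lower() not in STOPLIST and len(c.name) > 2
    ]


def rank_candidates(candidates, weights=(0.4, 0.2, 0.4)):
    """Sort candidates by a weighted sum of the three factors, highest first.

    The weights are illustrative assumptions, not values from the paper.
    """
    w_kw, w_pr, w_as = weights

    def score(c):
        return w_kw * c.keyword_score + w_pr * c.page_rank + w_as * c.association

    return sorted(candidates, key=score, reverse=True)


# Made-up candidates for illustration only.
candidates = [
    Candidate("News", 0.9, 0.9, 0.9),
    Candidate("Acme Airlines", 0.6, 0.5, 0.8),
    Candidate("Globex Air", 0.7, 0.9, 0.3),
]
ranked = rank_candidates(filter_candidates(candidates))
print([c.name for c in ranked])
```

In this toy run, "News" is removed by the stoplist even though its scores are high, and the remaining candidates are ordered by the combined score. The paper's homepage detection and document retrieval stages are not covered here.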
