A multiple-stage framework for related entity finding: FDWIM at TREC 2010 entity track
|A multiple-stage framework for related entity finding: FDWIM at TREC 2010 entity track|
|Author(s)||Wang D., Wu Q., Chen H., Niu J.|
|Published in||NIST Special Publication|
|Keyword(s)||Unknown (Extra: Deep mining, Document Retrieval, Entity extractions, Entity ranking, Feature-based algorithm, Homepage, Page ranks, Retrieval accuracy, Retrieval frameworks, Wikipedia, Filtration, Pattern recognition, Search engines, Information retrieval)|
|Article||BASE, CiteSeerX, Google Scholar|
|Web||Ask, Bing, Google (PDF), Yahoo!|
|Download and mirrors|
|Local copy||Not available|
|Remote mirror(s)||Not available|
|Export and share|
|BibTeX, CSV, RDF, JSON|
|Browse properties · List of conference papers|
A multiple-stage framework for related entity finding: FDWIM at TREC 2010 entity track is a 2010 conference paper written in English by Wang D., Wu Q., Chen H., Niu J. and published in NIST Special Publication.
This paper describes a multiple-stage retrieval framework for the task of related entity finding on TREC 2010 Entity Track. In the document retrieval stage, search engine is used to improve the retrieval accuracy. In the entity extraction and filtering stage, we extract entity with NER tools, Wikipedia and text pattern recognition. Then stoplist and other rules are employed to filter entity. Deep mining of the authority pages is proved to be effective in this stage. In entity ranking stage, many factors including keywords from narrative, page rank, combined results of corpus-based association rules and search engine are considered. In the final stage, an improved feature-based algorithm is proposed for the entity homepage detection.
- This section requires expansion. Please, help!
Probably, this publication is cited by others, but there are no articles available for them in WikiPapers.