Choosing better seeds for entity set expansion by leveraging wikipedia semantic knowledge
|Choosing better seeds for entity set expansion by leveraging wikipedia semantic knowledge|
|Author(s)||Qi Z., Liu K., Zhao J.|
|Published in||Communications in Computer and Information Science|
|Keyword(s)||information extraction, seed set refinement, semantic knowledge (Extra: Information Extraction, Seed set, Semantic category, Semantic knowledge, Semantic relatedness, Set expansions, Web Corpora, Wikipedia, Natural language processing systems, Pattern recognition, Semantics, Websites, Knowledge management)|
|Article||BASE, CiteSeerX, Google Scholar|
|Web||Ask, Bing, Google (PDF), Yahoo!|
|Download and mirrors|
|Local copy||Not available|
|Remote mirror(s)||Not available|
|Export and share|
|BibTeX, CSV, RDF, JSON|
|Browse properties · List of conference papers|
Choosing better seeds for entity set expansion by leveraging wikipedia semantic knowledge is a 2012 conference paper written in English by Qi Z., Liu K., Zhao J. and published in Communications in Computer and Information Science.
Entity Set Expansion, which refers to expanding a human-input seed set to a more complete set which belongs to the same semantic category, is an important task for open information extraction. Because human-input seeds may be ambiguous, sparse etc., the quality of seeds has a great influence on expansion performance, which has been proved by many previous researches. To improve seeds quality, this paper proposes a novel method which can choose better seeds from original input ones. In our method, we leverage Wikipedia semantic knowledge to measure semantic relatedness and ambiguity of each seed. Moreover, to avoid the sparseness of the seed, we use web corpus to measure its population. Lastly, we use a linear model to combine these factors to determine the final selection. Experimental results show that new seed sets chosen by our method can improve expansion performance by up to average 13.4% over random selected seed sets.
- This section requires expansion. Please, help!
Probably, this publication is cited by others, but there are no articles available for them in WikiPapers.