DFT-extractor: A system to extract domain-specific faceted taxonomies from wikipedia
|DFT-extractor: A system to extract domain-specific faceted taxonomies from wikipedia|
|Author(s)||Wei B., Liu J., Ma J., Zheng Q., Zhang W., Feng B.|
|Published in||WWW 2013 Companion - Proceedings of the 22nd International Conference on World Wide Web|
|Keyword(s)||Faceted taxonomy, Network motif, Wikipedia (Extra: Classification models, Community detection algorithms, Domain specific, Heuristic rules, Network motif, Topical crawlers, Tree structures, Wikipedia, Graphical user interfaces, Trees (mathematics), World Wide Web, Taxonomies)|
|Article||BASE, CiteSeerX, Google Scholar|
|Web||Ask, Bing, Google (PDF), Yahoo!|
|Download and mirrors|
|Local copy||Not available|
|Remote mirror(s)||Not available|
|Export and share|
|BibTeX, CSV, RDF, JSON|
|Browse properties · List of conference papers|
DFT-extractor: A system to extract domain-specific faceted taxonomies from wikipedia is a 2013 conference paper written in English by Wei B., Liu J., Ma J., Zheng Q., Zhang W., Feng B. and published in WWW 2013 Companion - Proceedings of the 22nd International Conference on World Wide Web.
Extracting faceted taxonomies from the Web has received increasing attention in recent years from the web mining community. We demonstrate in this study a novel system called DFT-Extractor, which automatically constructs domain-specific faceted taxonomies from Wikipedia in three steps: 1) It crawls domain terms from Wikipedia by using a modified topical crawler. 2) Then it exploits a classification model to extract hyponym relations with the use of motif-based features. 3) Finally, it constructs a faceted taxonomy by applying a community detection algorithm and a group of heuristic rules. DFT-Extractor also provides a graphical user interface to visualize the learned hyponym relations and the tree structure of taxonomies.
- This section requires expansion. Please, help!
Probably, this publication is cited by others, but there are no articles available for them in WikiPapers. Cited 1 time(s)