Mining a Large-Scale Term-Concept Network from Wikipedia
|Mining a Large-Scale Term-Concept Network from Wikipedia|
|Author(s)||Andrew Gregorowicz and Mark A. Kramer|
|Published in||Mitre Technical Report|
|Keyword(s)||Information retrieval, concept search, Wikipedia, text mining.|
|Article||BASE, CiteSeerX, Google Scholar|
|Web||Ask, Bing, Google (PDF), Yahoo!|
|Download and mirrors|
|Local copy||Not available|
|Export and share|
|BibTeX, CSV, RDF, JSON|
|Browse properties · List of journal articles|
Social tagging and information retrieval are challenged by the fact that the same item or idea can be expressed by different terms or words. To counteract the problem of variable terminology, researchers have proposed concept-based information retrieval. To date, however, most concept spaces have been either manually-produced taxonomies or special-purpose ontologies, too small for classifying arbitrary resources. To create a large set of concepts, and to facilitate terms to concept mapping, we introduce mine a network of concepts and terms from Wikipedia. Our algorithm results in a robust, extensible term-concept network for tagging and information retrieval, containing over 2,000,000 concepts with mappings to over 3,000,000 unique terms.
- This section requires expansion. Please, help!
Cited byThis publication has 1 citations. Only those publications available in WikiPapers are shown here: