Semantic Content Filtering with Wikipedia and Ontologies
| Semantic Content Filtering with Wikipedia and Ontologies | |
| Author(s) | Pekka Malo, Pyry Siitari, Oskar Ahlgren, Jyrki Wallenius, Pekka Korhonen |
| Published in | Unknown [+] |
| Date | 2010 |
| Keyword(s) | Unknown [+] |
| Peer-reviewed? | Unknown [+] |
| Language(s) | English |
| License(s) | Unknown [+] |
| Identifiers | |
| ISBN | Unknown [+] |
| DOI | Unknown [+] |
| OCLC Number | Unknown [+] |
| CiteULike | Unknown [+] |
| arXiv | Unknown [+] |
| PubMed | Unknown [+] |
| Related material | |
| Concept(s) | Unknown [+] |
| Tool(s) | Unknown [+] |
| Dataset(s) | Unknown [+] |
| Slides | Not available [+] |
| Presentation | Not available [+] |
| Search | |
| Article | BASE, CiteSeerX, Google Scholar |
| Web | Ask, Bing, Google (PDF), Yahoo! |
| Download and mirrors | |
| Local copy | Not available [+] |
| Remote mirror(s) | arxiv.org |
| Archive(s) | Not available [+] |
| Paywall(s) | Not available [+] |
| Export and share | |
| BibTeX, CSV, RDF, JSON | |
| | |
| Browse properties ยท List of publications | |
Semantic Content Filtering with Wikipedia and Ontologies is a 2010 publication written in English by Pekka Malo, Pyry Siitari, Oskar Ahlgren, Jyrki Wallenius, Pekka Korhonen.
[edit] Abstract
The use of domain knowledge is generally found to improve query efficiency in content filtering applications. In particular, tangible benefits have been achieved when using knowledge-based approaches within more specialized fields, such as medical free texts or legal documents. However, the problem is that sources of domain knowledge are time-consuming to build and equally costly to maintain. As a potential remedy, recent studies on Wikipedia suggest that this large body of socially constructed knowledge can be effectively harnessed to provide not only facts but also accurate information about semantic concept-similarities. This paper describes a framework for document filtering, where Wikipedia's concept-relatedness information is combined with a domain ontology to produce semantic content classifiers. The approach is evaluated using Reuters RCV1 corpus and TREC-11 filtering task definitions. In a comparative study, the approach shows robust performance and appears to outperform content classifiers based on Support Vector Machines (SVM) and C4.5 algorithm.
[edit] References
This publication has 17 references. Only those references related to wikis are included here:
- "Computing semantic relatedness using Wikipedia link structure" (create it!) [search]
- "Overcoming the brittleness bottleneck using Wikipedia" (create it!) [search]
- "Wikify!: linking documents to encyclopedic knowledge" (create it!) [search]
- "Mining a large-scale term-concept network from Wikipedia" (create it!) [search]
- "Improving weak ad-hoc queries using Wikipedia as external corpus" (create it!) [search]
- "Cross-domain Text Classification using Wikipedia" (create it!) [search]
- "An open-source toolkit for mining Wikipedia" (create it!) [search]
Cited by
Probably, this publication is cited by others, but there are no articles available for them in WikiPapers.
Discussion
No comments yet. Be first!
