Measuring the quality of web content using factual information
|Measuring the quality of web content using factual information|
|Author(s)||Lex E., Voelske M., Errecalde M., Ferretti E., Cagnina L., Horn C., Stein B., Granitzer M.|
|Published in||ACM International Conference Proceeding Series|
|Keyword(s)||Unknown (Extra: Count measure, Density measures, F-measure, Factual information, Information Extraction, Relational features, Statistical quality, Web content, Wikipedia, Computer applications, Websites)|
|Article||BASE, CiteSeerX, Google Scholar|
|Web||Ask, Bing, Google (PDF), Yahoo!|
|Download and mirrors|
|Local copy||Not available|
|Remote mirror(s)||Not available|
|Export and share|
|BibTeX, CSV, RDF, JSON|
|Browse properties · List of conference papers|
Measuring the quality of web content using factual information is a 2012 conference paper written in English by Lex E., Voelske M., Errecalde M., Ferretti E., Cagnina L., Horn C., Stein B., Granitzer M. and published in ACM International Conference Proceeding Series.
Nowadays, many decisions are based on information found in the Web. For the most part, the disseminating sources are not certified, and hence an assessment of the quality and credibility of Web content became more important than ever. With factual density we present a simple statistical quality measure that is based on facts extracted from Web content using Open Information Extraction. In a first case study, we use this measure to identify featured/good articles in Wikipedia. We compare the factual density measure with word count, a measure that has successfully been applied to this task in the past. Our evaluation corroborates the good performance of word count in Wikipedia since featured/good articles are often longer than non-featured. However, for articles of similar lengths the word count measure fails while factual density can separate between them with an F-measure of 90.4%. We also investigate the use of relational features for categorizing Wikipedia articles into featured/good versus non-featured ones. If articles have similar lengths, we achieve an F-measure of 86.7% and 84% otherwise.
- This section requires expansion. Please, help!
Probably, this publication is cited by others, but there are no articles available for them in WikiPapers.