On the Use of PU Learning for Quality Flaw Prediction in Wikipedia
|On the Use of PU Learning for Quality Flaw Prediction in Wikipedia|
|Author(s)||Edgardo Ferretti, Donato Hernández Fusilier, Rafael Guzmán Cabrera, Manuel Montes y Gómez, Marcelo Errecalde, Paolo Rosso|
|Keyword(s)||Unknown (Extra: quality)|
|Dataset(s)||PAN Wikipedia quality flaw corpus 2012|
|Article||BASE, CiteSeerX, Google Scholar|
|Web||Ask, Bing, Google (PDF), Yahoo!|
|Download and mirrors|
|Local copy||Not available|
|Export and share|
|BibTeX, CSV, RDF, JSON|
|Browse properties · List of conference papers|
On the Use of PU Learning for Quality Flaw Prediction in Wikipedia is a 2012 conference paper written in English by Edgardo Ferretti, Donato Hernández Fusilier, Rafael Guzmán Cabrera, Manuel Montes y Gómez, Marcelo Errecalde, Paolo Rosso and published in PAN.
In this article we describe a new approach to assess Quality Flaw Prediction in Wikipedia. The partially supervised method studied, called PU Learning, has been successfully applied in classifications tasks with traditional corpora like Reuters-21578 or 20-Newsgroups. To the best of our knowledge, this is the first time that it is applied in this domain. Throughout this paper, we describe how the original PU Learning approach was evaluated for assessing quality flaws and the modifications introduced to get a quality flaws predictor which obtained the best F1 scores in the task “Quality Flaw Prediction in Wikipedia” of the PAN challenge.
- This section requires expansion. Please, help!
Cited byThis publication has 1 citations. Only those publications available in WikiPapers are shown here: