Fine grained classification of named entities in Wikipedia
|Author(s)||Tkachenko M., Ulanov A., Simanovsky A.|
|Published in||HP Laboratories Technical Report|
|Keyword(s)||Classification, Named entity recognition, Wikipedia (Extra: Classification quality, Fine grained, Named entities, Semiautomatic methods, State of the art, Training sets, Classifiers, Natural language processing systems)|
This report describes a study on classifying Wikipedia articles into an extended set of named entity classes. We employed a semi-automatic method to extend Wikipedia class annotation and created a training set for 15 named entity classes. We implemented two classifiers. A binary named-entity classifier decides between articles about named entities and other articles. A support vector machine (SVM) classifier trained on a variety of Wikipedia features determines the class of a named entity. Combining the two classifiers boosted classification quality beyond the state of the art. © Copyright 2010 Hewlett-Packard Development Company, L.P.
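The two-classifier combination described in the abstract can be pictured as a cascade: the binary classifier filters out articles that are not about named entities, and only the remaining articles reach the fine-grained classifier. The sketch below is only an illustration of that control flow, not the report's implementation. The names `TwoStageClassifier`, `toy_ne_filter`, and `toy_class_predictor`, the feature names, and the class labels `PERSON` and `LOCATION` are all hypothetical, and the toy rule-based functions stand in for the trained binary classifier and the SVM.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Optional

# Hypothetical representation of one article: feature name -> value.
Features = Dict[str, float]


@dataclass
class TwoStageClassifier:
    """Cascade of a binary named-entity filter and a class predictor."""
    is_named_entity: Callable[[Features], bool]  # stage 1: binary decision
    predict_class: Callable[[Features], str]     # stage 2: e.g. a trained SVM

    def classify(self, features: Features) -> Optional[str]:
        # Articles rejected by stage 1 never reach stage 2.
        if not self.is_named_entity(features):
            return None
        return self.predict_class(features)


# Toy stand-ins for trained models (assumptions, not from the report).
def toy_ne_filter(f: Features) -> bool:
    return f.get("has_infobox", 0.0) > 0.5


def toy_class_predictor(f: Features) -> str:
    # Pick the label whose (invented) indicator feature scores highest.
    scores = {
        "PERSON": f.get("cat_births", 0.0),
        "LOCATION": f.get("cat_cities", 0.0),
    }
    return max(scores, key=scores.get)


clf = TwoStageClassifier(toy_ne_filter, toy_class_predictor)
person = clf.classify({"has_infobox": 1.0, "cat_births": 1.0})
concept = clf.classify({"has_infobox": 0.0, "cat_cities": 1.0})
```

Keeping the two stages as separate callables means either one could be swapped for a real trained model without changing the cascade logic.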