Andrey Simanovsky

From WikiPapers
Jump to: navigation, search

Andrey Simanovsky is an author.

Publications

Only those publications related to wikis are shown here.
Title Keyword(s) Published in Language DateThis property is a special property in this wiki. Abstract R C
Classifying Wikipedia entities into fine-grained classes ICDEW English 2011 0 0
Mining text patterns for synonyms extraction Synonym extraction
Thesauri creation
Proceedings - International Workshop on Database and Expert Systems Applications, DEXA English 2011 We propose a method for extracting synonym patterns from text and ranking them. Patterns are the text fragments between pairs of synonyms. In our method we extract a number of synonyms fromWikipedia, build a markup on Wikipedia articles and then extract patterns and measure their confidence. These patterns can be used for extracting synonyms from free text. 0 0
Term validation for vocabulary construction and key term extraction International Conference Recent Advances in Natural Language Processing, RANLP English 2011 We extract new terminology from a text by term validation in a dictionary. Our approach is based on estimating probabilities for previously unseen terms, i.e. not present in a dictionary. To do this we apply several probabilistic models previously not used for term recognition and propose a new one. We apply restriction of domain similarity on terms used for probability estimation and vary the parameters of the models. Performance of our approach is demonstrated using Wikipedia titles vocabulary. 0 0
Fine grained classification of named entities in Wikipedia Classification
Named entity recognition
Wikipedia
HP Laboratories Technical Report English 2010 This report describes the study on classifying Wikipedia articles into an extended set of named entity classes. We employed semi-automatic method to extend Wikipedia class annotation and created a training set for 15 named entity classes. We implemented two classifiers. A binary named-entity classifier decides between articles about named entities and other articles. A support vector machine (SVM) classifier trained on a variety ofWikipedia features determines the class of a named entity. Combination of the two classifiers helped us to boost classification quality and obtain classification quality that is better than state of the art. © Copyright 2010 Hewlett-Packard Development Company, L.P. 0 0