Last modified on September 20, 2014, at 23:07

Jorge Civera

Jorge Civera is an author.

Publications

Only those publications related to wikis are shown here.
Title: Extracción de Corpus Paralelos de la Wikipedia basada en la Obtención de Alineamientos Bilingües a Nivel de Frase
Keyword(s): Comparable corpora; Parallel sentences extraction; Machine translation
Published in: Proceedings of the Workshop on Iberian Cross-Language Natural Language Processing Tasks (ICL 2011)
Language: Spanish
Date: 2011
Abstract: This paper presents a proposal for extracting parallel corpora from Wikipedia on the basis of statistical machine translation techniques. We have used word-level alignment models from IBM in order to obtain phrase-level bilingual alignments between document pairs. We have manually annotated a set of test English-Spanish comparable documents in order to evaluate the model. The obtained results are encouraging.
R: 4
C: 0