Browse wiki

Jump to: navigation, search
Extracción de Corpus Paralelos de la Wikipedia basada en la Obtención de Alineamientos Bilingües a Nivel de Frase
Abstract This paper presents a proposal for extractThis paper presents a proposal for extracting parallel corpora from Wikipedia on the basis of statistical machine translation techniques. We have used word-level alignment models from IBM in order to obtain phrase-level bilingual alignments between documents pairs. We have manually annotated a set of test English-Spanish comparable documents in order to evaluate the model. The obtained results are encouraging.del. The obtained results are encouraging.
Abstractsub This paper presents a proposal for extractThis paper presents a proposal for extracting parallel corpora from Wikipedia on the basis of statistical machine translation techniques. We have used word-level alignment models from IBM in order to obtain phrase-level bilingual alignments between documents pairs. We have manually annotated a set of test English-Spanish comparable documents in order to evaluate the model. The obtained results are encouraging.del. The obtained results are encouraging.
Bibtextype inproceedings  +
Has author Joan Albert Silvestre-Cerdà + , Mercedes García-Martínez + , Alberto Barrón-Cedeño + , Jorge Civera + , Paolo Rosso +
Has extra keyword Wikipedia +
Has keyword Comparable corpora + , Parallel sentences extraction + , Machine translation +
Has reference Finding Similar Sentences across Multiple Languages in Wikipedia + , Building Bilingual Parallel Corpora Based on Wikipedia + , Mining wikipedia as a parallel and comparable corpus + , Method for building sentence-aligned corpus from wikipedia +
Has remote mirror http://ceur-ws.org/Vol-824/paper2.pdf  +
Has webcitation mirror 67qAWwT2B  +
Language Spanish +
Number of citations by publication 0  +
Number of references by publication 4  +
Pages 14-21  +
Published in Proceedings of the Workshop on Iberian Cross-Language Natural Language Processing Tasks (ICL 2011) +
Title Extracci´on de corpus paralelos de la Wikipedia basada en la obtenci´on de alineamientos biling¨ues a nivel de frase +
Type conference paper  +
Year 2011 +
Creation dateThis property is a special property in this wiki. 22 May 2012 02:58:57  +
Categories Publications without license parameter  + , Publications without DOI parameter  + , Publications without paywall mirror parameter  + , Conference papers  + , Publications  +
Modification dateThis property is a special property in this wiki. 22 May 2012 02:58:57  +
DateThis property is a special property in this wiki. 2011  +
hide properties that link here 
  No properties link to this page.
 

 

Enter the name of the page to start browsing from.