Browse wiki

Jump to: navigation, search
Managing information disparity in multilingual document collections
Abstract Information disparity is a major challengeInformation disparity is a major challenge with multilingual document collections. When documents are dynamically updated in a distributed fashion, information content among different language editions may gradually diverge. We propose a framework for assisting human editors to manage this information disparity, using tools from machine translation and machine learning. Given source and target documents in two different languages, our system automatically identifies information nuggets that are new with respect to the target and suggests positions to place their translations. We perform both real-world experiments and large-scale simulations on Wikipedia documents and conclude our system is effective in a variety of scenarios.em is effective in a variety of scenarios.
Abstractsub Information disparity is a major challengeInformation disparity is a major challenge with multilingual document collections. When documents are dynamically updated in a distributed fashion, information content among different language editions may gradually diverge. We propose a framework for assisting human editors to manage this information disparity, using tools from machine translation and machine learning. Given source and target documents in two different languages, our system automatically identifies information nuggets that are new with respect to the target and suggests positions to place their translations. We perform both real-world experiments and large-scale simulations on Wikipedia documents and conclude our system is effective in a variety of scenarios.em is effective in a variety of scenarios.
Bibtextype article  +
Doi 10.1145/2442076.2442077  +
Has author Kevin Duh + , Yeung C.-M.A. + , Iwata T. + , Masaaki Nagata +
Has extra keyword Experimentation + , Information contents + , Large scale simulations + , Machine translations + , Multilingual documents + , Real world experiment + , Wikipedia + , Algorithms + , Experiments + , Query languages + , Translation +
Has keyword Algorithms + , Experimentation + , Language +
Issn 15504875  +
Issue 1  +
Language English +
Number of citations by publication 0  +
Number of references by publication 0  +
Published in ACM Transactions on Speech and Language Processing +
Title Managing information disparity in multilingual document collections +
Type journal article  +
Volume 10  +
Year 2013 +
Creation dateThis property is a special property in this wiki. 7 November 2014 22:35:16  +
Categories Publications without license parameter  + , Publications without remote mirror parameter  + , Publications without archive mirror parameter  + , Publications without paywall mirror parameter  + , Journal articles  + , Publications without references parameter  + , Publications  +
Modification dateThis property is a special property in this wiki. 7 November 2014 22:35:16  +
DateThis property is a special property in this wiki. 2013  +
hide properties that link here 
Managing information disparity in multilingual document collections + Title
 

 

Enter the name of the page to start browsing from.