Managing information disparity in multilingual document collections
|Managing information disparity in multilingual document collections|
|Author(s)||Duh K., Yeung C.-M.A., Iwata T., Nagata M.|
|Published in||ACM Transactions on Speech and Language Processing|
|Keyword(s)||Algorithms, Experimentation, Languages (Extra: Experimentation, Information contents, Large scale simulations, Machine translations, Multilingual documents, Real world experiment, Wikipedia, Algorithms, Experiments, Query languages, Translation (languages))|
|Article||BASE, CiteSeerX, Google Scholar|
|Web||Ask, Bing, Google (PDF), Yahoo!|
|Download and mirrors|
|Local copy||Not available|
|Remote mirror(s)||Not available|
|Export and share|
|BibTeX, CSV, RDF, JSON|
|Browse properties · List of journal articles|
Managing information disparity in multilingual document collections is a 2013 journal article written in English by Duh K., Yeung C.-M.A., Iwata T., Nagata M. and published in ACM Transactions on Speech and Language Processing.
Information disparity is a major challenge with multilingual document collections. When documents are dynamically updated in a distributed fashion, information content among different language editions may gradually diverge. We propose a framework for assisting human editors to manage this information disparity, using tools from machine translation and machine learning. Given source and target documents in two different languages, our system automatically identifies information nuggets that are new with respect to the target and suggests positions to place their translations. We perform both real-world experiments and large-scale simulations on Wikipedia documents and conclude our system is effective in a variety of scenarios.
- This section requires expansion. Please, help!
Probably, this publication is cited by others, but there are no articles available for them in WikiPapers.