Managing information disparity in multilingual document collections

From WikiPapers
Jump to: navigation, search

Managing information disparity in multilingual document collections is a 2013 journal article written in English by Duh K., Yeung C.-M.A., Iwata T., Nagata M. and published in ACM Transactions on Speech and Language Processing.

[edit] Abstract

Information disparity is a major challenge with multilingual document collections. When documents are dynamically updated in a distributed fashion, information content among different language editions may gradually diverge. We propose a framework for assisting human editors to manage this information disparity, using tools from machine translation and machine learning. Given source and target documents in two different languages, our system automatically identifies information nuggets that are new with respect to the target and suggests positions to place their translations. We perform both real-world experiments and large-scale simulations on Wikipedia documents and conclude our system is effective in a variety of scenarios.

[edit] References

This section requires expansion. Please, help!

Cited by

Probably, this publication is cited by others, but there are no articles available for them in WikiPapers.