Enriching multilingual language resources by discovering missing cross-language links in Wikipedia
|Author(s)||Oh J.-H., Kawahara D., Uchimoto K., Kazama J., Torisawa K.|
|Published in||Proceedings - 2008 IEEE/WIC/ACM International Conference on Web Intelligence, WI 2008|
|Keyword(s)||Unknown (Extra: Classifiers, Concentration (process), Learning systems, External knowledge, Language resources, Novel methods, Recall rates, Wikipedia, Linguistics)|
Enriching multilingual language resources by discovering missing cross-language links in Wikipedia is a 2008 conference paper written in English by Oh J.-H., Kawahara D., Uchimoto K., Kazama J., and Torisawa K., and published in Proceedings - 2008 IEEE/WIC/ACM International Conference on Web Intelligence, WI 2008.
We present a novel method for discovering missing cross-language links between English and Japanese Wikipedia articles. We collect candidates for missing cross-language links: pairs of English and Japanese Wikipedia articles that could be connected by a cross-language link. We then select the correct cross-language links among the candidates by using a classifier trained with various types of features. Our method has three desirable characteristics for discovering missing links. First, it discovers cross-language links with high accuracy (92% precision at 78% recall). Second, the features used in the classifier are language-independent. Third, the features are generated from resources obtained automatically from Wikipedia, without relying on any external knowledge. In this work, we discover approximately 10^5 missing cross-language links from Wikipedia, which is almost two-thirds as many as the existing cross-language links in Wikipedia.
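The two-stage shape described in the abstract (collect candidate article pairs, then keep the ones a classifier accepts) and the way precision and recall are measured can be illustrated with a small sketch. Everything concrete here is an assumption made for illustration: the feature names `link_overlap` and `title_match`, the weights, the threshold, and the toy article pairs are hypothetical, and a fixed weighted sum stands in for the trained classifier, whose actual features and learning algorithm are not given in the abstract.

```python
from dataclasses import dataclass


@dataclass
class Candidate:
    """A candidate cross-language link: one English and one Japanese title."""
    en_title: str
    ja_title: str
    features: dict  # hypothetical feature name -> value in [0, 1]


def score(candidate, weights):
    """Weighted sum of feature values (a stand-in for a trained classifier)."""
    return sum(weights.get(name, 0.0) * value
               for name, value in candidate.features.items())


def select_links(candidates, weights, threshold):
    """Keep the candidate pairs whose score reaches the threshold."""
    return [(c.en_title, c.ja_title)
            for c in candidates
            if score(c, weights) >= threshold]


def precision_recall(predicted, gold):
    """Precision = correct / predicted; recall = correct / gold."""
    predicted, gold = set(predicted), set(gold)
    correct = len(predicted & gold)
    precision = correct / len(predicted) if predicted else 0.0
    recall = correct / len(gold) if gold else 0.0
    return precision, recall


# Toy data, invented for illustration only.
CANDIDATES = [
    Candidate("Tokyo", "東京", {"link_overlap": 0.9, "title_match": 1.0}),
    Candidate("Kyoto", "京都", {"link_overlap": 0.8, "title_match": 0.0}),
    Candidate("Osaka", "東京", {"link_overlap": 0.9, "title_match": 0.0}),
    Candidate("Mount Fuji", "富士山", {"link_overlap": 0.7, "title_match": 1.0}),
]
GOLD = {("Tokyo", "東京"), ("Kyoto", "京都"), ("Mount Fuji", "富士山")}
WEIGHTS = {"link_overlap": 0.6, "title_match": 0.4}  # hypothetical weights

if __name__ == "__main__":
    selected = select_links(CANDIDATES, WEIGHTS, threshold=0.5)
    precision, recall = precision_recall(selected, GOLD)
    print(selected)
    print(f"precision={precision:.2f} recall={recall:.2f}")
```

In this toy run one wrong pair is accepted and one correct pair is missed, so precision and recall both fall below 1. The reported figures (92% precision at 78% recall) are read the same way: of the links the classifier accepts, 92% are correct, and it recovers 78% of the links that should be found.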
Cited by
This publication has 1 citation. Only those publications available in WikiPapers are shown here:
Cited 4 time(s)