Automatic multilingual lexicon generation using wikipedia as a resource
|Automatic multilingual lexicon generation using wikipedia as a resource|
|Author(s)||Shahid A.R., Kazakov D.|
|Published in||ICAART 2009 - Proceedings of the 1st International Conference on Agents and Artificial Intelligence|
|Keyword(s)||Data mining, Multilingual lexicons, Natural language processing, Web crawler, Web mining, Wikipedia (Extra: Multilingual lexicons, Natural language processing, Web crawler, Web mining, Wikipedia, Artificial intelligence, Computational linguistics, Linguistics, Natural language processing systems, Query languages, Translation (languages))|
|Article||BASE, CiteSeerX, Google Scholar|
|Web||Ask, Bing, Google (PDF), Yahoo!|
|Download and mirrors|
|Local copy||Not available|
|Remote mirror(s)||Not available|
|Export and share|
|BibTeX, CSV, RDF, JSON|
|Browse properties · List of conference papers|
Automatic multilingual lexicon generation using wikipedia as a resource is a 2009 conference paper written in English by Shahid A.R., Kazakov D. and published in ICAART 2009 - Proceedings of the 1st International Conference on Agents and Artificial Intelligence.
This paper proposes a method for creating a multilingual dictionary by taking the titles of Wikipedia pages in English and then finding the titles of the corresponding articles in other languages. The creation of such multilingual dictionaries has become possible as a result of exponential increase in the size of multilingual information on the web. Wikipedia is a prime example of such multilingual source of information on any conceivable topic in the world, which is edited by the readers. Here, a web crawler has been used to traverse Wikipedia following the links on a given page. The crawler takes out the title along with the titles of the corresponding pages in other targeted languages. The result is a set of words and phrases that are translations of each other. For efficiency, the URLs are organized using hash tables. A lexicon has been constructed which contains 7-tuples corresponding to 7 different languages, namely: English, German, French, Polish, Bulgarian, Greek and Chinese.
- This section requires expansion. Please, help!
Probably, this publication is cited by others, but there are no articles available for them in WikiPapers.