Browse wiki

Jump to: navigation, search
Automatic multilingual lexicon generation using wikipedia as a resource
Abstract This paper proposes a method for creating This paper proposes a method for creating a multilingual dictionary by taking the titles of Wikipedia pages in English and then finding the titles of the corresponding articles in other languages. The creation of such multilingual dictionaries has become possible as a result of exponential increase in the size of multilingual information on the web. Wikipedia is a prime example of such multilingual source of information on any conceivable topic in the world, which is edited by the readers. Here, a web crawler has been used to traverse Wikipedia following the links on a given page. The crawler takes out the title along with the titles of the corresponding pages in other targeted languages. The result is a set of words and phrases that are translations of each other. For efficiency, the URLs are organized using hash tables. A lexicon has been constructed which contains 7-tuples corresponding to 7 different languages, namely: English, German, French, Polish, Bulgarian, Greek and Chinese.nch, Polish, Bulgarian, Greek and Chinese.
Abstractsub This paper proposes a method for creating This paper proposes a method for creating a multilingual dictionary by taking the titles of Wikipedia pages in English and then finding the titles of the corresponding articles in other languages. The creation of such multilingual dictionaries has become possible as a result of exponential increase in the size of multilingual information on the web. Wikipedia is a prime example of such multilingual source of information on any conceivable topic in the world, which is edited by the readers. Here, a web crawler has been used to traverse Wikipedia following the links on a given page. The crawler takes out the title along with the titles of the corresponding pages in other targeted languages. The result is a set of words and phrases that are translations of each other. For efficiency, the URLs are organized using hash tables. A lexicon has been constructed which contains 7-tuples corresponding to 7 different languages, namely: English, German, French, Polish, Bulgarian, Greek and Chinese.nch, Polish, Bulgarian, Greek and Chinese.
Bibtextype inproceedings  +
Has author Shahid A.R. + , Kazakov D. +
Has extra keyword Multilingual lexicons + , Natural Language Processing + , Web crawler + , Web mining + , Wikipedia + , Artificial intelligence + , Computational linguistics + , Linguistics + , Natural language processing systems + , Query languages + , Translation +
Has keyword Data mining + , Multilingual lexicons + , Natural Language Processing + , Web crawler + , Web mining + , Wikipedia +
Isbn 9789898111661  +
Language English +
Number of citations by publication 0  +
Number of references by publication 0  +
Pages 357–360  +
Published in ICAART 2009 - Proceedings of the 1st International Conference on Agents and Artificial Intelligence +
Title Automatic multilingual lexicon generation using wikipedia as a resource +
Type conference paper  +
Year 2009 +
Creation dateThis property is a special property in this wiki. 6 November 2014 22:07:58  +
Categories Publications without license parameter  + , Publications without DOI parameter  + , Publications without remote mirror parameter  + , Publications without archive mirror parameter  + , Publications without paywall mirror parameter  + , Conference papers  + , Publications without references parameter  + , Publications  +
Modification dateThis property is a special property in this wiki. 6 November 2014 22:07:58  +
DateThis property is a special property in this wiki. 2009  +
hide properties that link here 
Automatic multilingual lexicon generation using wikipedia as a resource + Title
 

 

Enter the name of the page to start browsing from.