Browse wiki

Jump to: navigation, search
Mining a multilingual geographical gazetteer from the Web
Abstract Geographical gazetteers are necessary in aGeographical gazetteers are necessary in a wide variety of applications. In the past, the construction of such gazetteers has been a tedious, manual process and only recently have the first attempts to automate the gazetteers creation been made. Here we describe our approach for mining accurate but large-scale multilingual geographic information by successively filtering information found in heterogeneous data sources (Flickr, Wikipedia, Panoramio, Web pages indexed by search engines). Statistically crosschecking information found in each site, we are able to identify new geographic objects, and to indicate, for each one, its name, its GPS coordinates, its encompassing regions (city, region, country), the language of the name, its popularity, and the type of the object (church, bridge, etc.). We evaluate our approach by comparing, wherever possible, our multilingual gazetteer to other known attempts at automatically building a geographic database and to Geonames, a manually built gazetteer.d to Geonames, a manually built gazetteer.
Abstractsub Geographical gazetteers are necessary in aGeographical gazetteers are necessary in a wide variety of applications. In the past, the construction of such gazetteers has been a tedious, manual process and only recently have the first attempts to automate the gazetteers creation been made. Here we describe our approach for mining accurate but large-scale multilingual geographic information by successively filtering information found in heterogeneous data sources (Flickr, Wikipedia, Panoramio, Web pages indexed by search engines). Statistically crosschecking information found in each site, we are able to identify new geographic objects, and to indicate, for each one, its name, its GPS coordinates, its encompassing regions (city, region, country), the language of the name, its popularity, and the type of the object (church, bridge, etc.). We evaluate our approach by comparing, wherever possible, our multilingual gazetteer to other known attempts at automatically building a geographic database and to Geonames, a manually built gazetteer.d to Geonames, a manually built gazetteer.
Bibtextype inproceedings  +
Doi 10.1109/WI-IAT.2009.16  +
Has author Adrian Popescu + , Gregory Grefenstette + , Bouamor H. +
Has extra keyword Geographic database + , Geographic information + , Geographic objects + , Heterogeneous data sources + , Manual process + , Wikipedia + , Search engine + , Websites +
Isbn 9780769538013  +
Language English +
Number of citations by publication 0  +
Number of references by publication 0  +
Pages 58–65  +
Published in Proceedings - 2009 IEEE/WIC/ACM International Conference on Web Intelligence, WI 2009 +
Title Mining a multilingual geographical gazetteer from the Web +
Type conference paper  +
Volume 1  +
Year 2009 +
Creation dateThis property is a special property in this wiki. 8 November 2014 08:09:01  +
Categories Publications without keywords parameter  + , Publications without license parameter  + , Publications without remote mirror parameter  + , Publications without archive mirror parameter  + , Publications without paywall mirror parameter  + , Conference papers  + , Publications without references parameter  + , Publications  +
Modification dateThis property is a special property in this wiki. 8 November 2014 08:09:01  +
DateThis property is a special property in this wiki. 2009  +
hide properties that link here 
Mining a multilingual geographical gazetteer from the Web + Title
 

 

Enter the name of the page to start browsing from.