Browse wiki

Jump to: navigation, search
CE2 - Towards a large scale hybrid search engine with integrated ranking support
Abstract The Web contains a large amount of documenThe Web contains a large amount of documents and increasingly, also semantic data in the form of RDF triples. Many of these triples are annotations that are associated with documents. While structured query is the principal mean to retrieve semantic data, keyword queries are typically used for document retrieval. Clearly, a form of hybrid search that seamlessly integrates these formalisms to query both documents and semantic data can address more complex information needs. In this paper, we present CE2, an integrated solution that leverages mature database and information retrieval technologies to tackle challenges in hybrid search on the large scale. For scalable storage, CE2 integrates database with inverted indices. Hybrid query processing is supported in CE2 through novel algorithms and data structures, which allow for advanced ranking schemes to be integrated more tightly into the process. Experiments conducted on Dbpedia and Wikipedia show that CE2 can provide good performance in terms of both effectiveness and effciency.terms of both effectiveness and effciency.
Abstractsub The Web contains a large amount of documenThe Web contains a large amount of documents and increasingly, also semantic data in the form of RDF triples. Many of these triples are annotations that are associated with documents. While structured query is the principal mean to retrieve semantic data, keyword queries are typically used for document retrieval. Clearly, a form of hybrid search that seamlessly integrates these formalisms to query both documents and semantic data can address more complex information needs. In this paper, we present CE2, an integrated solution that leverages mature database and information retrieval technologies to tackle challenges in hybrid search on the large scale. For scalable storage, CE2 integrates database with inverted indices. Hybrid query processing is supported in CE2 through novel algorithms and data structures, which allow for advanced ranking schemes to be integrated more tightly into the process. Experiments conducted on Dbpedia and Wikipedia show that CE2 can provide good performance in terms of both effectiveness and effciency.terms of both effectiveness and effciency.
Bibtextype inproceedings  +
Doi 10.1145/1458082.1458258  +
Has author Haofen Wang + , Thanh Tran + , Che-Hung Liu +
Has extra keyword Annotations + , Complex information + , Document Retrieval + , Hybrid search + , Information-retrieval technology + , Integrated ranking + , Integrated solutions + , Inverted indices + , Keyword queries + , Novel algorithm + , Ranking + , Scalable storage + , Semantic data + , Structured queries + , Wikipedia + , Algorithms + , Data structures + , Information retrieval + , Information services + , Knowledge management + , Search engine + , Semantics +
Has keyword Annotations + , Hybrid search + , Ranking + , Scalable storage +
Isbn 9781595939913  +
Language English +
Number of citations by publication 0  +
Number of references by publication 0  +
Pages 1323–1324  +
Published in International Conference on Information and Knowledge Management, Proceedings +
Title CE2 - Towards a large scale hybrid search engine with integrated ranking support +
Type conference paper  +
Year 2008 +
Creation dateThis property is a special property in this wiki. 7 November 2014 05:01:31  +
Categories Publications without license parameter  + , Publications without remote mirror parameter  + , Publications without archive mirror parameter  + , Publications without paywall mirror parameter  + , Conference papers  + , Publications without references parameter  + , Publications  +
Modification dateThis property is a special property in this wiki. 7 November 2014 05:01:31  +
DateThis property is a special property in this wiki. 2008  +
hide properties that link here 
CE2 - Towards a large scale hybrid search engine with integrated ranking support + Title
 

 

Enter the name of the page to start browsing from.