STAIRS: Towards efficient full-text filtering and dissemination in DHT environments
Abstract Nowadays "live" content, such as weblog, Wikipedia, and news, is ubiquitous in the Internet. Providing users with relevant content in a timely manner becomes a challenging problem. Differing from Web search technologies and RSS feeds/reader applications, this paper envisions a personalized full-text content filtering and dissemination system in a highly distributed environment such as a Distributed Hash Table (DHT) based Peer-to-Peer (P2P) Network. Users subscribe to their interested content by specifying input keywords and thresholds as filters. Then, content is disseminated to those users having interest in it. In the literature, full-text document publishing in DHTs has suffered for a long time from the high cost of forwarding a document to home nodes of all distinct terms. It is aggravated by the fact that a document contains a large number of distinct terms (typically tens or thousands of terms per document). In this paper, we propose a set of novel techniques to overcome such a high forwarding cost by carefully selecting a very small number of meaningful terms (or key features) among candidate terms inside each document. Next, to reduce the average hop count per forwarding, we further prune irrelevant documents during the forwarding path. Experiments based on two real query logs and two real data sets demonstrate the effectiveness of our solution.
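The forwarding-cost problem described in the abstract can be illustrated with a minimal Python sketch. It is not the STAIRS method: the term selection here is plain term frequency, the filter threshold is read as the fraction of a filter's keywords found in the document, and the DHT lookup is replaced by a hash modulo a fixed node count. The names `home_node`, `select_terms`, `forwarding_targets`, `matches`, and `NUM_NODES` are all hypothetical and chosen only for illustration.

```python
import hashlib
from collections import Counter

NUM_NODES = 16  # hypothetical number of nodes in the overlay


def home_node(term, num_nodes=NUM_NODES):
    """Map a term to a node id by hashing (a stand-in for a real DHT lookup)."""
    digest = hashlib.sha1(term.encode("utf-8")).hexdigest()
    return int(digest, 16) % num_nodes


def select_terms(doc_terms, k):
    """Pick the k most frequent terms (placeholder for the paper's selection).

    Ties are broken alphabetically so the result is deterministic.
    """
    counts = Counter(doc_terms)
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [term for term, _ in ranked[:k]]


def forwarding_targets(terms):
    """Set of home nodes a document must be forwarded to for these terms."""
    return {home_node(t) for t in terms}


def matches(filter_terms, threshold, doc_terms):
    """Assumed filter semantics: the fraction of filter keywords that occur
    in the document must reach the threshold."""
    doc = set(doc_terms)
    hits = sum(1 for t in filter_terms if t in doc)
    return hits / len(filter_terms) >= threshold


doc = ["dht", "dht", "p2p", "filter", "dht", "p2p", "news", "hop"]
selected = select_terms(doc, 2)

# Forwarding by every distinct term versus by the selected terms only.
all_targets = forwarding_targets(set(doc))
selected_targets = forwarding_targets(selected)
```

Because the selected terms are a subset of the document's distinct terms, `selected_targets` is always a subset of `all_targets`, which is the sense in which selecting fewer terms lowers the number of forwards. Whether matching filters are still reached depends on the selection rule, which is the part the paper addresses and this sketch does not.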
Bibtextype article
Doi 10.1007/s00778-011-0224-z
Has author Rao W., Long Chen, Fu A.W.-C.
Has extra keyword Content dissemination, Content filtering, DHT, Distributed environments, Distributed hash tables, Full-text documents, High costs, Home nodes, Hop count, Key feature, Novel techniques, P2P, Query logs, Real data sets, RSS feeds, Web searches, Wikipedia, Websites, Peer to peer networks
Has keyword Content dissemination, Content filtering, DHT
Issn 1066-8888
Issue 6
Language English
Number of citations by publication 0
Number of references by publication 0
Pages 793–817
Published in VLDB Journal
Title STAIRS: Towards efficient full-text filtering and dissemination in DHT environments
Type journal article
Volume 20
Year 2011
Creation date 8 November 2014 05:43:44
Categories Publications without license parameter, Publications without remote mirror parameter, Publications without archive mirror parameter, Publications without paywall mirror parameter, Journal articles, Publications without references parameter, Publications
Modification date 8 November 2014 05:43:44
Date 2011