Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Distributed Cache Table: Efficient Query-Driven Processing of Multi-Term Queries in P2P Networks
 
conference paper

Distributed Cache Table: Efficient Query-Driven Processing of Multi-Term Queries in P2P Networks

Skobeltsyn, Gleb  
•
Aberer, Karl  
2006
P2PIR '06: Proceedings of the international workshop on Information retrieval in peer-to-peer networks
P2PIR

The state-of-the-art techniques for processing multi-term queries in P2P environments are query flooding and inverted list intersection. However, it has been shown that due to scalability reasons both methods fail to support full-text search in large scale document collections distributed among the nodes in a P2P network. Although a number of optimizations have been suggested recently based on the aforementioned techniques, little evidence is given on their scalability. In this paper we suggest a novel query-driven indexing strategy which generates and maintains only those index entries that are actually used for query processing. In our approach called Distributed Cache Table (DCT), by analogy with Distributed Hash Table (DHT), we suggest to abandon the difference between data indexing and query caching, and to store result sets (caches) for the most profitable queries. DCT employs a distributed index to efficiently locate caches that can answer a given multi-term query and broadcasts the query to all the peers only if no such caches were found. Evaluations on real data and query loads show that DCT converges to a high cache-hit ratio and indeed offers a large-scale distributed solution for storing and efficient querying of vast amounts of documents in the P2P setting. DCT achieves two orders of magnitude improvement in traffic consumption compared to a standard distributed single-term indexing approach.

  • Files
  • Details
  • Metrics
Type
conference paper
DOI
10.1145/1183579.1183586
Author(s)
Skobeltsyn, Gleb  
Aberer, Karl  
Date Issued

2006

Published in
P2PIR '06: Proceedings of the international workshop on Information retrieval in peer-to-peer networks
Start page

33

End page

40

Subjects

P2P DHT query-driven indexing caching multi-term query processing

URL

URL

http://lsirwww.epfl.ch/p2pir2006/
Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
LSIR  
Event nameEvent placeEvent date
P2PIR

Arlington VA, USA

Nov.11

Available on Infoscience
August 25, 2006
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/233885
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés