Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Query-Driven Indexing for Peer-to-Peer Text Retrieval
 
conference paper

Query-Driven Indexing for Peer-to-Peer Text Retrieval

Skobeltsyn, Gleb  
•
Luu, Toan
•
Podnar Zarko, Ivana
Show more
2007
WWW '07: Proceedings of the 16th international conference on World Wide Web
16th International World Wide Web Conference (WWW'2007)

We describe a query-driven indexing framework for scalable text retrieval over structured P2P networks. To cope with the bandwidth consumption problem that has been identified as the major obstacle for full-text retrieval in P2P networks, we truncate posting lists associated with indexing features to a constant size storing only top-k ranked document references. To compensate for the loss of information caused by the truncation, we extend the set of indexing features with carefully chosen term sets. Indexing term sets are selected based on the query statistics extracted from query logs, thus we index only such combinations that are a) frequently present in user queries and b) non-redundant w.r.t the rest of the index. The distributed index is compact and efficient as it constantly evolves adapting to the current query popularity distribution. Moreover, it is possible to control the tradeoff between the storage/bandwidth requirements and the quality of query answering by tuning the indexing parameters. Our theoretical analysis and experimental results indicate that we can indeed achieve scalable P2P text retrieval for very large document collections and deliver good retrieval performance.

  • Files
  • Details
  • Metrics
Type
conference paper
DOI
10.1145/1242572.1242757
Author(s)
Skobeltsyn, Gleb  
Luu, Toan
Podnar Zarko, Ivana
Rajman, Martin  
Aberer, Karl  
Date Issued

2007

Published in
WWW '07: Proceedings of the 16th international conference on World Wide Web
Start page

1185

End page

1186

Subjects

P2P

•

DHT

•

IR

•

Text Retrieval

•

Query-Driven Indexing

URL

URL

http://www2007.org
Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
LSIR  
Event nameEvent placeEvent date
16th International World Wide Web Conference (WWW'2007)

Banff, Canada

May 8-12, 2007

Available on Infoscience
March 9, 2007
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/3801
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés