Infoscience

conference paper

ResIn: A Combination of Result Caching and Index Pruning for High-performance Web Search Engines

Skobeltsyn, Gleb • Junqueira, Flavio • Plachouras, Vassilis • et al.
2008
The 31st Annual International ACM SIGIR Conference
SIGIR

Results caching is an efficient technique for reducing the query processing load, and hence it is commonly used in real search engines. The maximum hit rate of this technique, however, is bounded by the large fraction of singleton queries, which is an important limitation. In this paper we propose ResIn, an architecture that combines results caching and index pruning to overcome this limitation. We argue that results caching is an inexpensive and efficient way to reduce the query processing load, and show that it is cheaper to implement than a pruned index. At the same time, we show that index pruning performance is fundamentally affected by the changes in query traffic that the results cache induces. We experiment with real query logs and a large document collection, and show that combining both techniques reduces query processing costs efficiently and is thus practical for use in Web search engines.
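
The hit-rate limitation described in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: the function name `infinite_cache_stats` and the toy query log are hypothetical, and the cache is assumed to be unbounded and never invalidated. Under those assumptions, every first occurrence of a query is a miss, so singleton queries can never hit and the hit rate is at most one minus the singleton fraction. The list of misses is the traffic a component behind the cache (such as a pruned index) would see, and its distribution differs from that of the original stream.

```python
from collections import Counter


def infinite_cache_stats(queries):
    """Simulate an unbounded results cache over a query stream.

    Returns (hit_rate, singleton_fraction, misses), where `misses` is the
    sub-stream of queries that the cache could not answer.
    """
    seen = set()
    misses = []
    for q in queries:
        if q not in seen:
            # First occurrence: compulsory miss, forwarded past the cache.
            seen.add(q)
            misses.append(q)
    total = len(queries)
    if total == 0:
        return 0.0, 0.0, misses
    hit_rate = (total - len(misses)) / total
    counts = Counter(queries)
    # Queries that occur exactly once in the whole stream can never hit.
    singleton_fraction = sum(1 for c in counts.values() if c == 1) / total
    return hit_rate, singleton_fraction, misses


# Hypothetical toy log, for illustration only.
log = ["news", "maps", "news", "rare query a",
       "news", "maps", "rare query b", "weather"]
hit_rate, singleton_fraction, misses = infinite_cache_stats(log)
print(hit_rate, singleton_fraction, misses)
```

In this toy log the popular queries appear only once each in the miss stream, so the traffic behind the cache is dominated by rare queries. This is the kind of shift in query traffic that the abstract says affects index pruning.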

Name: Skobeltsyn-SIGIR08.pdf
Access type: openaccess
Size: 1.33 MB
Format: Adobe PDF
Checksum (MD5): a77fab14cc6da343a2f88205642cd963


Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, all rights reserved