conference paper

Dynamic faceted search for discovery-driven analysis

Dash, Debabrata
Rao, Jun
Megiddo, Nimrod
Ailamaki, Anastasia
Lohman, Guy M.
2008
Proceedings of the 17th ACM Conference on Information and Knowledge Management
Conference on Information and Knowledge Management (CIKM '08)

We propose a dynamic faceted search system for discovery-driven analysis on data with both textual content and structured attributes. From a keyword query, we want to dynamically select a small set of "interesting" attributes and present aggregates on them to a user. Similar to work in OLAP exploration, we define "interestingness" as how surprising an aggregated value is, based on a given expectation. We make two new contributions: a novel "navigational" expectation that is particularly useful in the context of faceted search, and a novel interestingness measure through judicious application of p-values. Through a user survey, we find the new expectation and interestingness metric quite effective. We develop an efficient dynamic faceted search system by improving a popular open source engine, Solr. Our system exploits compressed bitmaps for caching the posting lists in an inverted index, and a novel directory structure called a bitset tree for fast bitset intersection. We conduct a comprehensive experimental study on large real data sets and show that our engine performs 2 to 3 times faster than Solr.
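
To make the idea of p-value based interestingness concrete, the following is a minimal sketch and not the paper's actual measure, which the abstract does not specify. It assumes the expectation is that a facet value occurs in the query result at the same rate as in the whole corpus, and it scores each value by a one-sided hypergeometric tail probability. The function names `hypergeom_upper_tail` and `rank_facets`, and the example facet values, are hypothetical and chosen only for illustration. The "navigational" expectation proposed in the paper is not modeled here.

```python
from math import comb


def hypergeom_upper_tail(k, corpus_size, corpus_count, result_size):
    """P(X >= k) when result_size documents are drawn without replacement
    from corpus_size documents, corpus_count of which carry the facet value.
    Assumption: a corpus-wide baseline stands in for the 'expectation'."""
    upper = min(corpus_count, result_size)
    total = comb(corpus_size, result_size)
    favorable = sum(
        comb(corpus_count, i) * comb(corpus_size - corpus_count, result_size - i)
        for i in range(k, upper + 1)
    )
    return favorable / total


def rank_facets(corpus_counts, result_counts, corpus_size, result_size):
    """Order facet values from most to least surprising (smallest p-value first)."""
    scored = []
    for value, k in result_counts.items():
        p = hypergeom_upper_tail(k, corpus_size, corpus_counts[value], result_size)
        scored.append((p, value))
    return [value for _, value in sorted(scored)]


# Hypothetical counts: 1000 documents in the corpus, 50 match the keyword query.
corpus_counts = {"year=2008": 100, "lang=en": 900}
result_counts = {"year=2008": 20, "lang=en": 45}
ranking = rank_facets(corpus_counts, result_counts, 1000, 50)
print(ranking)
```

In this made-up example, "year=2008" appears 20 times where about 5 would be expected, so it ranks ahead of "lang=en", whose count matches its corpus-wide rate. A one-sided test only flags over-representation; a two-sided variant would be needed to surface unexpectedly rare values as well.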

Type: conference paper
DOI: 10.1145/1458082.1458087
Author(s): Dash, Debabrata; Rao, Jun; Megiddo, Nimrod; Ailamaki, Anastasia; Lohman, Guy M.
Date Issued: 2008
Published in: Proceedings of the 17th ACM Conference on Information and Knowledge Management
Start page: 3
End page: 12
Editorial or Peer reviewed: REVIEWED
Written at: OTHER
EPFL units: DIAS
Event name: Conference on Information and Knowledge Management (CIKM '08)
Event place: Napa Valley, California, USA
Event date: October 26-30, 2008
Available on Infoscience: January 23, 2009
Use this identifier to reference this record: https://infoscience.epfl.ch/handle/20.500.14299/34327