PicShark: Mitigating Metadata Scarcity Through Large-Scale P2P Collaboration

Cudre-Mauroux, Philippe; Budura, Adriana; Hauswirth, Manfred; Aberer, Karl

doi:10.1007/s00778-008-0103-4

Cudre-Mauroux, Philippe; Budura, Adriana; Hauswirth, Manfred; Aberer, Karl

2008

Download

Formats

Format
BibTeX
MARC
MARCXML
DublinCore
EndNote
NLM
RefWorks
RIS

Files

Abstract

With the commoditization of digital devices, personal information and media sharing is becoming a key application on the pervasive Web. In such a context, data annotation rather than data production is the main bottleneck. Metadata scarcity represents a major obstacle preventing effcient information processing in large and heterogeneous communities. However, social communities also open the door to new possibilities for addressing local metadata scarcity by taking advantage of global collections of resources. We propose to tackle the lack of metadata in large-scale distributed systems through a collaborative process leveraging on both content and metadata. We develop a community-based and self-organizing system called PicShark in which information entropy in terms of missing metadata is gradually alleviated through decentralized instance and schema matching. Our approach focuses on semi- structured metadata and confines computationally expensive operations to the edge of the network, while keeping distributed operations as simple as possible to ensure scalability. PicShark builds on structured Peer-to-Peer networks for distributed look-up operations, but extends the application of self-organization principles to the propagation of metadata and the creation of schema mappings. We demonstrate the practical applicability of our method in an image sharing scenario and provide experimental evidences illustrating the validity of our approach.

Details

Title PicShark: Mitigating Metadata Scarcity Through Large-Scale P2P Collaboration

Author(s) Cudre-Mauroux, Philippe ; Budura, Adriana ; Hauswirth, Manfred ; Aberer, Karl

Published in VLDB Journal

Volume 17

Issue 6

Pages 1371-1384

Date 2008

Keywords

Metadata Scarcity; Metadata Heterogeneity; Metadata Entropy; Peer-to-Peer Collaboration; Peer Data Management; NCCR-MICS; NCCR-MICS/CL4

DOI https://doi.org/10.1007/s00778-008-0103-4

Other identifier(s) DAR: 13834
View record in Web of Science

Laboratories LSIR

Record Appears in Scientific production and competences > I&C - School of Computer and Communication Sciences > IINFCOM > LSIR - Distributed Information Systems Laboratory
Scientific production and competences > I&C - School of Computer and Communication Sciences > MICS - Mobile Information & Communication Systems
Scientific production and competences > MICS - Mobile Information & Communication Systems
Peer-reviewed publications
Work produced at EPFL
Journal Articles
Published

Record creation date 2008-06-03

Actions

Preview

Select file: