Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Journal articles
  4. PicShark: Mitigating Metadata Scarcity Through Large-Scale P2P Collaboration
 
research article

PicShark: Mitigating Metadata Scarcity Through Large-Scale P2P Collaboration

Cudre-Mauroux, Philippe  
•
Budura, Adriana  
•
Hauswirth, Manfred  
Show more
2008
VLDB Journal

With the commoditization of digital devices, personal information and media sharing is becoming a key application on the pervasive Web. In such a context, data annotation rather than data production is the main bottleneck. Metadata scarcity represents a major obstacle preventing effcient information processing in large and heterogeneous communities. However, social communities also open the door to new possibilities for addressing local metadata scarcity by taking advantage of global collections of resources. We propose to tackle the lack of metadata in large-scale distributed systems through a collaborative process leveraging on both content and metadata. We develop a community-based and self-organizing system called PicShark in which information entropy in terms of missing metadata is gradually alleviated through decentralized instance and schema matching. Our approach focuses on semi- structured metadata and confines computationally expensive operations to the edge of the network, while keeping distributed operations as simple as possible to ensure scalability. PicShark builds on structured Peer-to-Peer networks for distributed look-up operations, but extends the application of self-organization principles to the propagation of metadata and the creation of schema mappings. We demonstrate the practical applicability of our method in an image sharing scenario and provide experimental evidences illustrating the validity of our approach.

  • Files
  • Details
  • Metrics
Type
research article
DOI
10.1007/s00778-008-0103-4
Web of Science ID

WOS:000259961800003

Author(s)
Cudre-Mauroux, Philippe  
Budura, Adriana  
Hauswirth, Manfred  
Aberer, Karl  
Date Issued

2008

Published in
VLDB Journal
Volume

17

Issue

6

Start page

1371

End page

1384

Subjects

Metadata Scarcity

•

Metadata Heterogeneity

•

Metadata Entropy

•

Peer-to-Peer Collaboration

•

Peer Data Management

•

NCCR-MICS

•

NCCR-MICS/CL4

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
LSIR  
Available on Infoscience
June 3, 2008
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/26053
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés