Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Polisis: Automated Analysis and Presentation of Privacy Policies Using Deep Learning
 
conference paper

Polisis: Automated Analysis and Presentation of Privacy Policies Using Deep Learning

Harkous, Hamza  
•
Fawaz, Kassem
•
Lebret, Remi  
Show more
January 1, 2018
Proceedings Of The 27Th Usenix Security Symposium
27th USENIX Security Symposium

Privacy policies are the primary channel through which companies inform users about their data collection and sharing practices. These policies are often long and difficult to comprehend. Short notices based on information extracted from privacy policies have been shown to be useful but face a significant scalability hurdle, given the number of policies and their evolution over time. Companies, users, researchers, and regulators still lack usable and scalable tools to cope with the breadth and depth of privacy policies. To address these hurdles, we propose an automated framework for privacy policy analysis (Polisis). It enables scalable, dynamic, and multi-dimensional queries on natural language privacy policies. At the core of Polisis is a privacy-centric language model, built with 130K privacy policies, and a novel hierarchy of neural-network classifiers that accounts for both high-level aspects and fine-grained details of privacy practices. We demonstrate Polisis' modularity and utility with two applications supporting structured and free-form querying. The structured querying application is the automated assignment of privacy icons from privacy policies. With Polisis, we can achieve an accuracy of 88.4% on this task. The second application, PriBot, is the first free-form question-answering system for privacy policies. We show that PriBot can produce a correct answer among its top-3 results for 82% of the test questions. Using an MTurk user study with 700 participants, we show that at least one of PriBot's top-3 answers is relevant to users for 89% of the test questions.

  • Details
  • Metrics
Type
conference paper
Web of Science ID

WOS:000485139900032

Author(s)
Harkous, Hamza  
Fawaz, Kassem
Lebret, Remi  
Schaub, Florian
Shin, Kang G.
Aberer, Karl  
Date Issued

2018-01-01

Publisher

USENIX ASSOC

Publisher place

Berkeley

Published in
Proceedings Of The 27Th Usenix Security Symposium
ISBN of the book

978-1-939133-04-5

Start page

531

End page

548

Subjects

Computer Science, Information Systems

•

Computer Science

•

online privacy

•

legal

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
LSIR  
Event nameEvent placeEvent date
27th USENIX Security Symposium

Baltimore, MD

Aug 15-17, 2018

Available on Infoscience
September 29, 2019
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/161667
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés