Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Journal articles
  4. Crowdbreaks: Tracking Health Trends Using Public Social Media Data and Crowdsourcing
 
research article

Crowdbreaks: Tracking Health Trends Using Public Social Media Data and Crowdsourcing

Mueller, Martin M.  
•
Salathe, Marcel  
April 12, 2019
Frontiers In Public Health

In the past decade, tracking health trends using social media data has shown great promise, due to a powerful combination of massive adoption of social media around the world, and increasingly potent hardware and software that enables us to work with these new big data streams. At the same time, many challenging problems have been identified. First, there is often a mismatch between how rapidly online data can change, and how rapidly algorithms are updated, which means that there is limited reusability for algorithms trained on past data as their performance decreases over time. Second, much of the work is focusing on specific issues during a specific past period in time, even though public health institutions would need flexible tools to assess multiple evolving situations in real time. Third, most tools providing such capabilities are proprietary systems with little algorithmic or data transparency, and thus little buy-in from the global public health and research community. Here, we introduce Crowdbreaks, an open platform which allows tracking of health trends by making use of continuous crowdsourced labeling of public social media content. The system is built in a way which automatizes the typical workflow from data collection, filtering, labeling and training of machine learning classifiers and therefore can greatly accelerate the research process in the public health domain. This work describes the technical aspects of the platform, thereby covering the functionalities at its current state and exploring its future use cases and extensions.

  • Files
  • Details
  • Metrics
Type
research article
DOI
10.3389/fpubh.2019.00081
Web of Science ID

WOS:000464546300001

Author(s)
Mueller, Martin M.  
Salathe, Marcel  
Date Issued

2019-04-12

Published in
Frontiers In Public Health
Volume

7

Start page

81

Subjects

Public, Environmental & Occupational Health

•

Public, Environmental & Occupational Health

•

data mining

•

natural language processing (nlp)

•

crowdsourcing

•

social media data

•

sentiment analysis (sa)

•

vaccination

•

data stream analytics

•

machine learning

•

tool

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
UPSALATHE1  
Available on Infoscience
April 26, 2019
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/156132
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés