conference paper

Agile text mining with Sherlok

Richardet, Renaud • Chappelier, Jean-Cedric • Tripathy, Shreejoy • Hill, Sean
2015
2015 IEEE International Conference on Big Data (Big Data)

The successful development of an intelligent text mining application requires the collaboration of two main stakeholders: subject matter experts and text miners. In this paper, we describe a new methodology, agile text mining, to improve that collaboration. Agile text mining is characterized by short development cycles, frequent task redefinition, and continuous performance monitoring through integration tests. We introduce Sherlok, a system supporting the development of agile text mining applications, and present an application that extracts mentions of neurons from a very large corpus of scientific articles. The resulting code and models are publicly available.

Type
conference paper
DOI
10.1109/BigData.2015.7363910
Author(s)
Richardet, Renaud
Chappelier, Jean-Cedric
Tripathy, Shreejoy
Hill, Sean  
Date Issued

2015

Publisher

IEEE

Published in
2015 IEEE International Conference on Big Data (Big Data)
Start page

1479

End page

1484

Subjects

Agile data science • Natural language processing • Text mining • Big data • UIMA

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
BBP-CORE  
BBP-GR-HILL  
Event name

2015 IEEE International Conference on Big Data (Big Data)

Event place

Santa Clara, CA, USA

Event date

29 October - 1 November 2015

Available on Infoscience
October 28, 2016
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/130818