research article

Entity-based Classification of Twitter Messages

Yerva, Surender Reddy • Miklós, Zoltán • Aberer, Karl
2012
International Journal of Computer Science & Applications

Twitter is a popular micro-blogging service on the Web, where people can enter short messages, which then become visible to some other users of the service. While the topics of these messages vary, in many of them users express their opinions about companies or their products. These messages are a rich source of information for companies for sentiment analysis or opinion mining. There is, however, a major obstacle to analyzing the messages directly: as company names are often ambiguous (e.g. apple, the fruit vs. Apple Inc.), one first needs to identify which messages are related to the company. In this paper we address this question. We present various techniques for classifying tweets that contain a given keyword according to whether or not they are related to a particular company with that name. We first present simple techniques that make use of company profiles, which we created semi-automatically from external Web sources. Our advanced techniques take ambiguity estimations into account and also automatically extend the company profiles from the Twitter stream itself. We demonstrate the effectiveness of our methods through an extensive set of experiments. Moreover, we extensively analyze the sources of errors in the classification. The analysis not only brings further improvement, but also enables more efficient use of human input.
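
To make the idea of profile-based classification concrete, the following is a minimal sketch and not the authors' method. It assumes a company profile is simply two keyword sets (terms that suggest the company, terms that suggest another meaning) plus a single ambiguity estimate used as a tie-breaker. The class `CompanyProfile`, the functions `score` and `is_related`, and all keywords and the prior value are hypothetical choices made for illustration.

```python
import re
from dataclasses import dataclass


@dataclass
class CompanyProfile:
    """Hypothetical profile: keyword evidence for and against the company sense."""
    name: str
    related: set        # terms suggesting the tweet is about the company
    unrelated: set      # terms suggesting another meaning of the name
    prior: float = 0.5  # assumed ambiguity estimate: share of tweets about the company


def tokenize(text):
    """Lowercase the text and return its set of alphanumeric tokens."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def score(tweet, profile):
    """Count related-term matches minus unrelated-term matches."""
    tokens = tokenize(tweet) - {profile.name.lower()}
    return len(tokens & profile.related) - len(tokens & profile.unrelated)


def is_related(tweet, profile):
    """Decide by keyword evidence; fall back to the prior when there is none."""
    s = score(tweet, profile)
    if s != 0:
        return s > 0
    return profile.prior >= 0.5


# Illustrative profile; the keywords and prior are invented for this example.
apple = CompanyProfile(
    name="apple",
    related={"iphone", "ipad", "mac", "ios", "store"},
    unrelated={"pie", "juice", "fruit", "tree", "eat"},
    prior=0.6,
)

print(is_related("Just bought the new iPhone at the Apple store", apple))
print(is_related("Baking an apple pie with fresh fruit", apple))
```

In this sketch the prior only matters when a tweet contains no profile terms at all, which is one simple way an ambiguity estimate could enter the decision. Extending the profile from the tweet stream, as the abstract describes, is not covered here.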

Type: research article
Author(s): Yerva, Surender Reddy; Miklós, Zoltán; Aberer, Karl
Date Issued: 2012
Published in: International Journal of Computer Science & Applications
Volume: 9
Issue: 2
Start page: 88
End page: 115
Subjects: Entity • Twitter • Classification • Company Profiles • Disambiguation
Editorial or Peer reviewed: REVIEWED
Written at: EPFL
EPFL units: LSIR
Available on Infoscience: February 6, 2012
Use this identifier to reference this record: https://infoscience.epfl.ch/handle/20.500.14299/77547