Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Arabic Entity Graph Extraction Using Morphology, Finite State Machines, and Graph Transformations
 
conference paper

Arabic Entity Graph Extraction Using Morphology, Finite State Machines, and Graph Transformations

Makhlouta, Jad
•
Zaraket, Fadi
•
Harkous, Hamza  
2012
Proceedings of 13th International Conference on Intelligent Text Processing and Computational Linguistics, (CICLing-2012)
13th International Conference on Intelligent Text Processing and Computational Linguistics

Research on automatic recognition of named entities from Arabic text uses techniques that work well for the Latin based languages such as local grammars, statistical learning models, pattern matching, and rule-based techniques. These techniques boost their results by using application specific corpora, parallel language corpora, and morphological stemming analysis. We propose a method for extracting entities, events, and relations amongst them from Arabic text using a hierarchy of finite state machines driven by morphological features such as part of speech and gloss tags, and graph transformation algorithms.We evaluated our method on two natural language processing applications. We automated the extraction of narrators and narrator relations from several corpora of Islamic narration books (hadith). We automated the extraction of genealogical family trees from Biblical texts. In all applications, our method reports high precision and recall and learns lemmas about phrases that improve results.

  • Details
  • Metrics
Type
conference paper
DOI
10.1007/978-3-642-28604-9_25
Author(s)
Makhlouta, Jad
Zaraket, Fadi
Harkous, Hamza  
Date Issued

2012

Published in
Proceedings of 13th International Conference on Intelligent Text Processing and Computational Linguistics, (CICLing-2012)
Series title/Series vol.

Lecture Notes in Computer Science

Subjects

NLP

•

Practical Applications

•

Entity Extraction

•

Relational extraction

Note

Acceptance rate: 88/307=28.6%

Editorial or Peer reviewed

REVIEWED

Written at

OTHER

EPFL units
IIF  
ISIM  
Event nameEvent placeEvent date
13th International Conference on Intelligent Text Processing and Computational Linguistics

New Delhi, India

March 11–17, 2012

Available on Infoscience
December 11, 2011
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/73070
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés