A domain-independent rule-based framework for event extraction

Marco A. Valenzuela-Escárcega, Gus Hahn-Powell, Thomas Hicks, Mihai Surdeanu

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

67 Scopus citations

Abstract

We describe the design, development, and API of ODIN (Open Domain INformer), a domain-independent, rule-based event extraction (EE) framework. The proposed EE approach is: simple (most events are captured with simple lexico-syntactic patterns), powerful (the language can capture complex constructs, such as events taking other events as arguments, and regular expressions over syntactic graphs), robust (to recover from syntactic parsing errors, syntactic patterns can be freely mixed with surface, token-based patterns), and fast (the runtime environment processes 110 sentences/second in a real-world domain with a grammar of over 200 rules). We used this framework to develop a grammar for the biochemical domain, which approached human performance. Our EE framework is accompanied by a web-based user interface for the rapid development of event grammars and visualization of matches. The ODIN framework and the domain-specific grammars are available as open-source code.
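
To make the idea of a surface, token-based event pattern concrete, here is a minimal sketch in Python. It is not ODIN's rule language or API: the `Mention` class, the `TRIGGERS` lexicon, and the `extract_events` function are hypothetical names invented for this illustration, and the single hard-coded pattern (entity, trigger word, entity) stands in for what a real grammar would express declaratively.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Mention:
    """A labeled token span; `args` holds the mentions an event takes as arguments."""
    label: str
    start: int          # index of the first token in the span
    end: int            # index one past the last token in the span
    args: tuple = ()


# Hypothetical trigger lexicon (an assumption for this sketch, not taken from ODIN).
TRIGGERS = {"phosphorylates": "Phosphorylation", "binds": "Binding"}


def extract_events(tokens, entities):
    """Match the surface pattern ENTITY TRIGGER ENTITY over a token list."""
    by_start = {m.start: m for m in entities}
    by_end = {m.end: m for m in entities}
    events = []
    for i, token in enumerate(tokens):
        label = TRIGGERS.get(token.lower())
        if label is None:
            continue
        agent = by_end.get(i)        # entity ending right before the trigger
        theme = by_start.get(i + 1)  # entity starting right after the trigger
        if agent is not None and theme is not None:
            events.append(Mention(label, agent.start, theme.end, (agent, theme)))
    return events


tokens = "MEK phosphorylates ERK".split()
entities = [Mention("Protein", 0, 1), Mention("Protein", 2, 3)]
events = extract_events(tokens, entities)
```

Because an extracted event is itself a `Mention`, it could in principle be passed back in as an argument candidate for a later rule, which is one simple way to model events that take other events as arguments.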

Original language: English (US)
Title of host publication: ACL-IJCNLP 2015 - 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing, Proceedings of System Demonstrations
Publisher: Association for Computational Linguistics (ACL)
Pages: 127-132
Number of pages: 6
ISBN (Electronic): 9781941643990
State: Published - 2015
Event: 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing, ACL-IJCNLP 2015 - Beijing, China
Duration: Jul 26 2015 to Jul 31 2015

Publication series

Name: ACL-IJCNLP 2015 - 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing, Proceedings of System Demonstrations

Other

Other: 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing, ACL-IJCNLP 2015
Country/Territory: China
City: Beijing
Period: 7/26/15 to 7/31/15

ASJC Scopus subject areas

  • Language and Linguistics
  • Pollution
