TY - GEN
T1 - A domain-independent rule-based framework for event extraction
AU - Valenzuela-Escárcega, Marco A.
AU - Hahn-Powell, Gus
AU - Hicks, Thomas
AU - Surdeanu, Mihai
N1 - Publisher Copyright:
© 2015 ACL and AFNLP.
PY - 2015
Y1 - 2015
N2 - We describe the design, development, and API of ODIN (Open Domain INformer), a domainindependent, rule-based event extraction (EE) framework. The proposed EE approach is: simple (most events are captured with simple lexico-syntactic patterns), powerful (the language can capture complex constructs, such as events taking other events as arguments, and regular expressions over syntactic graphs), robust (to recover from syntactic parsing errors, syntactic patterns can be freely mixed with surface, token-based patterns), and fast (the runtime environment processes 110 sentences/ second in a real-world domain with a grammar of over 200 rules). We used this framework to develop a grammar for the biochemical domain, which approached human performance. Our EE framework is accompanied by a web-based user interface for the rapid development of event grammars and visualization of matches. The ODIN framework and the domain-specific grammars are available as open-source code.
AB - We describe the design, development, and API of ODIN (Open Domain INformer), a domainindependent, rule-based event extraction (EE) framework. The proposed EE approach is: simple (most events are captured with simple lexico-syntactic patterns), powerful (the language can capture complex constructs, such as events taking other events as arguments, and regular expressions over syntactic graphs), robust (to recover from syntactic parsing errors, syntactic patterns can be freely mixed with surface, token-based patterns), and fast (the runtime environment processes 110 sentences/ second in a real-world domain with a grammar of over 200 rules). We used this framework to develop a grammar for the biochemical domain, which approached human performance. Our EE framework is accompanied by a web-based user interface for the rapid development of event grammars and visualization of matches. The ODIN framework and the domain-specific grammars are available as open-source code.
UR - http://www.scopus.com/inward/record.url?scp=84944312138&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84944312138&partnerID=8YFLogxK
U2 - 10.3115/v1/p15-4022
DO - 10.3115/v1/p15-4022
M3 - Conference contribution
AN - SCOPUS:84944312138
T3 - ACL-IJCNLP 2015 - 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing, Proceedings of System Demonstrations
SP - 127
EP - 132
BT - ACL-IJCNLP 2015 - 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing, Proceedings of System Demonstrations
PB - Association for Computational Linguistics (ACL)
T2 - 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing, ACL-IJCNLP 2015
Y2 - 26 July 2015 through 31 July 2015
ER -