TY - GEN
T1 - Snaptogrid
T2 - 15th Workshop on Biomedical Natural Language Processing, BioNLP 2016
AU - Valenzuela-Escárcega, Marco A.
AU - Hahn-Powell, Gus
AU - Bell, Dane
AU - Surdeanu, Mihai
N1 - Funding Information:
This work was funded by the Defense Advanced Research Projects Agency (DARPA) Big Mechanism program under ARO contract W911NF-14-1-0395.
Publisher Copyright:
© BioNLP 2016. All rights reserved.
PY - 2016
Y1 - 2016
N2 - We propose an approach for biomedical information extraction that marries the advantages of machine learning models, e.g., learning directly from data, with the benefits of rule-based approaches, e.g., interpretability. Our approach starts by training a feature-based statistical model, then converts this model to a rule-based variant by converting its features to rules, and "snapping to grid" the feature weights to discrete votes. In doing so, our proposal takes advantage of the large body of work in machine learning, but it produces an interpretable model, which can be directly edited by experts. We evaluate our approach on the BioNLP 2009 event extraction task. Our results show that there is a small performance penalty when converting the statistical model to rules, but the gain in interpretability compensates for that: with minimal effort, human experts improve this model to have similar performance to the statistical model that served as starting point.
AB - We propose an approach for biomedical information extraction that marries the advantages of machine learning models, e.g., learning directly from data, with the benefits of rule-based approaches, e.g., interpretability. Our approach starts by training a feature-based statistical model, then converts this model to a rule-based variant by converting its features to rules, and "snapping to grid" the feature weights to discrete votes. In doing so, our proposal takes advantage of the large body of work in machine learning, but it produces an interpretable model, which can be directly edited by experts. We evaluate our approach on the BioNLP 2009 event extraction task. Our results show that there is a small performance penalty when converting the statistical model to rules, but the gain in interpretability compensates for that: with minimal effort, human experts improve this model to have similar performance to the statistical model that served as starting point.
UR - http://www.scopus.com/inward/record.url?scp=85029752792&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85029752792&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85029752792
T3 - BioNLP 2016 - Proceedings of the 15th Workshop on Biomedical Natural Language Processing
SP - 56
EP - 65
BT - BioNLP 2016 - Proceedings of the 15th Workshop on Biomedical Natural Language Processing
A2 - Cohen, Kevin Bretonnel
A2 - Demner-Fushman, Dina
A2 - Ananiadou, Sophia
A2 - Tsujii, Jun-ichi
PB - Association for Computational Linguistics (ACL)
Y2 - 12 August 2016
ER -