Transferring Legal Natural Language Inference Model from a US State to Another: What Makes It So Hard?

Alice Saebom Kwak, Gaetano Vincent Forte, Derek E. Bambauer, Mihai Surdeanu

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2 Scopus citations

Abstract

This study investigates whether a legal natural language inference (NLI) model trained on the data from one US state can be transferred to another state. We fine-tuned a pre-trained model on the task of evaluating the validity of legal will statements, once with the dataset containing the Tennessee wills and once with the dataset containing the Idaho wills. Each model's performance on the in-domain setting and the out-of-domain setting are compared to see if the models can across the states. We found that the model trained on one US state can be mostly transferred to another state. However, it is clear that the model's performance drops in the out-of-domain setting. The F1 scores of the Tennessee model and the Idaho model are 96.41 and 92.03 when predicting the data from the same state, but they drop to 66.32 and 81.60 when predicting the data from another state. Subsequent error analysis revealed that there are two major sources of errors. First, the model fails to recognize equivalent laws across states when there are stylistic differences between laws. Second, difference in statutory section numbering system between the states makes it difficult for the model to locate laws relevant to the cases being predicted on. This analysis provides insights on how the future NLI system can be improved. Also, our findings offer empirical support to legal experts advocating the standardization of legal documents.

Original languageEnglish (US)
Title of host publicationNLLP 2023 - Natural Legal Language Processing Workshop 2023, Proceedings of the Workshop
EditorsDaniel Preotiuc-Pietro, Catalina Goanta, Ilias Chalkidis, Leslie Barrett, Gerasimos Spanakis, Nikolaos Aletras
PublisherAssociation for Computational Linguistics (ACL)
Pages215-222
Number of pages8
ISBN (Electronic)9798891760547
StatePublished - 2023
Event5th Natural Legal Language Processing Workshop, NLLP 2023 - Singapore, Singapore
Duration: Dec 7 2023 → …

Publication series

NameNLLP 2023 - Natural Legal Language Processing Workshop 2023, Proceedings of the Workshop

Conference

Conference5th Natural Legal Language Processing Workshop, NLLP 2023
Country/TerritorySingapore
CitySingapore
Period12/7/23 → …

ASJC Scopus subject areas

  • Language and Linguistics
  • Computational Theory and Mathematics
  • Software
  • Linguistics and Language

Fingerprint

Dive into the research topics of 'Transferring Legal Natural Language Inference Model from a US State to Another: What Makes It So Hard?'. Together they form a unique fingerprint.

Cite this