TY - GEN
T1 - Risk analysis for intellectual property litigation
AU - Surdeanu, Mihai
AU - Nallapati, Ramesh
AU - Gregory, George
AU - Walker, Joshua
AU - Manning, Christopher D.
PY - 2011
Y1 - 2011
N2 - We introduce the problem of risk analysis for Intellectual Property (IP) lawsuits. More specifically, we focus on estimating the risk for participating parties using solely prior factors, i. e., historical and concurrent behavior of the entities involved in the case. This work represents a first step towards building a comprehensive legal risk assessment system for parties involved in litigation. This technology will allow parties to optimize their case parameters to minimize their own risk, or to settle disputes out of court and thereby ease the burden on the judicial system. In addition, it will also help U.S. courts detect and fix any inherent biases in the system. We model risk estimation as a relational classification problem using conditional random fields [6] to jointly estimate the risks of concurrent cases. We evaluate our model on data collected by the Stanford Intellectual Property Litigation Clearinghouse, which consists of over 4,200 IP lawsuits filed across 88 U.S. federal districts and ranging over 8 years, probably the largest legal data set reported in data mining research. Despite being agnostic to the merits of the case, our best model achieves a classification accuracy of 64%, 22% (relative) higher than the majority-class baseline.
AB - We introduce the problem of risk analysis for Intellectual Property (IP) lawsuits. More specifically, we focus on estimating the risk for participating parties using solely prior factors, i. e., historical and concurrent behavior of the entities involved in the case. This work represents a first step towards building a comprehensive legal risk assessment system for parties involved in litigation. This technology will allow parties to optimize their case parameters to minimize their own risk, or to settle disputes out of court and thereby ease the burden on the judicial system. In addition, it will also help U.S. courts detect and fix any inherent biases in the system. We model risk estimation as a relational classification problem using conditional random fields [6] to jointly estimate the risks of concurrent cases. We evaluate our model on data collected by the Stanford Intellectual Property Litigation Clearinghouse, which consists of over 4,200 IP lawsuits filed across 88 U.S. federal districts and ranging over 8 years, probably the largest legal data set reported in data mining research. Despite being agnostic to the merits of the case, our best model achieves a classification accuracy of 64%, 22% (relative) higher than the majority-class baseline.
UR - http://www.scopus.com/inward/record.url?scp=80052414518&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=80052414518&partnerID=8YFLogxK
U2 - 10.1145/2018358.2018375
DO - 10.1145/2018358.2018375
M3 - Conference contribution
AN - SCOPUS:80052414518
SN - 9781450307550
T3 - Proceedings of the International Conference on Artificial Intelligence and Law
SP - 116
EP - 120
BT - 13th International Conference on Artificial Intelligence and Law, ICAIL 2011 - Proceedings of the Conference
T2 - 13th International Conference on Artificial Intelligence and Law, ICAIL 2011
Y2 - 6 June 2011 through 10 June 2011
ER -