Metrics and tools for consistent cohort discovery and financial analyses post-transition to ICD-10-CM

Andrew D. Boyd, Jianrong John Li, Colleen Kenost, Binoy Joese, Young Min Yang, Olympia A. Kalagidis, Ilir Zenku, Donald Saner, Neil Bahroos, Yves A. Lussier

Research output: Contribution to journalArticlepeer-review

14 Scopus citations


In the United States, International Classification of Disease Clinical Modification (ICD-9-CM, the ninth revision) diagnosis codes are commonly used to identify patient cohorts and to conduct financial analyses related to disease. In October 2015, the healthcare system of the United States will transition to ICD-10-CM (the tenth revision) diagnosis codes. One challenge posed to clinical researchers and other analysts is conducting diagnosis-related queries across datasets containing both coding schemes. Further, healthcare administrators will manage growth, trends, and strategic planning with these dually-coded datasets. The majority of the ICD-9-CM to ICD-10-CM translations are complex and nonreciprocal, creating convoluted representations and meanings. Similarly, mapping back from ICD-10-CM to ICD-9-CM is equally complex, yet different from mapping forward, as relationships are likewise nonreciprocal. Indeed, 10 of the 21 top clinical categories are complex as 78% of their diagnosis codes are labeled as "convoluted" by our analyses. Analysis and research related to external causes of morbidity, injury, and poisoning will face the greatest challenges due to 41 745 (90%) convolutions and a decrease in the number of codes. We created a web portal tool and translation tables to list all ICD-9-CM diagnosis codes related to the specific input of ICD-10-CM diagnosis codes and their level of complexity: "identity" (reciprocal), "class-to-subclass," "subclass-to-class," "convoluted," or "no mapping." These tools provide guidance on ambiguous and complex translations to reveal where reports or analyses may be challenging to impossible.

Original languageEnglish (US)
Pages (from-to)730-737
Number of pages8
JournalJournal of the American Medical Informatics Association
Issue number3
StatePublished - 2015


  • Financial analyses
  • ICD-10-CM
  • ICD-9-CM
  • Medical informatics
  • Network patterns
  • Patient cohort

ASJC Scopus subject areas

  • Health Informatics


Dive into the research topics of 'Metrics and tools for consistent cohort discovery and financial analyses post-transition to ICD-10-CM'. Together they form a unique fingerprint.

Cite this