SHAP-Prioritised Machine Learning for Diagnostic-Grade Prediction of Lung Function

  • Oliver Pitts
  • , Salman Siddiqui
  • , Anand Shah
  • , Alex Bell
  • , Christopher Brightling
  • , David Singh
  • , Janwillem Kocks
  • , Leonardo Fabbri
  • , Alberto Papi
  • , Klaus Rabe
  • , Maarten Van Den Berge
  • , Monica Kraft
  • , Rossella Arcucci

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Automated machine learning (ML) can streamline the characterisation and management of chronic airway conditions. With the advent of quantitative CT (qCT) imaging allowing precise extraction of structural features from scans, assessment of airway obstruction levels could be automated to compliment traditional testing. This “feature known” approach has the added potential benefit of identifying key structure-function relationships through explainability measures. We therefore aimed to develop inverse models to estimate spirometry parameters from high-dimensional quantitative data using these structural metrics as constraints. With the ATLANTIS (NCT02123667) dataset, this paper experiments with a selection of ML methods, specifically k-nearest neighbours (kNN), random forest (RF) and support vector machine (SVM), to predict spirometry values (Forced Expiratory Volume (FEV1), Forced Vital Capacity (FVC) and FEV1/FVC). The dynamic ratio FEV1/FVC was predicted better by all models than FEV1 or FVC. Results show effective counteraction to high-dimensionality through iterative feature refinement guided by SHapley Additive exPlanations (SHAP), and to limited training data through dynamic Gaussian noise (DGN). Diagnostic-grade prediction accuracy was achieved with DGN SHAP sequential feature selection (SFS)-kNN at 1.64% MRE with 37/76 features. A selection of typical variables including expiratory tissue density and lung volume, vasculature and airway geometries were seen to be important for prediction. This approach therefore can not only predict pulmonary function, but also extract useful structural information in a dynamic airway system through linking back to personalised abnormalities.

Original languageEnglish (US)
Title of host publicationComputational Science – ICCS 2025 Workshops - 25th International Conference, 2025, Proceedings
EditorsMaciej Paszynski, Amanda S. Barnard, Yongjie Jessica Zhang
PublisherSpringer Science and Business Media Deutschland GmbH
Pages73-85
Number of pages13
ISBN (Print)9783031975660
DOIs
StatePublished - 2025
Externally publishedYes
EventWorkshops on Computational Science, which were co-organized with the 25th International Conference on Computational Science, ICCS 2025 - Singapore, Singapore
Duration: Jul 7 2025Jul 9 2025

Publication series

NameLecture Notes in Computer Science
Volume15910 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

ConferenceWorkshops on Computational Science, which were co-organized with the 25th International Conference on Computational Science, ICCS 2025
Country/TerritorySingapore
CitySingapore
Period7/7/257/9/25

Keywords

  • Airway Diseases
  • Machine Learning
  • Quantitative CT
  • SHAP

ASJC Scopus subject areas

  • Theoretical Computer Science
  • General Computer Science

Fingerprint

Dive into the research topics of 'SHAP-Prioritised Machine Learning for Diagnostic-Grade Prediction of Lung Function'. Together they form a unique fingerprint.

Cite this