PhyImpute and UniFracImpute: two imputation approaches incorporating phylogeny information for microbial count data

Qianwen Luo, Shanshan Zhang, Hamza Butt, Yin Chen, Hongmei Jiang, Lingling An

Research output: Contribution to journalArticlepeer-review

Abstract

Sequencing-based microbial count data analysis is a challenging task due to the presence of numerous non-biological zeros, which can impede downstream analysis. To tackle this issue, we introduce two novel approaches, PhyImpute and UniFracImpute, which leverage similar microbial samples to identify and impute non-biological zeros in microbial count data. Our proposed methods utilize the probability of non-biological zeros and phylogenetic trees to estimate sample-to-sample similarity, thus addressing this challenge. To evaluate the performance of our proposed methods, we conduct experiments using both simulated and real microbial data. The results demonstrate that PhyImpute and UniFracImpute outperform existing methods in recovering the zeros and empowering downstream analyses such as differential abundance analysis, and disease status classification.

Original languageEnglish (US)
Article numberbbae653
JournalBriefings in bioinformatics
Volume26
Issue number1
DOIs
StatePublished - Jan 1 2025

Keywords

  • imputation
  • metagenomics
  • microbiome
  • phylogenetic tree

ASJC Scopus subject areas

  • Information Systems
  • Molecular Biology

Fingerprint

Dive into the research topics of 'PhyImpute and UniFracImpute: two imputation approaches incorporating phylogeny information for microbial count data'. Together they form a unique fingerprint.

Cite this