Ensemble dimensionality reduction and feature gene extraction for single-cell RNA-seq data

Research output: Contribution to journalArticlepeer-review

29 Scopus citations

Abstract

Single-cell RNA sequencing (scRNA-seq) technologies allow researchers to uncover the biological states of a single cell at high resolution. For computational efficiency and easy visualization, dimensionality reduction is necessary to capture gene expression patterns in low-dimensional space. Here we propose an ensemble method for simultaneous dimensionality reduction and feature gene extraction (EDGE) of scRNA-seq data. Different from existing dimensionality reduction techniques, the proposed method implements an ensemble learning scheme that utilizes massive weak learners for an accurate similarity search. Based on the similarity matrix constructed by those weak learners, the low-dimensional embedding of the data is estimated and optimized through spectral embedding and stochastic gradient descent. Comprehensive simulation and empirical studies show that EDGE is well suited for searching for meaningful organization of cells, detecting rare cell types, and identifying essential feature genes associated with certain cell types.

Original languageEnglish (US)
Article number5853
JournalNature communications
Volume11
Issue number1
DOIs
StatePublished - Dec 2020

ASJC Scopus subject areas

  • General Chemistry
  • General Biochemistry, Genetics and Molecular Biology
  • General Physics and Astronomy

Fingerprint

Dive into the research topics of 'Ensemble dimensionality reduction and feature gene extraction for single-cell RNA-seq data'. Together they form a unique fingerprint.

Cite this