Abstract
We propose four probabilistic generative models for simultaneously modeling gene expression levels and Gene Ontology (GO) tags. Unlike previous approaches for using GO tags, the joint modeling framework allows the two sources of information to complement and reinforce each other. We fit our models to three time-course datasets collected to study biological processes, specifically blood vessel growth (angiogenesis) and mitotic cell cycles. The proposed models result in a joint clustering of genes and GO annotations. Different models group genes based on GO tags and their behavior over the entire time-course, within biological stages, or even individual time points. We show how such models can be used for biological stage boundary estimation de novo. We also evaluate our models on biological stage prediction accuracy of held out samples. Our results suggest that the models usually perform better when GO tag information is included.
Original language | English (US) |
---|---|
Pages (from-to) | 80-91 |
Number of pages | 12 |
Journal | Mathematical Biosciences |
Volume | 268 |
DOIs | |
State | Published - Oct 1 2015 |
Keywords
- Cluster
- Gene expression
- Microarray
- Model
- Ontology
ASJC Scopus subject areas
- Statistics and Probability
- Modeling and Simulation
- General Biochemistry, Genetics and Molecular Biology
- General Immunology and Microbiology
- General Agricultural and Biological Sciences
- Applied Mathematics