Comparative shotgun proteomics using spectral count data and quasi-likelihood modeling

Ming Li, William Gray, Haixia Zhang, Christine H. Chung, Dean Billheimer, Wendell G. Yarbrough, Daniel C. Liebler, Yu Shyr, Robbert J.C. Slebos

Research output: Contribution to journal › Article › peer-review

88 Scopus citations


Shotgun proteomics, based on liquid chromatography-tandem mass spectrometry (LC-MS/MS), provides the most powerful analytical platform for global inventory of complex proteomes and allows a global analysis of protein changes. Nevertheless, sampling of complex proteomes by current shotgun proteomics platforms is incomplete, and this contributes to variability in the assessment of peptide and protein inventories by spectral counting approaches. Shotgun proteomics data therefore pose challenges when comparing proteomes from different biological states. We developed an analysis strategy using quasi-likelihood Generalized Linear Modeling (GLM), included in a graphical interface software package (QuasiTel) that reads standard output from protein assemblies created by IDPicker, an HTML-based user interface to query shotgun proteomic data sets. This approach was compared to four other statistical analysis strategies: Student's t test, Wilcoxon rank test, Fisher's Exact test, and Poisson-based GLM. We analyzed the performance of these tests in identifying differences in protein levels based on spectral counts in a shotgun data set in which equimolar amounts of 48 human proteins were spiked at different levels into whole yeast lysates. Both GLM approaches and Fisher's Exact test performed adequately, each with its own limitations. We subsequently used the quasi-likelihood approach to compare the proteomes of normal tonsil epithelium and head and neck squamous cell carcinoma (HNSCC) and identified 86 proteins with differential spectral counts between the two tissues. We selected 18 proteins from this comparison for verification of protein levels between the individual normal and tumor tissues using liquid chromatography-multiple reaction monitoring mass spectrometry (LC-MRM-MS). This analysis confirmed the magnitude and direction of the protein expression differences in all 6 proteins for which reliable data could be obtained. Our analysis demonstrates that shotgun proteomic data sets from different tissue phenotypes are sufficiently rich in quantitative information and that statistically significant differences in protein spectral counts reflect the underlying biology of the samples.
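
To make the idea of a quasi-likelihood comparison concrete, the following is a minimal sketch of a two-group quasi-Poisson test for a single protein. It is not the QuasiTel implementation, which the abstract does not describe in detail. The function name, its arguments, and the use of each replicate's total spectral count as an offset are all assumptions made for illustration. The sketch fits a Poisson log-linear model in closed form, estimates the dispersion from the Pearson statistic, and forms a quasi-likelihood F statistic.

```python
import math


def quasi_poisson_two_group(counts_a, totals_a, counts_b, totals_b):
    """Hypothetical two-group quasi-Poisson comparison for one protein.

    counts_*: spectral counts of the protein in each replicate of a group
    totals_*: total spectral counts of each replicate (used as an offset)
    Returns (log2 rate ratio B/A, dispersion estimate, quasi-F statistic).
    """
    counts = list(counts_a) + list(counts_b)
    totals = list(totals_a) + list(totals_b)
    n = len(counts)
    if n <= 2:
        raise ValueError("need more than two replicates in total")
    if sum(counts_a) == 0 or sum(counts_b) == 0:
        raise ValueError("each group needs at least one nonzero count")

    # For a Poisson model with log link and offset log(total), the fitted
    # rate of a group is simply (sum of counts) / (sum of totals).
    rate_a = sum(counts_a) / sum(totals_a)
    rate_b = sum(counts_b) / sum(totals_b)
    rate_null = sum(counts) / sum(totals)

    mu_full = [rate_a * t for t in totals_a] + [rate_b * t for t in totals_b]
    mu_null = [rate_null * t for t in totals]

    def deviance(observed, fitted):
        # Poisson deviance; the y*log(y/mu) term is taken as 0 when y == 0.
        total = 0.0
        for y, mu in zip(observed, fitted):
            log_term = y * math.log(y / mu) if y > 0 else 0.0
            total += 2.0 * (log_term - (y - mu))
        return total

    # Dispersion: Pearson chi-square over residual degrees of freedom
    # (n observations minus 2 fitted group rates).
    pearson = sum((y - mu) ** 2 / mu for y, mu in zip(counts, mu_full))
    dispersion = pearson / (n - 2)
    if dispersion == 0:
        raise ValueError("dispersion estimate is zero; F statistic undefined")

    # Quasi-likelihood F statistic with 1 numerator degree of freedom;
    # it would be compared to an F(1, n - 2) distribution.
    f_stat = (deviance(counts, mu_null) - deviance(counts, mu_full)) / dispersion
    return math.log2(rate_b / rate_a), dispersion, f_stat


# Made-up example data: three replicates per group, equal total counts.
log2_ratio, dispersion, f_stat = quasi_poisson_two_group(
    [10, 12, 8], [1000, 1000, 1000],
    [20, 25, 15], [1000, 1000, 1000],
)
print(log2_ratio, dispersion, f_stat)
```

Dividing by the estimated dispersion is what separates this from a plain Poisson GLM: when replicate counts vary more than a Poisson model predicts, the dispersion exceeds 1 and the test statistic shrinks accordingly. Converting the F statistic to a p-value requires an F distribution function, which this sketch leaves out.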

Original language: English (US)
Pages (from-to): 4295-4305
Number of pages: 11
Journal: Journal of Proteome Research
Issue number: 8
State: Published - Aug 6 2010


  • Generalized Linear Model
  • LC-MS/MS
  • head and neck carcinoma
  • multiple reaction monitoring (MRM)
  • shotgun proteomics
  • spectral counting

ASJC Scopus subject areas

  • Biochemistry
  • General Chemistry

