Automatic deconvolution of isotope-resolved mass spectra using variable selection and quantized peptide mass distribution

Peicheng Du, Ruth Hogue Angeletti

Research output: Contribution to journalArticle

44 Citations (Scopus)

Abstract

We present an algorithm for the deconvolution of isotope-resolved mass spectra of complex peptide mixtures where peaks and isotope series often overlap. The algorithm formulates the problem of mass spectrum deconvolution as a classical statistical problem of variable selection, which aims to interpret the spectrum with the least number of peptides. The LASSO method is used to perform automatic variable selection. The algorithm also makes use of the quantized distribution of peptide masses in the NCBInr database after in silico trypsin digestion as filters to aid the deconvolution process. Errors in the expected isotope pattern are accounted for to avoid spurious isotope series. The effectiveness of the algorithm is demonstrated with annotated ESI spectrum of known peptides for which the peaks and isotope series are highly overlapping. The algorithm successfully finds all correct masses in the experimental spectrum, except for one spectrum where an additional refinement procedure is required to obtain the correct results. Our results compare favorably to those from a widely used commercial program.

Original languageEnglish (US)
Pages (from-to)3385-3392
Number of pages8
JournalAnalytical Chemistry
Volume78
Issue number10
DOIs
StatePublished - May 15 2006

Fingerprint

Deconvolution
Isotopes
Peptides
Trypsin

ASJC Scopus subject areas

  • Analytical Chemistry

Cite this

Automatic deconvolution of isotope-resolved mass spectra using variable selection and quantized peptide mass distribution. / Du, Peicheng; Angeletti, Ruth Hogue.

In: Analytical Chemistry, Vol. 78, No. 10, 15.05.2006, p. 3385-3392.

Research output: Contribution to journalArticle

@article{cb40bf8e9d2743629a3951b597ca3de3,
title = "Automatic deconvolution of isotope-resolved mass spectra using variable selection and quantized peptide mass distribution",
abstract = "We present an algorithm for the deconvolution of isotope-resolved mass spectra of complex peptide mixtures where peaks and isotope series often overlap. The algorithm formulates the problem of mass spectrum deconvolution as a classical statistical problem of variable selection, which aims to interpret the spectrum with the least number of peptides. The LASSO method is used to perform automatic variable selection. The algorithm also makes use of the quantized distribution of peptide masses in the NCBInr database after in silico trypsin digestion as filters to aid the deconvolution process. Errors in the expected isotope pattern are accounted for to avoid spurious isotope series. The effectiveness of the algorithm is demonstrated with annotated ESI spectrum of known peptides for which the peaks and isotope series are highly overlapping. The algorithm successfully finds all correct masses in the experimental spectrum, except for one spectrum where an additional refinement procedure is required to obtain the correct results. Our results compare favorably to those from a widely used commercial program.",
author = "Peicheng Du and Angeletti, {Ruth Hogue}",
year = "2006",
month = "5",
day = "15",
doi = "10.1021/ac052212q",
language = "English (US)",
volume = "78",
pages = "3385--3392",
journal = "Analytical Chemistry",
issn = "0003-2700",
publisher = "American Chemical Society",
number = "10",

}

TY - JOUR

T1 - Automatic deconvolution of isotope-resolved mass spectra using variable selection and quantized peptide mass distribution

AU - Du, Peicheng

AU - Angeletti, Ruth Hogue

PY - 2006/5/15

Y1 - 2006/5/15

N2 - We present an algorithm for the deconvolution of isotope-resolved mass spectra of complex peptide mixtures where peaks and isotope series often overlap. The algorithm formulates the problem of mass spectrum deconvolution as a classical statistical problem of variable selection, which aims to interpret the spectrum with the least number of peptides. The LASSO method is used to perform automatic variable selection. The algorithm also makes use of the quantized distribution of peptide masses in the NCBInr database after in silico trypsin digestion as filters to aid the deconvolution process. Errors in the expected isotope pattern are accounted for to avoid spurious isotope series. The effectiveness of the algorithm is demonstrated with annotated ESI spectrum of known peptides for which the peaks and isotope series are highly overlapping. The algorithm successfully finds all correct masses in the experimental spectrum, except for one spectrum where an additional refinement procedure is required to obtain the correct results. Our results compare favorably to those from a widely used commercial program.

AB - We present an algorithm for the deconvolution of isotope-resolved mass spectra of complex peptide mixtures where peaks and isotope series often overlap. The algorithm formulates the problem of mass spectrum deconvolution as a classical statistical problem of variable selection, which aims to interpret the spectrum with the least number of peptides. The LASSO method is used to perform automatic variable selection. The algorithm also makes use of the quantized distribution of peptide masses in the NCBInr database after in silico trypsin digestion as filters to aid the deconvolution process. Errors in the expected isotope pattern are accounted for to avoid spurious isotope series. The effectiveness of the algorithm is demonstrated with annotated ESI spectrum of known peptides for which the peaks and isotope series are highly overlapping. The algorithm successfully finds all correct masses in the experimental spectrum, except for one spectrum where an additional refinement procedure is required to obtain the correct results. Our results compare favorably to those from a widely used commercial program.

UR - http://www.scopus.com/inward/record.url?scp=33646730580&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=33646730580&partnerID=8YFLogxK

U2 - 10.1021/ac052212q

DO - 10.1021/ac052212q

M3 - Article

C2 - 16689541

AN - SCOPUS:33646730580

VL - 78

SP - 3385

EP - 3392

JO - Analytical Chemistry

JF - Analytical Chemistry

SN - 0003-2700

IS - 10

ER -