MixGF

Contacts

Jian Wang [jiw006 (at) ucsd.edu]

Summary

In large-scale proteomic experiments, multiple peptide precursors are often co-fragmented simultaneously in the same mixture tandem mass (MS/MS) spectrum. These spectra tend to elude current computational tools because of the ubiquitous assumption that each spectrum is generated from only one peptide. Therefore, tools that consider ultiple peptide matches to each MS/MS spectrum can potentially improve the relatively low spectrum identification rate often observed in proteomics experiments. More importantly, data independent acquisition protocols promoting the co-fragmentation of multiple precursors are emerging as alternative methods that can greatly improve the throughput of peptide identifications but their success also depends on the availability of algorithms to identify multiple peptides from each MS/MS spectrum. Here we address a fundamental question in the identification of mixture MS/MS spectra: determining the statistical significance of multiple peptides matched to a given MS/MS
spectrum. We propose theMixGF generating function model to rigorously compute the statistical significance of peptide identifications for mixture spectra and show that this approach improves the sensitivity of current mixture spectra database search tools by a  30%−390%. Analysis of multiple datasets with MixGF reveals that in complex biological samples the number of identified mixture spectra can be as high as 20% of all the identified spectra and the number of unique peptides identified only in mixture spectra can be up to 35.4% of those identified in single-peptide spectra.

Documentation

MixGF_User_Manual

Downloads

Tools: MixGF Download

Datasets: Datasets download

Publications

MixGF: spectral probabilities for mixture spectra from more than one peptide Wang, J., Bourne, P. E., & Bandeira, N. (2014). MixGF: spectral probabilities for mixture spectra from more than one peptide. Molecular & Cellular Proteomics, mcp-O113.037218