CCMS Tools

MetaSPS

Need to sequence an unknown protein? Compared to traditional approaches, MetaSPS was shown to yield the longest and most accurate de novo sequences from tandem mass spectrometry data at nearly full sequence coverage.

CCMS Research area: Spectral Networks

Contacts

Adrian Guthals [aguthals (at) cs.ucsd.edu]

Summary

Full-length de novo sequencing of unknown proteins remains a challenging open problem. Traditional methods that sequence spectra individually are limited by short peptide length, incomplete peptide fragmentation, and ambiguous de novo interpretations. We address these issues by determining consensus sequences for assembled tandem mass (MS/MS) spectra from overlapping peptides (e.g., by using multiple enzymatic digests). We have combined electron-transfer dissociation (ETD) with collision-induced dissociation (CID) and higher-energy collision-induced dissociation (HCD) fragmentation methods to boost interpretation of long, highly charged peptides and take advantage of corroborating b/y/c/z ions in CID/HCD/ETD. Using these strategies, we show that triplet CID/HCD/ETD MS/MS spectra from overlapping peptides yield de novo sequences of average length 70 AA and as long as 200 AA at up to 99% sequencing accuracy.

Documentation

Quick Start Instructions are available here.

Downloads

Installation Packages

Publications

Sequencing-grade de novo analysis of MS/MS triplets (CID/HCD/ETD) from overlapping peptides. Guthals A, Clauser KR, Frank AM, Bandeira N. J Proteome Res. 2013 Jun 7;12(6):2846-57.
Shotgun protein sequencing with meta-contig assembly. Guthals A, Clauser KR, Bandeira N. Mol Cell Proteomics. 2012 Oct;11(10):1084-96.