InsPecT

Live Search    Download   Publications
Contact: Natalie Castellana [ncastell(at)cs.ucsd.edu]

Inspect and MS-Alignment

Inspect is a MS/MS database search tool specifically designed to address two crucial needs of the proteomics comminuty: post-translational modification identification and search speed. The program is available as a free download or online in our Live Search webserver. The online interface is coordinated with other proteomics software developed in the lab, like PepNovo

Typical database searches do not deal well with the dynamic nature of the proteome. Post-translational modifications, alternative splicing, and laboratory chemisty all affect protein behavior and make spectrum interpretation more challenging. The primary challenge is that the "virtual database" of all modified peptides undergoes a combinatorial explosion when a broad range of modifications is allowed. This affects search running time. A secondary challenge is that in this richer database, there are many more close "relatives" for each peptide. This affects scoring accuracy, since differentiating between correct and incorrect identifications is more difficult.

InsPecT addresses several algorithmic problems in order to identify modified proteins.

InsPecT uses peptide sequence tags (PSTs) to filter the database. InsPecT has an internal tag generator, but can accept tags generated by other tools (e.g. Pepnovo, GutenTAG). Because de novo is imperfect, multiple tags are produced for each spectrum, to ensure that (at least) one tag is corrrect. These PSTs are extremely efficient filters, even in the context of up to a dozen possible modifications. Tag-based filtering can also be combined with the "two-pass" filtering pioneered by X!Tandem, where from one search provides a list of proteins (a mini-database) for a more detailed search.

Unanticipated modifications are common in proteomics. InsPecT implements the MS-Alignment algorithm for "blind" spectral search, with no bias toward anticipated modification types. This search has been applied to annotate heavily-modified proteins such as crystallins.

The InsPecT distribution includes a script (PTMAnalysis.py) implementing the PTMFinder procedure for analysis of unrestrictive modification results. This procedure allows for the accurate scoring of PTMs, and for the calculation of a false discovery rate.

Documentation

Documentation is included in the software package, and is also available online. Also, a tutorial covering the MS2DB file format is available (as presented at Algorithmic Biology 2006).

Acknowledgments

Our research is supported in part by the NBCR computational resource (funded by Research Facilities Improvement Program Grant Number C06 RR017588 awarded to the Whitaker Biomedical Engineering Institute, and the Biomedical Technology Resource Centers Program Grant Number P41 RR08605 awarded to the National Biomedical Computation Resource, UCSD, from the National Center for Research Resources, National Institutes of Health). Our research is supported in part by the UCSD FWGrid project, NSF Research Infrastructure Grant Number EIA-0303622. Stephen Tanner is supported by NSF IGERT training grant DGE0504645. We gratefully acknowledge the bug reports and fixes contributed by users of our open-source software.

Publications

Peptide sequence tags for fast database search in mass-spectrometry.
Frank A, Tanner S, Bafna V, Pevzner P.
J Proteome Res. 2005 Jul-Aug;4(4):1287-95.
Paper Pubmed

Peptide Sequence Tags for Fast Database Search in Mass Spectrometry.
Ari Frank, Stephen Tanner, and Pavel Pevzner.
Conference on Research in Computational Molecular Biology (RECOMB) 2005.

Identification of Post-translational Modifications via Blind Search of Mass-Spectra.
Dekel Tsur, Stephen Tanner, Ebrahim Zandi, Vineet Bafna, and Pavel A. Pevzner
IEEE Computer Society Bioinformatics Conference (CSB) 2005.

InsPecT: Fast and accurate identification of post-translationally modified peptides from tandem mass spectra.
Stephen Tanner, Hongjun Shu, Ari Frank, Ling-Chi Wang, Ebrahim Zandi, Marc Mumby, Pavel A. Pevzner, and Vineet Bafna.
Anal Chem. 2005 Jul 15;77(14):4626-39
Paper Pubmed

Identification of Post-translational Modifications via Blind Search of Mass-Spectra.
Dekel Tsur, Stephen Tanner, Ebrahim Zandi, Vineet Bafna, Pavel A. Pevzner.
Nature Biotechnology 2005 Dec;23(12);1562-7
Pubmed

Latest Releases

ProteoSAFe

1.2.4

GenoMS

2012.01.10

Inspect, MS-Alignment

2012.01.09

Meta-SPS

2013.04.30

MixDB

2011.12.06

MS-Clustering

2011.03.27

MS-Dictionary

2007.11.30

MS-GappedDictionary

2011.06.15

MS-GeneratingFunction

2010.10.14

MS-GFDB

2012.06.07

MS-GF+

2013.04.10

M-SPLIT

2011.06.05

PepNovo

2012.04.23

Spectral Networks

Sept 2007

UniNovo

2013.03.10

Copyright Notice

 

Media Coverage


Nonribosomal Peptide Dereplication and Sequencing (Scientific American, Genetic Engineering News, Natural Products Industry Insider and Genome Web Daily News)

A powerful tool for PTM discovery (Jan 2008, Journal of Proteome research, Vol 7. Issue 1)

From spectral networks to shotgun sequencing (June 2007, Nature Methods, Vol. 4 No. 6)

Identifying peptides without a database (May 2007, Journal of Proteome Research)

UCSD Computer Scientist Wins Young Investigator Award, Research on Snake Venom Proteins Highlighted (Nov 2006, UCSD)