InsPecT
Live Search
Download Publications
Contact: Natalie Castellana [ncastell(at)cs.ucsd.edu]
Inspect and MS-Alignment
Inspect is a MS/MS database search tool specifically designed to address two crucial needs of the proteomics comminuty: post-translational modification identification and search speed. The program is available as a free download or online in our Live Search webserver. The online interface is coordinated with other proteomics software developed in the lab, like PepNovo
Typical database searches do not deal well with the dynamic nature of the proteome.
Post-translational modifications, alternative splicing, and laboratory chemisty all affect
protein behavior and make spectrum interpretation more challenging. The primary challenge
is that the "virtual database" of all modified peptides undergoes a combinatorial
explosion when a broad range of modifications is allowed. This affects search running time.
A secondary challenge is that in this richer database, there are many more close "relatives"
for each peptide. This affects scoring accuracy, since differentiating between correct and
incorrect identifications is more difficult.
InsPecT addresses several algorithmic
problems in order to identify modified proteins.
InsPecT uses peptide sequence tags
(PSTs) to filter the database. InsPecT has an internal tag generator, but can accept tags
generated by other tools (e.g. Pepnovo, GutenTAG). Because de novo is imperfect,
multiple tags are produced for each spectrum, to ensure that (at least) one tag is corrrect.
These PSTs are extremely efficient filters, even in the context of up to a dozen possible
modifications. Tag-based filtering can also be combined with the "two-pass" filtering pioneered
by X!Tandem, where from one search provides a list of proteins (a mini-database) for a more
detailed search.
Unanticipated modifications are common in proteomics. InsPecT
implements the MS-Alignment algorithm for "blind" spectral search, with no bias toward
anticipated modification types. This search has been applied to annotate heavily-modified
proteins such as crystallins.
The InsPecT distribution includes a script (PTMAnalysis.py) implementing the PTMFinder procedure for analysis of unrestrictive modification results. This procedure allows for the accurate scoring of PTMs, and for the calculation of a false discovery rate.
Documentation
Documentation is included in the software package, and is also available online. Also, a tutorial covering the MS2DB file format is available (as presented at Algorithmic Biology 2006).
Acknowledgments
Our research is supported in part by the NBCR computational resource (funded by Research Facilities Improvement Program Grant Number C06 RR017588 awarded to the Whitaker Biomedical Engineering Institute, and the Biomedical Technology Resource Centers Program Grant Number P41 RR08605 awarded to the National Biomedical Computation Resource, UCSD, from the National Center for Research Resources, National Institutes of Health). Our research is supported in part by the UCSD FWGrid project, NSF Research Infrastructure Grant Number EIA-0303622. Stephen Tanner is supported by NSF IGERT training grant DGE0504645. We gratefully acknowledge the bug reports and fixes contributed by users of our open-source software.
Publications
Peptide sequence tags for fast database search in mass-spectrometry.
Frank A, Tanner S, Bafna V, Pevzner P.
J Proteome Res. 2005 Jul-Aug;4(4):1287-95.
Paper Pubmed
Peptide Sequence Tags for Fast Database Search in Mass Spectrometry.
Ari Frank, Stephen Tanner, and Pavel Pevzner.
Conference on Research in Computational Molecular Biology (RECOMB) 2005.
Identification of Post-translational Modifications via Blind Search of Mass-Spectra.
Dekel Tsur, Stephen Tanner, Ebrahim Zandi, Vineet Bafna, and Pavel A. Pevzner
IEEE Computer Society Bioinformatics Conference (CSB) 2005.
InsPecT: Fast and accurate identification of post-translationally modified peptides from tandem mass spectra.
Stephen Tanner, Hongjun Shu, Ari Frank, Ling-Chi Wang, Ebrahim Zandi, Marc Mumby, Pavel A. Pevzner, and Vineet Bafna.
Anal Chem. 2005 Jul 15;77(14):4626-39
Paper
Pubmed
Identification of Post-translational Modifications via Blind Search of Mass-Spectra.
Dekel Tsur, Stephen Tanner, Ebrahim Zandi, Vineet Bafna, Pavel A. Pevzner.
Nature Biotechnology 2005 Dec;23(12);1562-7
Pubmed
Latest Releases
ProteoSAFe
GenoMS
Inspect, MS-Alignment
Meta-SPS
MixDB
MS-Clustering
MS-Dictionary
MS-GappedDictionary
MS-GeneratingFunction
MS-GFDB
MS-GF+
M-SPLIT
PepNovo
Spectral Networks
UniNovo
Media Coverage
Nonribosomal Peptide Dereplication and Sequencing (Scientific American, Genetic Engineering News, Natural Products Industry Insider and Genome Web Daily News)
A powerful tool for PTM discovery (Jan 2008, Journal of Proteome research, Vol 7. Issue 1)
From spectral networks to shotgun sequencing (June 2007, Nature Methods, Vol. 4 No. 6)
Identifying peptides without a database (May 2007, Journal of Proteome Research)
UCSD Computer Scientist Wins Young Investigator Award, Research on Snake Venom Proteins Highlighted (Nov 2006, UCSD)