MS-Dictionary
Contact: Sangtae Kim [sak008 (at) ucsd.edu]
Summary
Database search tools identify peptides by matching tandem mass spectra against a protein database. We study an alternative approach in which all plausible de novo interpretations of a spectrum (its spectral dictionary) are generated and then quickly matched against the database. We present MS-Dictionary, a new algorithm for efficiently generating spectral dictionaries, and demonstrate that it can identify spectra that are missed by database search. We argue that MS-Dictionary enables proteogenomic searches against six-frame translations of genomic sequences, searches that may be prohibitively time-consuming for existing database search approaches. We show that such searches make it possible to correct sequencing errors and find programmed frameshifts.
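The matching step described above can be illustrated with a toy sketch. It is not the MS-Dictionary implementation: it assumes the spectral dictionary has already been generated and is given as a plain set of peptide strings, and it uses naive exact substring search where a real tool would use an index. The function names (`six_frame_translation`, `match_dictionary`) are hypothetical and chosen only for this illustration.

```python
from itertools import product

# Standard genetic code, listed in TCAG order ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of an upper-case DNA string."""
    return dna.translate(COMPLEMENT)[::-1]


def translate(dna):
    """Translate complete codons only; unknown codons (e.g. with N) become 'X'."""
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, len(dna) - 2, 3)
    )


def six_frame_translation(dna):
    """Map (strand, offset) to the translation of that reading frame."""
    dna = dna.upper()
    frames = {}
    for strand, seq in (("+", dna), ("-", reverse_complement(dna))):
        for offset in range(3):
            frames[(strand, offset)] = translate(seq[offset:])
    return frames


def match_dictionary(dictionary, dna):
    """Find every exact occurrence of each dictionary peptide in the six frames.

    `dictionary` is an assumed input: a set of candidate peptide strings for
    one spectrum. Returns sorted (peptide, frame, position) tuples.
    """
    hits = []
    for frame, protein in six_frame_translation(dna).items():
        for peptide in dictionary:
            pos = protein.find(peptide)
            while pos != -1:
                hits.append((peptide, frame, pos))
                pos = protein.find(peptide, pos + 1)
    return sorted(hits)


if __name__ == "__main__":
    genome = "ATGCCCAAGTTTGGG"
    print(match_dictionary({"PKF", "AAA"}, genome))
```

Because every frame on both strands is searched, a peptide can be located even when no annotated protein covers that region. A hit that continues in a different frame of the same strand is the kind of evidence that could point to a sequencing error or a frameshift, although this sketch does not attempt that analysis.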
Download
Contact the author.
Publications
Spectral Dictionaries: Integrating De Novo Peptide Sequencing with Database Search of Tandem Mass Spectra.
Sangtae Kim, Nitin Gupta and Pavel Pevzner.
Submitted.
Media Coverage
Nonribosomal Peptide Dereplication and Sequencing (Scientific American, Genetic Engineering News, Natural Products Industry Insider and Genome Web Daily News)
A powerful tool for PTM discovery (Jan 2008, Journal of Proteome Research, Vol. 7, Issue 1)
From spectral networks to shotgun sequencing (June 2007, Nature Methods, Vol. 4 No. 6)
Identifying peptides without a database (May 2007, Journal of Proteome Research)
UCSD Computer Scientist Wins Young Investigator Award, Research on Snake Venom Proteins Highlighted (Nov 2006, UCSD)