Tool-sets developed during cancer proteogenomic project

Cancer Proteogenomics

Contacts

Sunghee Woo [suwoo at ucsd.edu], Seong Won Cha [s3cha at ucsd.edu], Stefano Bonissone [sbonisso at cs.ucsd.edu], Vineet Bafna [vbafna at cs.ucsd.edu]

Summary

This tool-set is the downloadable list of all software development done during the Cancer Proteogenomics project [2013-2015] in CCMS.

The software is divided majorly into four components.

1. SpliceDB : Proteogenomic database for novel splice junction identifications driven from large-scale RNA-seq data. (input : BAM files, output : FASTA) [PMID: 23802565]

2. MutationDB : Proteogenomic database for identifying all types of mutatoinal variants driven from large-scale RNA-seq data. (input : VCF files, output : FASTA) [PMID: 25263569]

3. IG DB : Assembly based proteogenomic database for identifying complex immunoglobulin gene peptides. (input : BAM files, output : FASTA) [DOI: 10.1021/acs.jproteome.5b00264]

4. Post-processing tool (also called as Enosi) : Proteogenomic post-processing tools to assign proteogenomic events.

 

 

Documentation

Under preparation

Publications

Proteogenomic database construction driven from large scale RNA-seq data Woo S, Cha SW, Merrihew G, He Y, Castellana N, Guest C, MacCoss M, Bafna V. J Proteome Res. 2014 Jan 3;13(1):21-8. doi: 10.1021/pr400294c. Epub 2013 Jul 17.
Proteogenomic strategies for identification of aberrant cancer peptides using large-scale next-generation sequencing data Woo, S.; Cha, S. W.; et al., PROTEOMICS. 2014 Dec;14(23-24):2719-30. doi: 10.1002/pmic.201400206. Epub 2014 Nov 17.
Advanced proteogenomic analysis reveals multiple peptide mutations and complex immunoglobulin peptides in colon cancer Woo, S.; Cha, S. W.;,Bonissone, S.; et al. J Proteome Res. 2015. In press