An integrated mass-spectrometry pipeline identifies novel protein coding-regions in the human genome.
Affiliation
Applied Computational Biology and Bioinformatics Group, Cancer Research UK, Paterson Institute for Cancer Research, The University of Manchester, Manchester, United Kingdom.
Issue Date
2010
Abstract
BACKGROUND: Most protein mass spectrometry (MS) experiments rely on searches against a database of known or predicted proteins, which limits their use as a gene discovery tool. RESULTS: Using a search against an in silico translation of the entire human genome, combined with a series of annotation filters, we identified 346 putative novel peptides [False Discovery Rate (FDR) < 5%] in an MS dataset derived from two human breast epithelial cell lines. A subset of these was then successfully validated by a different MS technique. Two of these correspond to novel isoforms of Heterogeneous Ribonuclear Proteins, while the rest correspond to novel loci. CONCLUSIONS: MS technology can be used for ab initio gene discovery in human data. Because it is based on different underlying assumptions, it identifies protein-coding genes not found by other techniques. As MS technology continues to evolve, such approaches will become increasingly powerful.
Citation
An integrated mass-spectrometry pipeline identifies novel protein coding-regions in the human genome. 2010, 5 (1):e8949 PLoS ONE
Journal
PLoS ONE
DOI
10.1371/journal.pone.0008949
PubMed ID
20126623
Type
Article
Language
en
ISSN
1932-6203
Related articles
- Integration of mass spectrometry and RNA-Seq data to confirm human ab initio predicted genes and lncRNAs.
  - Authors: Sun H, Chen C, Shi M, Wang D, Liu M, Li D, Yang P, Li Y, Xie L
  - Issue date: 2014 Dec
- [Analysis, identification and correction of some errors of model refseqs appeared in NCBI Human Gene Database by in silico cloning and experimental verification of novel human genes].
  - Authors: Zhang DL, Ji L, Li YD
  - Issue date: 2004 May
- Genome annotation of Anopheles gambiae using mass spectrometry-derived data.
  - Authors: Kalume DE, Peri S, Reddy R, Zhong J, Okulate M, Kumar N, Pandey A
  - Issue date: 2005 Sep 19
- Integrated Proteomic Pipeline Using Multiple Search Engines for a Proteogenomic Study with a Controlled Protein False Discovery Rate.
  - Authors: Park GW, Hwang H, Kim KH, Lee JY, Lee HK, Park JY, Ji ES, Park SR, Yates JR 3rd, Kwon KH, Park YM, Lee HJ, Paik YK, Kim JY, Yoo JS
  - Issue date: 2016 Nov 4
- Interrogating the human genome using uninterpreted mass spectrometry data.
  - Authors: Choudhary JS, Blackstock WP, Creasy DM, Cottrell JS
  - Issue date: 2001 May