Journal article
High-throughput parallel proteogenomics: A bacterial case study
Proteomics, Vol.14(23-24), pp.2780-2789
2014
Abstract
In recent years, a new paradigm for genome annotation has emerged, termed "proteogenomics," that leverages peptide MS to annotate a genome. This is achieved by mapping peptides to a six-frame translation of a genome, including available splice databases, which may suggest refinements to gene models. Using this approach, it is possible to refine gene regions such as exon boundaries, novel genes, gene boundaries, frame shifts, reverse strands, translated UTRs, and novel splice junctions. One of the challenges of proteogenomics is how best to (1) tackle assigning confidence to any resulting annotation and (2) apply these gene model refinements, either through manual annotation or through an automated process via training gene prediction tools. This is not a straightforward process, as many gene prediction tools have their defined suitability for niche genomes (either eukaryotic or prokaryotic) trained on and refined with model organisms such as Arabidopsis thaliana and Escherichia coli, and varying degrees of features that can leverage the use of external evidence. In this study, we outline a suitable approach toward preprocessing mass spectra and optimizing the MS/MS search for a given dataset. We also discuss future challenges, which continue to pose a problem in the field of proteogenomics, and better strategies to successfully tackle them with, using existing tools. We use Bradyrhizobium diazoefficiens (Nitrogen-fixing bacteria), with a 9.1 Mb genome as a case study, utilizing the latest in second-generation proteogenomics tools with multiple gene models for cross-validation of proteogenomics annotations.
Details
- Title
- High-throughput parallel proteogenomics: A bacterial case study
- Authors/Creators
- B. Chapman (Author/Creator) - Murdoch UniversityM. Bellgard (Author/Creator) - Murdoch University
- Publication Details
- Proteomics, Vol.14(23-24), pp.2780-2789
- Publisher
- John Wiley & Sons Ltd.
- Identifiers
- 991005543997407891
- Copyright
- © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim
- Murdoch Affiliation
- Centre for Comparative Genomics
- Language
- English
- Resource Type
- Journal article
UN Sustainable Development Goals (SDGs)
This output has contributed to the advancement of the following goals:
Source: InCites
Metrics
39 Record Views
InCites Highlights
These are selected metrics from InCites Benchmarking & Analytics tool, related to this output
- Citation topics
- 2 Chemistry
- 2.211 Mass Spectrometry
- 2.211.497 Proteomics Mass Spectrometry
- Web Of Science research areas
- Biochemical Research Methods
- Biochemistry & Molecular Biology
- ESI research areas
- Biology & Biochemistry