TY - JOUR
T1 - pepFunk
T2 - A tool for peptide-centric functional analysis of metaproteomic human gut microbiome studies
AU - Simopoulos, Caitlin M.A.
AU - Ning, Zhibin
AU - Zhang, Xu
AU - Li, Leyuan
AU - Walker, Krystal
AU - Lavallée-Adam, Mathieu
AU - Figeys, Daniel
N1 - Publisher Copyright:
© The Author(s) 2020.
PY - 2020/7/15
Y1 - 2020/7/15
N2 - Motivation: Enzymatic digestion of proteins before mass spectrometry analysis is a key process in metaproteomic workflows. Canonical metaproteomic data processing pipelines typically involve matching spectra produced by the mass spectrometer to a theoretical spectra database, followed by matching the identified peptides back to parent-proteins. However, the nature of enzymatic digestion produces peptides that can be found in multiple proteins due to conservation or chance, presenting difficulties with protein and functional assignment. Results: To combat this challenge, we developed pepFunk, a peptide-centric metaproteomic workflow focused on the analysis of human gut microbiome samples. Our workflow includes a curated peptide database annotated with Kyoto Encyclopedia of Genes and Genomes (KEGG) terms and a gene set variation analysis-inspired pathway enrichment adapted for peptide-level data. Analysis using our peptide-centric workflow is fast and highly correlated to a protein-centric analysis, and can identify more enriched KEGG pathways than analysis using protein-level data. Our workflow is open source and available as a web application or source code to be run locally.
AB - Motivation: Enzymatic digestion of proteins before mass spectrometry analysis is a key process in metaproteomic workflows. Canonical metaproteomic data processing pipelines typically involve matching spectra produced by the mass spectrometer to a theoretical spectra database, followed by matching the identified peptides back to parent-proteins. However, the nature of enzymatic digestion produces peptides that can be found in multiple proteins due to conservation or chance, presenting difficulties with protein and functional assignment. Results: To combat this challenge, we developed pepFunk, a peptide-centric metaproteomic workflow focused on the analysis of human gut microbiome samples. Our workflow includes a curated peptide database annotated with Kyoto Encyclopedia of Genes and Genomes (KEGG) terms and a gene set variation analysis-inspired pathway enrichment adapted for peptide-level data. Analysis using our peptide-centric workflow is fast and highly correlated to a protein-centric analysis, and can identify more enriched KEGG pathways than analysis using protein-level data. Our workflow is open source and available as a web application or source code to be run locally.
UR - http://www.scopus.com/inward/record.url?scp=85088882164&partnerID=8YFLogxK
U2 - 10.1093/bioinformatics/btaa289
DO - 10.1093/bioinformatics/btaa289
M3 - Article
C2 - 32369596
AN - SCOPUS:85088882164
SN - 1367-4803
VL - 36
SP - 4171
EP - 4179
JO - Bioinformatics
JF - Bioinformatics
IS - 14
ER -