Title: A pipeline for identifying endogenous neuropeptides from spectral archives

Authors: Mingze Bai; Mingmin He; Qifeng Sun; Huadong Liao; Kunxian Shu; Henning Hermjakob

Addresses: Chongqing Key Laboratory of Big Data for Bio Intelligence, School of Bioinformatics, Chongqing University of Posts and Telecommunications, Chongqing 400065, China; National Center for Protein Sciences, Beijing 102206, China ' Chongqing Key Laboratory of Big Data for Bio Intelligence, School of Bioinformatics, Chongqing University of Posts and Telecommunications, Chongqing 400065, China ' Chongqing Key Laboratory of Big Data for Bio Intelligence, School of Bioinformatics, Chongqing University of Posts and Telecommunications, Chongqing 400065, China ' Chongqing Key Laboratory of Big Data for Bio Intelligence, School of Bioinformatics, Chongqing University of Posts and Telecommunications, Chongqing 400065, China ' Chongqing Key Laboratory of Big Data for Bio Intelligence, School of Bioinformatics, Chongqing University of Posts and Telecommunications, Chongqing 400065, China ' National Center for Protein Sciences, Beijing 102206, China

Abstract: Shotgun proteomics experiments often provide a big amount of spectra data; however, a big part of them remain unidentified. Many unidentified spectra that are high probably from peptides could be revealed by data mining methods such as clustering. This idea motivates researchers to build 'spectral archives' to identify more peptides from the previously analysed resources. The objective is to build a general way to identify peptides for these high possibility spectra in spectral archives, to help biologists to get more output from the data. We here propose a novel generic pipeline for this approach, based on the PRIDE cluster resources, rather than building a complete archive from scratch. We applied our pipeline to test the identification of endogenous neuropeptides in rat. 33 high probability peptide-induced spectra have been exposed from rat's unidentified spectra in PRIDE cluster's archive.

Keywords: spectral library searching; spectral archives; endogenous neuropeptides; PRIDE cluster.

DOI: 10.1504/IJDMB.2018.092156

International Journal of Data Mining and Bioinformatics, 2018 Vol.20 No.1, pp.12 - 35

Received: 27 Nov 2017
Accepted: 19 Feb 2018

Published online: 05 Jun 2018 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article