AlphaPeptDeep: a modular deep learning framework to predict peptide properties for proteomics
Research output: Contribution to journal › Journal article › Research › peer-review
Standard
AlphaPeptDeep : a modular deep learning framework to predict peptide properties for proteomics. / Zeng, Wen Feng; Zhou, Xie Xuan; Willems, Sander; Ammar, Constantin; Wahle, Maria; Bludau, Isabell; Voytik, Eugenia; Strauss, Maximillian T.; Mann, Matthias.
In: Nature Communications, Vol. 13, 7238, 2022.Research output: Contribution to journal › Journal article › Research › peer-review
Harvard
APA
Vancouver
Author
Bibtex
}
RIS
TY - JOUR
T1 - AlphaPeptDeep
T2 - a modular deep learning framework to predict peptide properties for proteomics
AU - Zeng, Wen Feng
AU - Zhou, Xie Xuan
AU - Willems, Sander
AU - Ammar, Constantin
AU - Wahle, Maria
AU - Bludau, Isabell
AU - Voytik, Eugenia
AU - Strauss, Maximillian T.
AU - Mann, Matthias
N1 - Publisher Copyright: © 2022, The Author(s).
PY - 2022
Y1 - 2022
N2 - Machine learning and in particular deep learning (DL) are increasingly important in mass spectrometry (MS)-based proteomics. Recent DL models can predict the retention time, ion mobility and fragment intensities of a peptide just from the amino acid sequence with good accuracy. However, DL is a very rapidly developing field with new neural network architectures frequently appearing, which are challenging to incorporate for proteomics researchers. Here we introduce AlphaPeptDeep, a modular Python framework built on the PyTorch DL library that learns and predicts the properties of peptides (https://github.com/MannLabs/alphapeptdeep). It features a model shop that enables non-specialists to create models in just a few lines of code. AlphaPeptDeep represents post-translational modifications in a generic manner, even if only the chemical composition is known. Extensive use of transfer learning obviates the need for large data sets to refine models for particular experimental conditions. The AlphaPeptDeep models for predicting retention time, collisional cross sections and fragment intensities are at least on par with existing tools. Additional sequence-based properties can also be predicted by AlphaPeptDeep, as demonstrated with a HLA peptide prediction model to improve HLA peptide identification for data-independent acquisition (https://github.com/MannLabs/PeptDeep-HLA).
AB - Machine learning and in particular deep learning (DL) are increasingly important in mass spectrometry (MS)-based proteomics. Recent DL models can predict the retention time, ion mobility and fragment intensities of a peptide just from the amino acid sequence with good accuracy. However, DL is a very rapidly developing field with new neural network architectures frequently appearing, which are challenging to incorporate for proteomics researchers. Here we introduce AlphaPeptDeep, a modular Python framework built on the PyTorch DL library that learns and predicts the properties of peptides (https://github.com/MannLabs/alphapeptdeep). It features a model shop that enables non-specialists to create models in just a few lines of code. AlphaPeptDeep represents post-translational modifications in a generic manner, even if only the chemical composition is known. Extensive use of transfer learning obviates the need for large data sets to refine models for particular experimental conditions. The AlphaPeptDeep models for predicting retention time, collisional cross sections and fragment intensities are at least on par with existing tools. Additional sequence-based properties can also be predicted by AlphaPeptDeep, as demonstrated with a HLA peptide prediction model to improve HLA peptide identification for data-independent acquisition (https://github.com/MannLabs/PeptDeep-HLA).
U2 - 10.1038/s41467-022-34904-3
DO - 10.1038/s41467-022-34904-3
M3 - Journal article
C2 - 36433986
AN - SCOPUS:85142505814
VL - 13
JO - Nature Communications
JF - Nature Communications
SN - 2041-1723
M1 - 7238
ER -
ID: 328689787