Artificial intelligence for proteomics and biomarker discovery

Research output: Contribution to journalReviewResearchpeer-review

There is an avalanche of biomedical data generation and a parallel expansion in computational capabilities to analyze and make sense of these data. Starting with genome sequencing and widely employed deep sequencing technologies, these trends have now taken hold in all omics disciplines and increasingly call for multi-omics integration as well as data interpretation by artificial intelligence technologies. Here, we focus on mass spectrometry (MS)-based proteomics and describe how machine learning and, in particular, deep learning now predicts experimental peptide measurements from amino acid sequences alone. This will dramatically improve the quality and reliability of analytical workflows because experimental results should agree with predictions in a multi-dimensional data landscape. Machine learning has also become central to biomarker discovery from proteomics data, which now starts to outperform existing best-in-class assays. Finally, we discuss model transparency and explainability and data privacy that are required to deploy MS-based biomarkers in clinical settings.

Original languageEnglish
JournalCell Systems
Volume12
Issue number8
Pages (from-to)759-770
Number of pages12
ISSN2405-4712
DOIs
Publication statusPublished - 18 Aug 2021
Externally publishedYes

Bibliographical note

Copyright © 2021 Elsevier Inc. All rights reserved.

ID: 276706016