Search Databases and Statistics: Pitfalls and Best Practices in Phosphoproteomics

Research output: Chapter in Book/Report/Conference proceedingBook chapterResearchpeer-review

Advances in mass spectrometric instrumentation in the past 15 years have resulted in an explosion in the raw data yield from typical phosphoproteomics workflows. This poses the challenge of confidently identifying peptide sequences, localizing phosphosites to proteins and quantifying these from the vast amounts of raw data. This task is tackled by computational tools implementing algorithms that match the experimental data to databases, providing the user with lists for downstream analysis. Several platforms for such automated interpretation of mass spectrometric data have been developed, each having strengths and weaknesses that must be considered for the individual needs. These are reviewed in this chapter. Equally critical for generating highly confident output datasets is the application of sound statistical criteria to limit the inclusion of incorrect peptide identifications from database searches. Additionally, careful filtering and use of appropriate statistical tests on the output datasets affects the quality of all downstream analyses and interpretation of the data. Our considerations and general practices on these aspects of phosphoproteomics data processing are presented here.

Original languageEnglish
Title of host publicationPhospho-Proteomics : Methods and Protocols
EditorsLouise von Stechow
Number of pages17
Volume1355
PublisherSpringer Publishing Company
Publication date2016
Pages323-39
ISBN (Print)978-1-4939-3048-7
ISBN (Electronic)978-1-4939-3049-4
DOIs
Publication statusPublished - 2016
SeriesMethods in molecular biology (Clifton, N.J.)
ISSN1064-3745

ID: 176738047