Predictive utilities of lipid traits, lipoprotein subfractions and other risk factors for incident diabetes: a machine learning approach in the Diabetes Prevention Program

Research output: Contribution to journalJournal articleResearchpeer-review

Standard

Predictive utilities of lipid traits, lipoprotein subfractions and other risk factors for incident diabetes : a machine learning approach in the Diabetes Prevention Program. / Diabetes Prevention Program Research Group.

In: B M J Open Diabetes Research & Care, Vol. 9, No. 1, e001953, 03.2021.

Research output: Contribution to journalJournal articleResearchpeer-review

Harvard

Diabetes Prevention Program Research Group 2021, 'Predictive utilities of lipid traits, lipoprotein subfractions and other risk factors for incident diabetes: a machine learning approach in the Diabetes Prevention Program', B M J Open Diabetes Research & Care, vol. 9, no. 1, e001953. https://doi.org/10.1136/bmjdrc-2020-001953

APA

Diabetes Prevention Program Research Group (2021). Predictive utilities of lipid traits, lipoprotein subfractions and other risk factors for incident diabetes: a machine learning approach in the Diabetes Prevention Program. B M J Open Diabetes Research & Care, 9(1), [e001953]. https://doi.org/10.1136/bmjdrc-2020-001953

Vancouver

Diabetes Prevention Program Research Group. Predictive utilities of lipid traits, lipoprotein subfractions and other risk factors for incident diabetes: a machine learning approach in the Diabetes Prevention Program. B M J Open Diabetes Research & Care. 2021 Mar;9(1). e001953. https://doi.org/10.1136/bmjdrc-2020-001953

Author

Diabetes Prevention Program Research Group. / Predictive utilities of lipid traits, lipoprotein subfractions and other risk factors for incident diabetes : a machine learning approach in the Diabetes Prevention Program. In: B M J Open Diabetes Research & Care. 2021 ; Vol. 9, No. 1.

Bibtex

@article{d3d0581cd92649aa9881fe6f798208f8,
title = "Predictive utilities of lipid traits, lipoprotein subfractions and other risk factors for incident diabetes: a machine learning approach in the Diabetes Prevention Program",
abstract = "INTRODUCTION: Although various lipid and non-lipid analytes measured by nuclear magnetic resonance (NMR) spectroscopy have been associated with type 2 diabetes, a structured comparison of the ability of NMR-derived biomarkers and standard lipids to predict individual diabetes risk has not been undertaken in larger studies nor among individuals at high risk of diabetes.RESEARCH DESIGN AND METHODS: Cumulative discriminative utilities of various groups of biomarkers including NMR lipoproteins, related non-lipid biomarkers, standard lipids, and demographic and glycemic traits were compared for short-term (3.2 years) and long-term (15 years) diabetes development in the Diabetes Prevention Program, a multiethnic, placebo-controlled, randomized controlled trial of individuals with pre-diabetes in the USA (N=2590). Logistic regression, Cox proportional hazards model and six different hyperparameter-tuned machine learning algorithms were compared. The Matthews Correlation Coefficient (MCC) was used as the primary measure of discriminative utility.RESULTS: Models with baseline NMR analytes and their changes did not improve the discriminative utility of simpler models including standard lipids or demographic and glycemic traits. Across all algorithms, models with baseline 2-hour glucose performed the best (max MCC=0.36). Sophisticated machine learning algorithms performed similarly to logistic regression in this study.CONCLUSIONS: NMR lipoproteins and related non-lipid biomarkers were associated but did not augment discrimination of diabetes risk beyond traditional diabetes risk factors except for 2-hour glucose. Machine learning algorithms provided no meaningful improvement for discrimination compared with logistic regression, which suggests a lack of influential latent interactions among the analytes assessed in this study.TRIAL REGISTRATION NUMBER: Diabetes Prevention Program: NCT00004992; Diabetes Prevention Program Outcomes Study: NCT00038727.",
author = "Varga, {Tibor V} and Jinxi Liu and Goldberg, {Ronald B} and Guannan Chen and Samuel Dagogo-Jack and Carlos Lorenzo and Mather, {Kieren J} and Xavier Pi-Sunyer and S{\o}ren Brunak and Marinella Temprosa and {Diabetes Prevention Program Research Group}",
note = "{\textcopyright} Author(s) (or their employer(s)) 2021. Re-use permitted under CC BY-NC. No commercial re-use. See rights and permissions. Published by BMJ.",
year = "2021",
month = mar,
doi = "10.1136/bmjdrc-2020-001953",
language = "English",
volume = "9",
journal = "B M J Open Diabetes Research & Care",
issn = "2052-4897",
publisher = "B M J Group",
number = "1",

}

RIS

TY - JOUR

T1 - Predictive utilities of lipid traits, lipoprotein subfractions and other risk factors for incident diabetes

T2 - a machine learning approach in the Diabetes Prevention Program

AU - Varga, Tibor V

AU - Liu, Jinxi

AU - Goldberg, Ronald B

AU - Chen, Guannan

AU - Dagogo-Jack, Samuel

AU - Lorenzo, Carlos

AU - Mather, Kieren J

AU - Pi-Sunyer, Xavier

AU - Brunak, Søren

AU - Temprosa, Marinella

AU - Diabetes Prevention Program Research Group

N1 - © Author(s) (or their employer(s)) 2021. Re-use permitted under CC BY-NC. No commercial re-use. See rights and permissions. Published by BMJ.

PY - 2021/3

Y1 - 2021/3

N2 - INTRODUCTION: Although various lipid and non-lipid analytes measured by nuclear magnetic resonance (NMR) spectroscopy have been associated with type 2 diabetes, a structured comparison of the ability of NMR-derived biomarkers and standard lipids to predict individual diabetes risk has not been undertaken in larger studies nor among individuals at high risk of diabetes.RESEARCH DESIGN AND METHODS: Cumulative discriminative utilities of various groups of biomarkers including NMR lipoproteins, related non-lipid biomarkers, standard lipids, and demographic and glycemic traits were compared for short-term (3.2 years) and long-term (15 years) diabetes development in the Diabetes Prevention Program, a multiethnic, placebo-controlled, randomized controlled trial of individuals with pre-diabetes in the USA (N=2590). Logistic regression, Cox proportional hazards model and six different hyperparameter-tuned machine learning algorithms were compared. The Matthews Correlation Coefficient (MCC) was used as the primary measure of discriminative utility.RESULTS: Models with baseline NMR analytes and their changes did not improve the discriminative utility of simpler models including standard lipids or demographic and glycemic traits. Across all algorithms, models with baseline 2-hour glucose performed the best (max MCC=0.36). Sophisticated machine learning algorithms performed similarly to logistic regression in this study.CONCLUSIONS: NMR lipoproteins and related non-lipid biomarkers were associated but did not augment discrimination of diabetes risk beyond traditional diabetes risk factors except for 2-hour glucose. Machine learning algorithms provided no meaningful improvement for discrimination compared with logistic regression, which suggests a lack of influential latent interactions among the analytes assessed in this study.TRIAL REGISTRATION NUMBER: Diabetes Prevention Program: NCT00004992; Diabetes Prevention Program Outcomes Study: NCT00038727.

AB - INTRODUCTION: Although various lipid and non-lipid analytes measured by nuclear magnetic resonance (NMR) spectroscopy have been associated with type 2 diabetes, a structured comparison of the ability of NMR-derived biomarkers and standard lipids to predict individual diabetes risk has not been undertaken in larger studies nor among individuals at high risk of diabetes.RESEARCH DESIGN AND METHODS: Cumulative discriminative utilities of various groups of biomarkers including NMR lipoproteins, related non-lipid biomarkers, standard lipids, and demographic and glycemic traits were compared for short-term (3.2 years) and long-term (15 years) diabetes development in the Diabetes Prevention Program, a multiethnic, placebo-controlled, randomized controlled trial of individuals with pre-diabetes in the USA (N=2590). Logistic regression, Cox proportional hazards model and six different hyperparameter-tuned machine learning algorithms were compared. The Matthews Correlation Coefficient (MCC) was used as the primary measure of discriminative utility.RESULTS: Models with baseline NMR analytes and their changes did not improve the discriminative utility of simpler models including standard lipids or demographic and glycemic traits. Across all algorithms, models with baseline 2-hour glucose performed the best (max MCC=0.36). Sophisticated machine learning algorithms performed similarly to logistic regression in this study.CONCLUSIONS: NMR lipoproteins and related non-lipid biomarkers were associated but did not augment discrimination of diabetes risk beyond traditional diabetes risk factors except for 2-hour glucose. Machine learning algorithms provided no meaningful improvement for discrimination compared with logistic regression, which suggests a lack of influential latent interactions among the analytes assessed in this study.TRIAL REGISTRATION NUMBER: Diabetes Prevention Program: NCT00004992; Diabetes Prevention Program Outcomes Study: NCT00038727.

U2 - 10.1136/bmjdrc-2020-001953

DO - 10.1136/bmjdrc-2020-001953

M3 - Journal article

C2 - 33789908

VL - 9

JO - B M J Open Diabetes Research & Care

JF - B M J Open Diabetes Research & Care

SN - 2052-4897

IS - 1

M1 - e001953

ER -

ID: 259833374