Analyzing postprandial metabolomics data using multiway models: a simulation study

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningfagfællebedømt

Standard

Analyzing postprandial metabolomics data using multiway models : a simulation study. / Li, Lu; Yan, Shi; Bakker, Barbara M.; Hoefsloot, Huub; Chawes, Bo; Horner, David; Rasmussen, Morten A.; Smilde, Age K.; Acar, Evrim.

I: BMC Bioinformatics, Bind 25, 94, 2024.

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningfagfællebedømt

Harvard

Li, L, Yan, S, Bakker, BM, Hoefsloot, H, Chawes, B, Horner, D, Rasmussen, MA, Smilde, AK & Acar, E 2024, 'Analyzing postprandial metabolomics data using multiway models: a simulation study', BMC Bioinformatics, bind 25, 94. https://doi.org/10.1186/s12859-024-05686-w

APA

Li, L., Yan, S., Bakker, B. M., Hoefsloot, H., Chawes, B., Horner, D., Rasmussen, M. A., Smilde, A. K., & Acar, E. (2024). Analyzing postprandial metabolomics data using multiway models: a simulation study. BMC Bioinformatics, 25, [94]. https://doi.org/10.1186/s12859-024-05686-w

Vancouver

Li L, Yan S, Bakker BM, Hoefsloot H, Chawes B, Horner D o.a. Analyzing postprandial metabolomics data using multiway models: a simulation study. BMC Bioinformatics. 2024;25. 94. https://doi.org/10.1186/s12859-024-05686-w

Author

Li, Lu ; Yan, Shi ; Bakker, Barbara M. ; Hoefsloot, Huub ; Chawes, Bo ; Horner, David ; Rasmussen, Morten A. ; Smilde, Age K. ; Acar, Evrim. / Analyzing postprandial metabolomics data using multiway models : a simulation study. I: BMC Bioinformatics. 2024 ; Bind 25.

Bibtex

@article{d76559d6185946f18065e94f21652011,
title = "Analyzing postprandial metabolomics data using multiway models: a simulation study",
abstract = "Background: Analysis of time-resolved postprandial metabolomics data can improve the understanding of metabolic mechanisms, potentially revealing biomarkers for early diagnosis of metabolic diseases and advancing precision nutrition and medicine. Postprandial metabolomics measurements at several time points from multiple subjects can be arranged as a subjects by metabolites by time points array. Traditional analysis methods are limited in terms of revealing subject groups, related metabolites, and temporal patterns simultaneously from such three-way data. Results: We introduce an unsupervised multiway analysis approach based on the CANDECOMP/PARAFAC (CP) model for improved analysis of postprandial metabolomics data guided by a simulation study. Because of the lack of ground truth in real data, we generate simulated data using a comprehensive human metabolic model. This allows us to assess the performance of CP models in terms of revealing subject groups and underlying metabolic processes. We study three analysis approaches: analysis of fasting-state data using principal component analysis, T0-corrected data (i.e., data corrected by subtracting fasting-state data) using a CP model and full-dynamic (i.e., full postprandial) data using CP. Through extensive simulations, we demonstrate that CP models capture meaningful and stable patterns from simulated meal challenge data, revealing underlying mechanisms and differences between diseased versus healthy groups. Conclusions: Our experiments show that it is crucial to analyze both fasting-state and T0-corrected data for understanding metabolic differences among subject groups. Depending on the nature of the subject group structure, the best group separation may be achieved by CP models of T0-corrected or full-dynamic data. This study introduces an improved analysis approach for postprandial metabolomics data while also shedding light on the debate about correcting baseline values in longitudinal data analysis.",
keywords = "CANDECOMP/PARAFAC (CP), Meal challenge test, Postprandial metabolomics data, Principal component analysis (PCA), Tensor factorizations (multiway data analysis), Time-resolved metabolomics data, Whole-body metabolic model",
author = "Lu Li and Shi Yan and Bakker, {Barbara M.} and Huub Hoefsloot and Bo Chawes and David Horner and Rasmussen, {Morten A.} and Smilde, {Age K.} and Evrim Acar",
note = "Publisher Copyright: {\textcopyright} The Author(s) 2024.",
year = "2024",
doi = "10.1186/s12859-024-05686-w",
language = "English",
volume = "25",
journal = "Computer Applications in the Biosciences",
issn = "1471-2105",
publisher = "Oxford University Press",

}

RIS

TY - JOUR

T1 - Analyzing postprandial metabolomics data using multiway models

T2 - a simulation study

AU - Li, Lu

AU - Yan, Shi

AU - Bakker, Barbara M.

AU - Hoefsloot, Huub

AU - Chawes, Bo

AU - Horner, David

AU - Rasmussen, Morten A.

AU - Smilde, Age K.

AU - Acar, Evrim

N1 - Publisher Copyright: © The Author(s) 2024.

PY - 2024

Y1 - 2024

N2 - Background: Analysis of time-resolved postprandial metabolomics data can improve the understanding of metabolic mechanisms, potentially revealing biomarkers for early diagnosis of metabolic diseases and advancing precision nutrition and medicine. Postprandial metabolomics measurements at several time points from multiple subjects can be arranged as a subjects by metabolites by time points array. Traditional analysis methods are limited in terms of revealing subject groups, related metabolites, and temporal patterns simultaneously from such three-way data. Results: We introduce an unsupervised multiway analysis approach based on the CANDECOMP/PARAFAC (CP) model for improved analysis of postprandial metabolomics data guided by a simulation study. Because of the lack of ground truth in real data, we generate simulated data using a comprehensive human metabolic model. This allows us to assess the performance of CP models in terms of revealing subject groups and underlying metabolic processes. We study three analysis approaches: analysis of fasting-state data using principal component analysis, T0-corrected data (i.e., data corrected by subtracting fasting-state data) using a CP model and full-dynamic (i.e., full postprandial) data using CP. Through extensive simulations, we demonstrate that CP models capture meaningful and stable patterns from simulated meal challenge data, revealing underlying mechanisms and differences between diseased versus healthy groups. Conclusions: Our experiments show that it is crucial to analyze both fasting-state and T0-corrected data for understanding metabolic differences among subject groups. Depending on the nature of the subject group structure, the best group separation may be achieved by CP models of T0-corrected or full-dynamic data. This study introduces an improved analysis approach for postprandial metabolomics data while also shedding light on the debate about correcting baseline values in longitudinal data analysis.

AB - Background: Analysis of time-resolved postprandial metabolomics data can improve the understanding of metabolic mechanisms, potentially revealing biomarkers for early diagnosis of metabolic diseases and advancing precision nutrition and medicine. Postprandial metabolomics measurements at several time points from multiple subjects can be arranged as a subjects by metabolites by time points array. Traditional analysis methods are limited in terms of revealing subject groups, related metabolites, and temporal patterns simultaneously from such three-way data. Results: We introduce an unsupervised multiway analysis approach based on the CANDECOMP/PARAFAC (CP) model for improved analysis of postprandial metabolomics data guided by a simulation study. Because of the lack of ground truth in real data, we generate simulated data using a comprehensive human metabolic model. This allows us to assess the performance of CP models in terms of revealing subject groups and underlying metabolic processes. We study three analysis approaches: analysis of fasting-state data using principal component analysis, T0-corrected data (i.e., data corrected by subtracting fasting-state data) using a CP model and full-dynamic (i.e., full postprandial) data using CP. Through extensive simulations, we demonstrate that CP models capture meaningful and stable patterns from simulated meal challenge data, revealing underlying mechanisms and differences between diseased versus healthy groups. Conclusions: Our experiments show that it is crucial to analyze both fasting-state and T0-corrected data for understanding metabolic differences among subject groups. Depending on the nature of the subject group structure, the best group separation may be achieved by CP models of T0-corrected or full-dynamic data. This study introduces an improved analysis approach for postprandial metabolomics data while also shedding light on the debate about correcting baseline values in longitudinal data analysis.

KW - CANDECOMP/PARAFAC (CP)

KW - Meal challenge test

KW - Postprandial metabolomics data

KW - Principal component analysis (PCA)

KW - Tensor factorizations (multiway data analysis)

KW - Time-resolved metabolomics data

KW - Whole-body metabolic model

U2 - 10.1186/s12859-024-05686-w

DO - 10.1186/s12859-024-05686-w

M3 - Journal article

C2 - 38438850

AN - SCOPUS:85186578802

VL - 25

JO - Computer Applications in the Biosciences

JF - Computer Applications in the Biosciences

SN - 1471-2105

M1 - 94

ER -

ID: 386154217