华中科技大学学报(医学版) ›› 2026, Vol. 55 ›› Issue (2): 196-203.doi: 10.3870/j.issn.1672-0741.25.07.009

• 论著 • 上一篇    下一篇

基于胆汁酸代谢组学构建妊娠期糖尿病发病风险预测模型*

朱晓琳1#, 周安安1#, 韦颖1, 彭阳1, 仇小强1, 李媛媛2△, 李涵1△   

  1. 1广西医科大学公共卫生学院,南宁 530021
    2华中科技大学同济医学院公共卫生学院,武汉 430030
  • 出版日期:2026-04-15 发布日期:2026-04-16
  • 通讯作者: E-mail: liyuanyuan@hust.edu.cn(李媛媛);leehan1988@126.com(李 涵)
  • 作者简介:#共同第一作者.朱晓琳,女,1998年生,硕士研究生,E-mail: zhuxiaolin2024@163.com;周安安,男,1991年生,博士研究生,E-mail: zhouanan@sr.gxmu.edu.cn
  • 基金资助:
    *国家重点研发计划“政府间国际科技创新合作”专项(No.2022YFE0132900);广西自然科学基金杰出青年科学基金项目(No. 2024GXNSFFA010012)

Risk Prediction of Gestational Diabetes Mellitus Based on Bile Acid Metabolomics

Zhu Xiaolin#, Zhou An'an#, Wei Ying et al   

  1. School of Public Health,Guangxi Medical University,Nanning 530021,China
  • Online:2026-04-15 Published:2026-04-16
  • Contact: E-mail: liyuanyuan@hust.edu.cn;leehan1988@126.com

摘要: 目的 基于靶向血清胆汁酸代谢组学,构建妊娠期糖尿病(gestational diabetes mellitus,GDM)发病风险预测模型。方法 依托前瞻性出生队列人群,测定孕早期血清中32种胆汁酸水平。采用最小绝对值收敛和选择算子(LASSO)筛选GDM生物标志物组,利用极限梯度提升树算法(XGBoost)构建预测模型,并采用bootstrap法评估模型性能;通过SHapley加性解释(SHAP)方法评估各胆汁酸在模型中的重要性及其预测价值。结果 基于胆汁酸代谢组学构建的预测模型表现良好(AUC≥0.840),优于传统风险因素预测模型(AUC=0.828),联合传统风险因素后预测性能更佳(AUC≥0.885)。筛选出的关键胆汁酸包括甘氨熊去氧胆酸-3-硫酸(GUDCA-3S)、牛磺脱氧胆酸-3-硫酸(TDCA-3S)、牛磺石胆酸(TLCA)、甘氨猪胆酸(GHCA)、熊脱氧胆酸(UDCA),以及甘氨胆酸∶胆酸(GCA∶CA)与胆酸∶鹅脱氧胆酸(CA∶CDCA)比值。其中,GUDCA-3S、TDCA-3S浓度升高及GCA∶CA、CA∶CDCA比值升高与GDM风险增加相关;TLCA、GHCA及UDCA浓度升高则与GDM风险降低相关。结论 基于胆汁酸代谢组学构建的GDM发病风险预测模型具有良好效能,相关胆汁酸指标可作为潜在生物标志物,为GDM早期识别、代谢调控及机制研究提供支持。

关键词: 出生队列, 妊娠期糖尿病, 胆汁酸, 预测模型, SHapley加性解释

Abstract: Objective To develop predictive models for the risk of gestational diabetes mellitus(GDM)based on targeted metabolomics of serum bile acids. Methods Within a prospective birth cohort,serum levels of 32 bile acids were quantified in early pregnancy.Biomarkers for GDM were screened by the Least Absolute Shrinkage and Selection Operator(LASSO).The eXtreme Gradient Boosting(XGBoost)algorithm was used to build the predictive models,and bootstrap resampling was applied to evaluate model performance.The SHapley Additive exPlanations(SHAP)method was utilized to assess the importance and predictive value of biomarkers. Results The bile acid metabolomics-based models demonstrated strong performance(AUC ≥ 0.840),which outperformed models based on traditional risk factors(AUC=0.828).After incorporating traditional risk factors,predictive performance improved further(AUC ≥ 0.885).Key bile acids identified included glycoursodeoxycholic acid-3-sulfate(GUDCA-3S),taurodeoxycholic acid-3-sulfate(TDCA-3S),taurolithocholic acid(TLCA),glycohyocholic acid(GHCA),ursodeoxycholic acid(UDCA),and the ratios of glycocholic acid to cholic acid(GCA∶CA)and cholic acid to chenodeoxycholic acid(CA∶CDCA).Elevated levels of GUDCA-3S and TDCA-3S,as well as increased GCA∶CA and CA∶CDCA ratios,were associated with higher GDM risk.In contrast,higher concentrations of TLCA,GHCA,and UDCA were associated with lower GDM risk. Conclusion The bile acid metabolomics-based predictive models exhibit robust predictive efficacy.The identified bile acids may serve as potential biomarkers,providing support for early GDM identification,metabolic regulation,and mechanistic research.

Key words: birth cohort, gestational diabetes mellitus, bile acid, predictive model, SHAP

中图分类号: