Home Research Feeds Integrating respiratory microbiome and host immune response through machine learning for respiratory tract infection diagnosis

Integrating respiratory microbiome and host immune response through machine learning for respiratory tract infection diagnosisOriginal paper

Researched by:

  • Karen Pendergrass

Last Updated: 2026-07-04

Karen Pendergrass
Karen Pendergrass

Karen Pendergrass is a microbiome researcher specializing in microbiome-targeted interventions (MBTIs). She systematically analyzes scientific literature to identify microbial patterns, develop hypotheses, and validate interventions. As the founder of the Microbiome Signatures Database, she bridges microbiome research with clinical practice. In 2012, based on her own investigative research, she became the first documented case of FMT for Celiac Disease, four years before the first published case study.

Read More
Location
China
Sample Site
Lung
Species
Homo sapiens

What was studied?

At present, the diagnosis of lower respiratory tract infections (LRTIs) is difficult, and there is an urgent need for better diagnostic methods. This study enrolled 136 patients from 2020 to 2021 and collected bronchoalveolar lavage fluid (BALF) specimens. We used metatranscriptome to analyze the lower respiratory tract microbiome (LRTM) and host immune response. The diversity of the LRTM in LRTIs significantly decreased, manifested by a decrease in the abundance of normal microbiota and an increase in the abundance of opportunistic pathogens. The upregulated differentially expressed genes (DEGs) in the LRTIs group were mainly enriched in infection immune response-related pathways. Klebsiella pneumoniae had the most significant increase in abundance in LRTIs, which was strongly correlated with host infection or inflammation genes TNFRSF1B, CSF3R, and IL6R. We combined LRTM and host transcriptome data to construct a machine-learning model with 12 screened features to discriminate LRTIs and non-LRTIs. The results showed that the model trained by Random Forest in the validate set had the best performance (ROC AUC: 0.937, 95% CI: 0.832-1). The independent external dataset showed an accuracy of 76.5% for this model. This study suggests that the model integrating LRTM and host transcriptome data can be an effective tool for LRTIs diagnosis.

Join the Roundtable

Contribute to published consensus reports, connect with top clinicians and researchers, and receive exclusive invitations to roundtable conferences.

Join the Waitlist and help shape the future of microbiome medicine.