Published November 29, 2024 | Version v1
Journal article Open

Paired analysis of host and pathogen genomes identifies determinants of human tuberculosis

Description

Infectious disease is the result of interactions between host and pathogen and can depend on genetic variations in both. We conduct a genome-to-genome study of paired human and Mycobacterium tuberculosis genomes from a cohort of 1556 tuberculosis patients in Lima, Peru. We identify an association between a human intronic variant (rs3130660, OR = 10.06, 95%CI: 4.87 − 20.77, P = 7.92 × 10−8) in the FLOT1 gene and a subclavaluee of Mtb Lineage 2. In a human macrophage infection model, we observe hosts with the rs3130660-A allele exhibited stronger interferon gene signatures. The interacting strains have altered redox states due to a thioredoxin reductase mutation. We investigate this association in a 2020 cohort of 699 patients recruited during the COVID-19 pandemic. While the prevalence of the interacting strain almost doubled between 2010 and 2020, its infection is not associated with rs3130660 in this recent cohort. These findings suggest a complex interplay among host, pathogen, and environmental factors in tuberculosis dynamics.

Data availability

The human genotyping data generated in this study have been deposited in the dbGAP database under accession code phs002025.v1.p1[https://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs002025.v1.p1] and phs003718.v1 [https://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id= phs003718.v1.p1]. The Mycobacterium tuberculosis whole genome sequences in this study have been deposited in the BioProject database under accession code PRJNA1039243. The RNA sequencing data generated in this study have been deposited in the GEO database under accession code GSE262379. The eQTL Catalog release v6 database can be downloaded at https://ftp.ebi.ac.uk/pub/databases/spot/eQTL/. The Genotype Tissue Expression (GTEx release v8) database can be downloaded at https://gtexportal.org/home/protectedDataAccess. The whole-genome sequences of Mtb strains were obtained at the NCBI Sequence Read Archive (SRA) database (https://www.ncbi.nlm.nih.gov/sra) with Study ID listed in the Supplementary Data. The Mtb reference assembly (H37Rv NC_000962.3) can be downloaded at https://www.ncbi.nlm.nih.gov/nuccore/CP003248. Source data are provided with this paper.

All code for generating the figures presented in the manuscript are available at Luo, Y., Huang, C-C., Howard, N.C., Wang X., Liu Q.Y., et al. Paired analysis of host and pathogen genomes identifies determinants of human tuberculosis, https://github.com/yang-luo-lab/TB-g2g, https://doi.org/10.5281/zenodo.13321932, 2024.

Files

Paired-analysis-of-host-and-pathogen-genomes-identifies-determinants-of-human-tuberculosis.pdf

Files (30.2 MB)

Name Size Download all
md5:a370231a1c66d2fe82e45d2a2483aa61
5.9 MB Download
Article
md5:75f3a1cbbaa10e948c985cee323f6c9b
3.7 MB Preview Download
md5:3e0139bac64b2fda4ca609fa30c28cb2
36.4 kB Preview Download
Supplementary information files
md5:aa39773b135425384ffd4b0698375f5f
20.5 MB Preview Download

Additional details

Identifiers

DOI
10.1038/s41467-024-54741-w
Other
oai:uchicago.tind.io:14179

Funding

TB Research Unit Network, National Institutes of Health
U19 AI111224
National Human Genome Research Institute
U01 HG009379
Kennedy Trust
KTRR Senior Research Fellowship
Unknown funder
T32 AI007535
Unknown funder
U19 AI165573
Unknown funder
U19162584
Unknown funder
P01 AI181898
Unknown funder
U19 AI142793

UChicago Information

Division(s)
Biological Sciences Division
Department(s)
Genetics, Genomics, and Systems Biology