Whole genome sequencing versus traditional genotyping for investigation of a Mycobacterium tuberculosis outbreak: a longitudinal molecular epidemiological study

PLoS Med. 2013;10(2):e1001387. doi: 10.1371/journal.pmed.1001387. Epub 2013 Feb 12.

Abstract

Background: Understanding Mycobacterium tuberculosis (Mtb) transmission is essential to guide efficient tuberculosis control strategies. Traditional strain typing lacks sufficient discriminatory power to resolve large outbreaks. Here, we tested the potential of next-generation genome sequencing for identifying outbreak-related transmission chains.

Methods and findings: During long-term (1997 to 2010) prospective population-based molecular epidemiological surveillance comprising a total of 2,301 patients, we identified a large outbreak caused by an Mtb strain of the Haarlem lineage. The main performance outcome measure of whole genome sequencing (WGS) analyses was the degree of correlation of the WGS analyses with contact tracing data and the spatio-temporal distribution of the outbreak cases. WGS analyses of the 86 isolates revealed 85 single nucleotide polymorphisms (SNPs), subdividing the outbreak into seven genome clusters (two to 24 isolates each), plus 36 unique SNP profiles. WGS results showed that the first outbreak isolates detected in 1997 were falsely clustered by classical genotyping. In 1998, one clone (termed "Hamburg clone") started expanding, apparently independently of differences in the social environment of early cases. Genome-based clustering patterns were in better accordance with contact tracing data and the geographical distribution of the cases than clustering patterns based on classical genotyping. A maximum of three SNPs were identified in eight confirmed human-to-human transmission chains, involving 31 patients. We estimated the Mtb genome evolutionary rate at 0.4 mutations per genome per year. This rate suggests that Mtb grows in its natural host with a doubling time of approximately 22 h (400 generations per year). Based on the genome variation discovered, emergence of the Hamburg clone was dated back to a period between 1993 and 1997, hence shortly before the discovery of the outbreak through epidemiological surveillance.
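The relation between the reported doubling time and the number of generations per year can be checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and the per-generation rate is obtained here by plain division of the two reported figures; the abstract does not state how the authors linked the evolutionary rate to the doubling time.

```python
# Illustrative arithmetic only; function names are hypothetical.
HOURS_PER_YEAR = 365.25 * 24  # 8766 hours


def generations_per_year(doubling_time_hours: float) -> float:
    """Number of doublings that fit into one year."""
    return HOURS_PER_YEAR / doubling_time_hours


def mutations_per_generation(rate_per_year: float, doubling_time_hours: float) -> float:
    """Per-generation rate implied by a yearly rate and a doubling time.

    Assumption: this is a naive division, not the authors' stated method.
    """
    return rate_per_year / generations_per_year(doubling_time_hours)


gens = generations_per_year(22.0)             # doubling time from the abstract
per_gen = mutations_per_generation(0.4, 22.0)  # 0.4 mutations/genome/year

print(f"{gens:.0f} generations per year")
print(f"{per_gen:.4f} mutations per genome per generation")
```

A 22 h doubling time gives roughly 398 generations per year, consistent with the rounded figure of 400 quoted above.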

Conclusions: Our findings suggest that WGS is superior to conventional genotyping for Mtb pathogen tracing and investigating micro-epidemics. WGS provides a measure of Mtb genome evolution over time in its natural host context.
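To make the notions of genome clusters, unique SNP profiles, and SNP distance concrete, the following minimal sketch groups isolates with identical SNP profiles and lists isolate pairs that differ by at most three SNPs. Everything in it is an assumption for illustration: the isolate names, positions, and helper functions are hypothetical, a profile is simplified to a set of variant positions (alleles are ignored), and the three-SNP limit simply mirrors the maximum observed in the confirmed transmission chains rather than a validated cutoff.

```python
from collections import defaultdict
from itertools import combinations

# Hypothetical toy data: isolate ID -> variant positions relative to a reference.
profiles = {
    "iso_A": frozenset({101, 2050}),
    "iso_B": frozenset({101, 2050}),
    "iso_C": frozenset({101, 2050, 7777}),
    "iso_D": frozenset({101, 3000, 4000, 5000, 6000}),
}


def genome_clusters(profiles):
    """Split isolates into clusters of identical profiles and unique profiles."""
    groups = defaultdict(list)
    for isolate, profile in profiles.items():
        groups[profile].append(isolate)
    clusters = [sorted(members) for members in groups.values() if len(members) >= 2]
    unique = sorted(members[0] for members in groups.values() if len(members) == 1)
    return clusters, unique


def snp_distance(a, b):
    """Number of positions at which two profiles differ (symmetric difference)."""
    return len(a ^ b)


def pairs_within(profiles, max_snps):
    """All isolate pairs whose SNP distance is at most max_snps."""
    return [
        (x, y)
        for x, y in combinations(sorted(profiles), 2)
        if snp_distance(profiles[x], profiles[y]) <= max_snps
    ]


clusters, unique = genome_clusters(profiles)
close_pairs = pairs_within(profiles, max_snps=3)
print(clusters)
print(unique)
print(close_pairs)
```

In this toy data set, iso_A and iso_B form one genome cluster, iso_C and iso_D carry unique profiles, and only iso_D lies more than three SNPs from every other isolate.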

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adolescent
  • Adult
  • Aged
  • Aged, 80 and over
  • Bacterial Typing Techniques / methods*
  • Chi-Square Distribution
  • Child
  • Child, Preschool
  • Cluster Analysis
  • Contact Tracing
  • DNA, Bacterial / analysis*
  • Disease Outbreaks*
  • Female
  • Genome, Bacterial*
  • Genome-Wide Association Study
  • Genotype
  • Germany / epidemiology
  • Humans
  • Least-Squares Analysis
  • Linear Models
  • Longitudinal Studies
  • Male
  • Middle Aged
  • Molecular Epidemiology
  • Mycobacterium tuberculosis / classification
  • Mycobacterium tuberculosis / genetics*
  • Phenotype
  • Polymorphism, Single Nucleotide
  • Population Surveillance
  • Prospective Studies
  • Sequence Analysis, DNA / methods*
  • Tuberculosis / epidemiology*
  • Tuberculosis / genetics*
  • Tuberculosis / microbiology
  • Tuberculosis / transmission
  • Young Adult

Substances

  • DNA, Bacterial

Grants and funding

This work was supported by the Schleswig-Holsteinische Gesellschaft zur Verhütung und Bekämpfung der Tuberkulose und der Lungenkrankheiten e.V., the EU FP7 TB-PAN-NET (FP7-223681), EU FP7 Patho-Ngen-Trace (FP7-278864-2), ATM Muséum National d'Histoire Naturelle “Biodiversité et rôle des microorganismes dans les écosystèmes actuels et passés”, and the BMBF-funded TBornotTB network (01KI0784) project. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.