• Decrease font size
  • Return font size to normal
  • Increase font size
U.S. Department of Health and Human Services

Scientific Publications by FDA Staff

  • Print
  • Share
  • E-mail

Search Publications



Starting Date

Ending Date

Order by

Entry Details

Nucleic Acids Res 2017 Nov 2;45(19):10989-1003

Separation and assembly of deep sequencing data into discrete sub-population genomes.

Karagiannis K, Simonyan V, Chumakov K, Mazumder R


Sequence heterogeneity is a common characteristic of RNA viruses that is often referred to as sub-populations or quasispecies. Traditional techniques used for assembly of short sequence reads produced by deep sequencing, such as de-novo assemblers, ignore the underlying diversity. Here, we introduce a novel algorithm that simultaneously assembles discrete sequences of multiple genomes present in populations. Using in silico data we were able to detect populations at as low as 0.1% frequency with complete global genome reconstruction and in a single sample detected 16 resolved sequences with no mismatches. We also applied the algorithm to high throughput sequencing data obtained for viruses present in sewage samples and successfully detected multiple sub-populations and recombination events in these diverse mixtures. High sensitivity of the algorithm also enables genomic analysis of heterogeneous pathogen genomes from patient samples and accurate detection of intra-host diversity, enabling not just basic research in personalized medicine but also accurate diagnostics and monitoring drug therapies, which are critical in clinical and regulatory decision-making process.

Category: Journal Article
PubMed ID: #28977510 DOI: 10.1093/nar/gkx755
Includes FDA Authors from Scientific Area(s): Biologics
Entry Created: 2017-10-08 Entry Last Modified: 2017-12-03