The landscape of human STR variation
Author(s)
Willems, Thomas Frederick; Gymrek, Melissa A.; Highnam, Gareth; 1000 Genomes Project Consortium; Mittelman, David; Erlich, Yaniv; ... Show more Show less
DownloadWillems-2014-The landscape of hum.pdf (4.098Mb)
PUBLISHER_CC
Publisher with Creative Commons License
Creative Commons Attribution
Terms of use
Metadata
Show full item recordAbstract
Short tandem repeats are among the most polymorphic loci in the human genome. These loci play a role in the etiology of a range of genetic diseases and have been frequently utilized in forensics, population genetics, and genetic genealogy. Despite this plethora of applications, little is known about the variation of most STRs in the human population. Here, we report the largest-scale analysis of human STR variation to date. We collected information for nearly 700,000 STR loci across more than 1000 individuals in Phase 1 of the 1000 Genomes Project. Extensive quality controls show that reliable allelic spectra can be obtained for close to 90% of the STR loci in the genome. We utilize this call set to analyze determinants of STR variation, assess the human reference genome’s representation of STR alleles, find STR loci with common loss-of-function alleles, and obtain initial estimates of the linkage disequilibrium between STRs and common SNPs. Overall, these analyses further elucidate the scale of genetic variation beyond classical point mutations.
Date issued
2014-08Department
Harvard University--MIT Division of Health Sciences and Technology; Massachusetts Institute of Technology. Computational and Systems Biology ProgramJournal
Genome Research
Publisher
Cold Spring Harbor Laboratory Press
Citation
Willems, T., M. Gymrek, G. Highnam, D. Mittelman, and Y. Erlich. “The Landscape of Human STR Variation.” Genome Research (August 18, 2014).
Version: Final published version
ISSN
1088-9051