Targeted short read sequencing and assembly of re-arrangements and candidate gene loci provide megabase diplotypes

Gi Won Shin, Stephanie U. Greer, Li C. Xia, Ho Joon Lee, Jun Zhou, T. Christian Boles, Hanlee P. Ji

Research output: Contribution to journalArticlepeer-review

7 Scopus citations

Abstract

The human genome is composed of two haplotypes, otherwise called diplotypes, which denote phased polymorphisms and structural variations (SVs) that are derived from both parents. Diplotypes place genetic variants in the context of cis-related variants from a diploid genome. As a result, they provide valuable information about hereditary transmission, context of SV, regulation of gene expression and other features which are informative for understanding human genetics. Successful diplotyping with short read whole genome sequencing generally requires either a large population or parent-child trio samples. To overcome these limitations, we developed a targeted sequencing method for generating megabase (Mb)-scale haplotypes with short reads. One selects specific 0.1-0.2 Mb high molecular weight DNA targets with custom-designed Cas9-guide RNA complexes followed by sequencing with barcoded linked reads. To test this approach, we designed three assays, targeting the BRCA1 gene, the entire 4-Mb major histocompatibility complex locus and 18 well-characterized SVs, respectively. Using an integrated alignment- and assembly-based approach, we generated comprehensive variant diplotypes spanning the entirety of the targeted loci and characterized SVs with exact breakpoints. Our results were comparable in quality to long read sequencing.

Original languageEnglish (US)
Pages (from-to)E115
JournalNucleic acids research
Volume47
Issue number19
DOIs
StatePublished - Nov 4 2019
Externally publishedYes

ASJC Scopus subject areas

  • Genetics

Fingerprint

Dive into the research topics of 'Targeted short read sequencing and assembly of re-arrangements and candidate gene loci provide megabase diplotypes'. Together they form a unique fingerprint.

Cite this