Applying HaploReg v4.1 analysis to the 536 variants resulted in 9 categories (Supplementary Table 8): miscRNA (1 variant); snoRNA (2 variants); microRNA (4 variants); snRNA (9 variants); pseudogenes (14 variants); sequencing in progress (43 variants); LINC RNA (86 variants); and 372 variants within protein coding genes. In addition, some variants annotate to the same gene resulting in a total of 139 genes (protein-coding or non-coding). Of those genes, 6 are exceptionally long, containing over a million base-pairs, the longest of which is PTPRT coded by 1117219bp. The shortest genes are the ones coding for micro (MIR3143) or small nuclear (U7) RNAs at 63bp each. There is only partial information regarding the chromatin state of each variant. However, from the information gathered in the analysis we observed 14 transcription start sites and 245 enhancers (Supplementary Table 8).
ASJC Scopus subject areas
- Cell Biology