Assessing the performance of different high-density tiling microarray strategies for mapping transcribed regions of the human genome

Olof Emanuelsson, Ugrappa Nagalakshmi, Deyou Zheng, Joel S. Rozowsky, Alexander E. Urban, Jiang Du, Zheng Lian, Viktor Stolc, Sherman Weissman, Michael Snyder, Mark B. Gerstein

Research output: Contribution to journalArticle

24 Citations (Scopus)

Abstract

Genomic tiling microarrays have become a popular tool for interrogating the transcriptional activity of large regions of the genome in an unbiased fashion. There are several key parameters associated with each tiling experiment (e.g., experimental protocols and genomic tiling density). Here, we assess the role of these parameters as they are manifest in different tiling-array platforms used for transcription mapping. First, we analyze how a number of published tiling-array experiments agree with established gene annotation on human chromosome 22. We observe that the transcription detected from high-density arrays correlates substantially better with annotation than that from other array types. Next, we analyze the transcription-mapping performance of the two main high-density oligonucleotide array platforms in the ENCODE regions of the human genome. We hybridize identical biological samples and develop several ways of scoring the arrays and segmenting the genome into transcribed and nontranscribed regions, with the aim of making the platforms most comparable to each other. Finally, we develop a platform comparison approach based on agreement with known annotation. Overall, we find that the performance improves with more data points per locus, coupled with statistical scoring approaches that properly take advantage of this, where this larger number of data points arises from higher genomic tiling density and the use of replicate arrays and mismatches. While we do find significant differences in the performance of the two high-density platforms, we also find that they complement each other to some extent. Finally, our experiments reveal a significant amount of novel transcription outside of known genes, and an appreciable sample of this was validated by independent experiments.

Original languageEnglish (US)
Pages (from-to)886-897
Number of pages12
JournalGenome Research
Volume17
Issue number6
DOIs
StatePublished - Jun 2007
Externally publishedYes

Fingerprint

Human Genome
Genome
Molecular Sequence Annotation
Chromosomes, Human, Pair 22
Human Chromosomes
Oligonucleotide Array Sequence Analysis
Genes

ASJC Scopus subject areas

  • Genetics

Cite this

Assessing the performance of different high-density tiling microarray strategies for mapping transcribed regions of the human genome. / Emanuelsson, Olof; Nagalakshmi, Ugrappa; Zheng, Deyou; Rozowsky, Joel S.; Urban, Alexander E.; Du, Jiang; Lian, Zheng; Stolc, Viktor; Weissman, Sherman; Snyder, Michael; Gerstein, Mark B.

In: Genome Research, Vol. 17, No. 6, 06.2007, p. 886-897.

Research output: Contribution to journalArticle

Emanuelsson, O, Nagalakshmi, U, Zheng, D, Rozowsky, JS, Urban, AE, Du, J, Lian, Z, Stolc, V, Weissman, S, Snyder, M & Gerstein, MB 2007, 'Assessing the performance of different high-density tiling microarray strategies for mapping transcribed regions of the human genome', Genome Research, vol. 17, no. 6, pp. 886-897. https://doi.org/10.1101/gr.5014606
Emanuelsson, Olof ; Nagalakshmi, Ugrappa ; Zheng, Deyou ; Rozowsky, Joel S. ; Urban, Alexander E. ; Du, Jiang ; Lian, Zheng ; Stolc, Viktor ; Weissman, Sherman ; Snyder, Michael ; Gerstein, Mark B. / Assessing the performance of different high-density tiling microarray strategies for mapping transcribed regions of the human genome. In: Genome Research. 2007 ; Vol. 17, No. 6. pp. 886-897.
@article{8cbead145fa348c3b302a74b697cc899,
title = "Assessing the performance of different high-density tiling microarray strategies for mapping transcribed regions of the human genome",
abstract = "Genomic tiling microarrays have become a popular tool for interrogating the transcriptional activity of large regions of the genome in an unbiased fashion. There are several key parameters associated with each tiling experiment (e.g., experimental protocols and genomic tiling density). Here, we assess the role of these parameters as they are manifest in different tiling-array platforms used for transcription mapping. First, we analyze how a number of published tiling-array experiments agree with established gene annotation on human chromosome 22. We observe that the transcription detected from high-density arrays correlates substantially better with annotation than that from other array types. Next, we analyze the transcription-mapping performance of the two main high-density oligonucleotide array platforms in the ENCODE regions of the human genome. We hybridize identical biological samples and develop several ways of scoring the arrays and segmenting the genome into transcribed and nontranscribed regions, with the aim of making the platforms most comparable to each other. Finally, we develop a platform comparison approach based on agreement with known annotation. Overall, we find that the performance improves with more data points per locus, coupled with statistical scoring approaches that properly take advantage of this, where this larger number of data points arises from higher genomic tiling density and the use of replicate arrays and mismatches. While we do find significant differences in the performance of the two high-density platforms, we also find that they complement each other to some extent. Finally, our experiments reveal a significant amount of novel transcription outside of known genes, and an appreciable sample of this was validated by independent experiments.",
author = "Olof Emanuelsson and Ugrappa Nagalakshmi and Deyou Zheng and Rozowsky, {Joel S.} and Urban, {Alexander E.} and Jiang Du and Zheng Lian and Viktor Stolc and Sherman Weissman and Michael Snyder and Gerstein, {Mark B.}",
year = "2007",
month = "6",
doi = "10.1101/gr.5014606",
language = "English (US)",
volume = "17",
pages = "886--897",
journal = "Genome Research",
issn = "1088-9051",
publisher = "Cold Spring Harbor Laboratory Press",
number = "6",

}

TY - JOUR

T1 - Assessing the performance of different high-density tiling microarray strategies for mapping transcribed regions of the human genome

AU - Emanuelsson, Olof

AU - Nagalakshmi, Ugrappa

AU - Zheng, Deyou

AU - Rozowsky, Joel S.

AU - Urban, Alexander E.

AU - Du, Jiang

AU - Lian, Zheng

AU - Stolc, Viktor

AU - Weissman, Sherman

AU - Snyder, Michael

AU - Gerstein, Mark B.

PY - 2007/6

Y1 - 2007/6

N2 - Genomic tiling microarrays have become a popular tool for interrogating the transcriptional activity of large regions of the genome in an unbiased fashion. There are several key parameters associated with each tiling experiment (e.g., experimental protocols and genomic tiling density). Here, we assess the role of these parameters as they are manifest in different tiling-array platforms used for transcription mapping. First, we analyze how a number of published tiling-array experiments agree with established gene annotation on human chromosome 22. We observe that the transcription detected from high-density arrays correlates substantially better with annotation than that from other array types. Next, we analyze the transcription-mapping performance of the two main high-density oligonucleotide array platforms in the ENCODE regions of the human genome. We hybridize identical biological samples and develop several ways of scoring the arrays and segmenting the genome into transcribed and nontranscribed regions, with the aim of making the platforms most comparable to each other. Finally, we develop a platform comparison approach based on agreement with known annotation. Overall, we find that the performance improves with more data points per locus, coupled with statistical scoring approaches that properly take advantage of this, where this larger number of data points arises from higher genomic tiling density and the use of replicate arrays and mismatches. While we do find significant differences in the performance of the two high-density platforms, we also find that they complement each other to some extent. Finally, our experiments reveal a significant amount of novel transcription outside of known genes, and an appreciable sample of this was validated by independent experiments.

AB - Genomic tiling microarrays have become a popular tool for interrogating the transcriptional activity of large regions of the genome in an unbiased fashion. There are several key parameters associated with each tiling experiment (e.g., experimental protocols and genomic tiling density). Here, we assess the role of these parameters as they are manifest in different tiling-array platforms used for transcription mapping. First, we analyze how a number of published tiling-array experiments agree with established gene annotation on human chromosome 22. We observe that the transcription detected from high-density arrays correlates substantially better with annotation than that from other array types. Next, we analyze the transcription-mapping performance of the two main high-density oligonucleotide array platforms in the ENCODE regions of the human genome. We hybridize identical biological samples and develop several ways of scoring the arrays and segmenting the genome into transcribed and nontranscribed regions, with the aim of making the platforms most comparable to each other. Finally, we develop a platform comparison approach based on agreement with known annotation. Overall, we find that the performance improves with more data points per locus, coupled with statistical scoring approaches that properly take advantage of this, where this larger number of data points arises from higher genomic tiling density and the use of replicate arrays and mismatches. While we do find significant differences in the performance of the two high-density platforms, we also find that they complement each other to some extent. Finally, our experiments reveal a significant amount of novel transcription outside of known genes, and an appreciable sample of this was validated by independent experiments.

UR - http://www.scopus.com/inward/record.url?scp=34250376888&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=34250376888&partnerID=8YFLogxK

U2 - 10.1101/gr.5014606

DO - 10.1101/gr.5014606

M3 - Article

C2 - 17119069

AN - SCOPUS:34250376888

VL - 17

SP - 886

EP - 897

JO - Genome Research

JF - Genome Research

SN - 1088-9051

IS - 6

ER -