Gene length and proximity to neighbors affect genome-wide expression levels

Francesca Chiaromonte, Webb Miller, Eric E. Bouhassira

Research output: Contribution to journalArticle

51 Citations (Scopus)

Abstract

Steady-state levels of mRNA in cells theoretically depend on the rate and efficiency of transcription and posttranscriptional processing, on mRNA stability, on transcriptional interference from other genes, and on poorly defined long-range chromatin effects. Although each of these cellular processes has been studied in detail for a few genes, it is not possible to predict expression levels by simply examining gene sequences. In this report, we have used a bioinformatics approach to identify critical factors that influence expression levels. To simplify the problem, we have limited our analysis to the collection of genes expressed in all tissues, because such genes provide a unique opportunity to distinguish the role of general genomic features that constrain gene expression from the effect of tissue-specific factors. Using correlation and regression techniques, we have investigated the dependence between expression level and morphological parameters (distance to neighbors, gene, mRNA or 3′-UTR length, number of exons, etc.) that can be directly related to transcription, posttranscriptional processing, mRNA stability, or transcriptional interference. We found that, on a genome-wide scale, highly expressed genes are significantly farther from their closest neighboring genes, are smaller, contain a moderate number of exons, and produce shorter mRNAs with shorter 3′-UTRs. This confirms that transcriptional and posttranscriptional processes are highly interrelated and implies that transcriptional interference plays a role in determining steady-state levels of mRNA in cells.

Original languageEnglish (US)
Pages (from-to)2602-2608
Number of pages7
JournalGenome Research
Volume13
Issue number12
DOIs
StatePublished - Dec 2003

Fingerprint

Genome
Genes
Messenger RNA
RNA Stability
3' Untranslated Regions
Exons
Thromboplastin
Computational Biology
Chromatin
Gene Expression

ASJC Scopus subject areas

  • Genetics

Cite this

Gene length and proximity to neighbors affect genome-wide expression levels. / Chiaromonte, Francesca; Miller, Webb; Bouhassira, Eric E.

In: Genome Research, Vol. 13, No. 12, 12.2003, p. 2602-2608.

Research output: Contribution to journalArticle

Chiaromonte, Francesca ; Miller, Webb ; Bouhassira, Eric E. / Gene length and proximity to neighbors affect genome-wide expression levels. In: Genome Research. 2003 ; Vol. 13, No. 12. pp. 2602-2608.
@article{6b492eb4f1eb4083b7a0dabbebe466a2,
title = "Gene length and proximity to neighbors affect genome-wide expression levels",
abstract = "Steady-state levels of mRNA in cells theoretically depend on the rate and efficiency of transcription and posttranscriptional processing, on mRNA stability, on transcriptional interference from other genes, and on poorly defined long-range chromatin effects. Although each of these cellular processes has been studied in detail for a few genes, it is not possible to predict expression levels by simply examining gene sequences. In this report, we have used a bioinformatics approach to identify critical factors that influence expression levels. To simplify the problem, we have limited our analysis to the collection of genes expressed in all tissues, because such genes provide a unique opportunity to distinguish the role of general genomic features that constrain gene expression from the effect of tissue-specific factors. Using correlation and regression techniques, we have investigated the dependence between expression level and morphological parameters (distance to neighbors, gene, mRNA or 3′-UTR length, number of exons, etc.) that can be directly related to transcription, posttranscriptional processing, mRNA stability, or transcriptional interference. We found that, on a genome-wide scale, highly expressed genes are significantly farther from their closest neighboring genes, are smaller, contain a moderate number of exons, and produce shorter mRNAs with shorter 3′-UTRs. This confirms that transcriptional and posttranscriptional processes are highly interrelated and implies that transcriptional interference plays a role in determining steady-state levels of mRNA in cells.",
author = "Francesca Chiaromonte and Webb Miller and Bouhassira, {Eric E.}",
year = "2003",
month = "12",
doi = "10.1101/gr.1169203",
language = "English (US)",
volume = "13",
pages = "2602--2608",
journal = "Genome Research",
issn = "1088-9051",
publisher = "Cold Spring Harbor Laboratory Press",
number = "12",

}

TY - JOUR

T1 - Gene length and proximity to neighbors affect genome-wide expression levels

AU - Chiaromonte, Francesca

AU - Miller, Webb

AU - Bouhassira, Eric E.

PY - 2003/12

Y1 - 2003/12

N2 - Steady-state levels of mRNA in cells theoretically depend on the rate and efficiency of transcription and posttranscriptional processing, on mRNA stability, on transcriptional interference from other genes, and on poorly defined long-range chromatin effects. Although each of these cellular processes has been studied in detail for a few genes, it is not possible to predict expression levels by simply examining gene sequences. In this report, we have used a bioinformatics approach to identify critical factors that influence expression levels. To simplify the problem, we have limited our analysis to the collection of genes expressed in all tissues, because such genes provide a unique opportunity to distinguish the role of general genomic features that constrain gene expression from the effect of tissue-specific factors. Using correlation and regression techniques, we have investigated the dependence between expression level and morphological parameters (distance to neighbors, gene, mRNA or 3′-UTR length, number of exons, etc.) that can be directly related to transcription, posttranscriptional processing, mRNA stability, or transcriptional interference. We found that, on a genome-wide scale, highly expressed genes are significantly farther from their closest neighboring genes, are smaller, contain a moderate number of exons, and produce shorter mRNAs with shorter 3′-UTRs. This confirms that transcriptional and posttranscriptional processes are highly interrelated and implies that transcriptional interference plays a role in determining steady-state levels of mRNA in cells.

AB - Steady-state levels of mRNA in cells theoretically depend on the rate and efficiency of transcription and posttranscriptional processing, on mRNA stability, on transcriptional interference from other genes, and on poorly defined long-range chromatin effects. Although each of these cellular processes has been studied in detail for a few genes, it is not possible to predict expression levels by simply examining gene sequences. In this report, we have used a bioinformatics approach to identify critical factors that influence expression levels. To simplify the problem, we have limited our analysis to the collection of genes expressed in all tissues, because such genes provide a unique opportunity to distinguish the role of general genomic features that constrain gene expression from the effect of tissue-specific factors. Using correlation and regression techniques, we have investigated the dependence between expression level and morphological parameters (distance to neighbors, gene, mRNA or 3′-UTR length, number of exons, etc.) that can be directly related to transcription, posttranscriptional processing, mRNA stability, or transcriptional interference. We found that, on a genome-wide scale, highly expressed genes are significantly farther from their closest neighboring genes, are smaller, contain a moderate number of exons, and produce shorter mRNAs with shorter 3′-UTRs. This confirms that transcriptional and posttranscriptional processes are highly interrelated and implies that transcriptional interference plays a role in determining steady-state levels of mRNA in cells.

UR - http://www.scopus.com/inward/record.url?scp=0348013067&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0348013067&partnerID=8YFLogxK

U2 - 10.1101/gr.1169203

DO - 10.1101/gr.1169203

M3 - Article

C2 - 14613975

AN - SCOPUS:0348013067

VL - 13

SP - 2602

EP - 2608

JO - Genome Research

JF - Genome Research

SN - 1088-9051

IS - 12

ER -