A stationary wavelet entropy-based clustering approach accurately predicts gene expression

Nha H. Nguyen, An Vo, Inchan Choi, Kyoung Jae Won

Research output: Contribution to journalArticle

7 Citations (Scopus)

Abstract

Studying epigenetic landscapes is important to understand the condition for gene regulation. Clustering is a useful approach to study epigenetic landscapes by grouping genes based on their epigenetic conditions. However, classical clustering approaches that often use a representative value of the signals in a fixed-sized window do not fully use the information written in the epigenetic landscapes. Clustering approaches to maximize the information of the epigenetic signals are necessary for better understanding gene regulatory environments. For effective clustering of multidimensional epigenetic signals, we developed a method called Dewer, which uses the entropy of stationary wavelet of epigenetic signals inside enriched regions for gene clustering. Interestingly, the gene expression levels were highly correlated with the entropy levels of epigenetic signals. Dewer separates genes better than a window-based approach in the assessment using gene expression and achieved a correlation coefficient above 0.9 without using any training procedure. Our results show that the changes of the epigenetic signals are useful to study gene regulation.

Original languageEnglish (US)
Pages (from-to)236-249
Number of pages14
JournalJournal of Computational Biology
Volume22
Issue number3
DOIs
StatePublished - Jan 1 2015
Externally publishedYes

Fingerprint

Entropy
Epigenomics
Gene expression
Gene Expression
Cluster Analysis
Wavelets
Genes
Clustering
Predict
Gene
Gene Regulation
Correlation coefficient
Grouping
Maximise
Regulator Genes
Necessary

Keywords

  • algorithms
  • gene expression
  • genetic analysis
  • genome analysis
  • next-generation sequencing

ASJC Scopus subject areas

  • Modeling and Simulation
  • Molecular Biology
  • Genetics
  • Computational Mathematics
  • Computational Theory and Mathematics

Cite this

A stationary wavelet entropy-based clustering approach accurately predicts gene expression. / Nguyen, Nha H.; Vo, An; Choi, Inchan; Won, Kyoung Jae.

In: Journal of Computational Biology, Vol. 22, No. 3, 01.01.2015, p. 236-249.

Research output: Contribution to journalArticle

@article{12b23afe00fa461f97936bb26cca9008,
title = "A stationary wavelet entropy-based clustering approach accurately predicts gene expression",
abstract = "Studying epigenetic landscapes is important to understand the condition for gene regulation. Clustering is a useful approach to study epigenetic landscapes by grouping genes based on their epigenetic conditions. However, classical clustering approaches that often use a representative value of the signals in a fixed-sized window do not fully use the information written in the epigenetic landscapes. Clustering approaches to maximize the information of the epigenetic signals are necessary for better understanding gene regulatory environments. For effective clustering of multidimensional epigenetic signals, we developed a method called Dewer, which uses the entropy of stationary wavelet of epigenetic signals inside enriched regions for gene clustering. Interestingly, the gene expression levels were highly correlated with the entropy levels of epigenetic signals. Dewer separates genes better than a window-based approach in the assessment using gene expression and achieved a correlation coefficient above 0.9 without using any training procedure. Our results show that the changes of the epigenetic signals are useful to study gene regulation.",
keywords = "algorithms, gene expression, genetic analysis, genome analysis, next-generation sequencing",
author = "Nguyen, {Nha H.} and An Vo and Inchan Choi and Won, {Kyoung Jae}",
year = "2015",
month = "1",
day = "1",
doi = "10.1089/cmb.2014.0221",
language = "English (US)",
volume = "22",
pages = "236--249",
journal = "Journal of Computational Biology",
issn = "1066-5277",
publisher = "Mary Ann Liebert Inc.",
number = "3",

}

TY - JOUR

T1 - A stationary wavelet entropy-based clustering approach accurately predicts gene expression

AU - Nguyen, Nha H.

AU - Vo, An

AU - Choi, Inchan

AU - Won, Kyoung Jae

PY - 2015/1/1

Y1 - 2015/1/1

N2 - Studying epigenetic landscapes is important to understand the condition for gene regulation. Clustering is a useful approach to study epigenetic landscapes by grouping genes based on their epigenetic conditions. However, classical clustering approaches that often use a representative value of the signals in a fixed-sized window do not fully use the information written in the epigenetic landscapes. Clustering approaches to maximize the information of the epigenetic signals are necessary for better understanding gene regulatory environments. For effective clustering of multidimensional epigenetic signals, we developed a method called Dewer, which uses the entropy of stationary wavelet of epigenetic signals inside enriched regions for gene clustering. Interestingly, the gene expression levels were highly correlated with the entropy levels of epigenetic signals. Dewer separates genes better than a window-based approach in the assessment using gene expression and achieved a correlation coefficient above 0.9 without using any training procedure. Our results show that the changes of the epigenetic signals are useful to study gene regulation.

AB - Studying epigenetic landscapes is important to understand the condition for gene regulation. Clustering is a useful approach to study epigenetic landscapes by grouping genes based on their epigenetic conditions. However, classical clustering approaches that often use a representative value of the signals in a fixed-sized window do not fully use the information written in the epigenetic landscapes. Clustering approaches to maximize the information of the epigenetic signals are necessary for better understanding gene regulatory environments. For effective clustering of multidimensional epigenetic signals, we developed a method called Dewer, which uses the entropy of stationary wavelet of epigenetic signals inside enriched regions for gene clustering. Interestingly, the gene expression levels were highly correlated with the entropy levels of epigenetic signals. Dewer separates genes better than a window-based approach in the assessment using gene expression and achieved a correlation coefficient above 0.9 without using any training procedure. Our results show that the changes of the epigenetic signals are useful to study gene regulation.

KW - algorithms

KW - gene expression

KW - genetic analysis

KW - genome analysis

KW - next-generation sequencing

UR - http://www.scopus.com/inward/record.url?scp=84924763000&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84924763000&partnerID=8YFLogxK

U2 - 10.1089/cmb.2014.0221

DO - 10.1089/cmb.2014.0221

M3 - Article

VL - 22

SP - 236

EP - 249

JO - Journal of Computational Biology

JF - Journal of Computational Biology

SN - 1066-5277

IS - 3

ER -