Heavy-Tailed Noise Suppression and Derivative Wavelet Scalogram for Detecting DNA Copy Number Aberrations

Nha H. Nguyen, An Vo, Haibin Sun, Heng Huang

Research output: Contribution to journalArticle

2 Scopus citations

Abstract

Most existing array comparative genomic hybridization (array CGH) data processing methods and evaluation models assumed that the probability density function of noise in array CGH is a Gaussian distribution. However, in practice such noise distribution is peaky and heavy-tailed. A more accurate and sufficient model of noise in array CGH data is necessary and beneficial to the detection of DNA copy number variations. We analyze the real array CGH data from different platforms and show that the distribution of noise in array CGH data is fitted very well by generalized Gaussian distribution (GGD). Based on our new noise model, we propose a novel array CGH processing method combining the advantages of both smoothing and segmentation approaches. The new method uses generalized Gaussian bivariate shrinkage function and one-directional derivative wavelet scalogram in generalized Gaussian noise. In smoothing step, with the new generalized Gaussian noise model, we derive the heavy-tailed noise suppression algorithm in stationary wavelet domain. In segmentation step, the 1D Gaussian derivative wavelet scalogram is employed to detect break points. Both real and simulated data are used in our experiments. Our new method outperforms other state-of-the-art methods, in terms of both root mean squared errors and receiver operating characteristic curves.

Original languageEnglish (US)
JournalIEEE/ACM Transactions on Computational Biology and Bioinformatics
DOIs
StateAccepted/In press - Jul 6 2017
Externally publishedYes

    Fingerprint

Keywords

  • aCGH
  • Arrays
  • Biological cells
  • Data models
  • DNA
  • DNA copy number variations
  • Gaussian distribution
  • Heavy-tailed noise
  • Probability density function
  • Probes
  • wavelet

ASJC Scopus subject areas

  • Biotechnology
  • Genetics
  • Applied Mathematics

Cite this