OPUS-Dom

Applying the Folding-Based Method VECFOLD to Determine Protein Domain Boundaries

Yinghao Wu, Athanasios D. Dousis, Mingzhi Chen, Jialin Li, Jianpeng Ma

Research output: Contribution to journalArticle

10 Citations (Scopus)

Abstract

In this article, we present a de novo method for predicting protein domain boundaries, called OPUS-Dom. The core of the method is a novel coarse-grained folding method, VECFOLD, which constructs low-resolution structural models from a target sequence by folding a chain of vectors representing the predicted secondary-structure elements. OPUS-Dom generates a large ensemble of folded structure decoys by VECFOLD and labels the domain boundaries of each decoy by a domain parsing algorithm. Consensus domain boundaries are then derived from the statistical distribution of the putative boundaries and three empirical sequence-based domain profiles. OPUS-Dom generally outperformed several state-of-the-art domain prediction algorithms over various benchmark protein sets. Even though each VECFOLD-generated structure contains large errors, collectively these structures provide a more robust delineation of domain boundaries. The success of OPUS-Dom suggests that the arrangement of protein domains is more a consequence of limited coordination patterns per domain arising from tertiary packing of secondary-structure segments, rather than sequence-specific constraints.

Original languageEnglish (US)
Pages (from-to)1314-1329
Number of pages16
JournalJournal of Molecular Biology
Volume385
Issue number4
DOIs
StatePublished - Jan 30 2009
Externally publishedYes

Fingerprint

Statistical Distributions
Benchmarking
Structural Models
Protein Domains
Proteins

Keywords

  • chain skeleton
  • domain boundary
  • protein folding
  • statistical scores
  • structure prediction

ASJC Scopus subject areas

  • Molecular Biology

Cite this

OPUS-Dom : Applying the Folding-Based Method VECFOLD to Determine Protein Domain Boundaries. / Wu, Yinghao; Dousis, Athanasios D.; Chen, Mingzhi; Li, Jialin; Ma, Jianpeng.

In: Journal of Molecular Biology, Vol. 385, No. 4, 30.01.2009, p. 1314-1329.

Research output: Contribution to journalArticle

Wu, Yinghao ; Dousis, Athanasios D. ; Chen, Mingzhi ; Li, Jialin ; Ma, Jianpeng. / OPUS-Dom : Applying the Folding-Based Method VECFOLD to Determine Protein Domain Boundaries. In: Journal of Molecular Biology. 2009 ; Vol. 385, No. 4. pp. 1314-1329.
@article{25bf5166d6dc423088c89351c7b21046,
title = "OPUS-Dom: Applying the Folding-Based Method VECFOLD to Determine Protein Domain Boundaries",
abstract = "In this article, we present a de novo method for predicting protein domain boundaries, called OPUS-Dom. The core of the method is a novel coarse-grained folding method, VECFOLD, which constructs low-resolution structural models from a target sequence by folding a chain of vectors representing the predicted secondary-structure elements. OPUS-Dom generates a large ensemble of folded structure decoys by VECFOLD and labels the domain boundaries of each decoy by a domain parsing algorithm. Consensus domain boundaries are then derived from the statistical distribution of the putative boundaries and three empirical sequence-based domain profiles. OPUS-Dom generally outperformed several state-of-the-art domain prediction algorithms over various benchmark protein sets. Even though each VECFOLD-generated structure contains large errors, collectively these structures provide a more robust delineation of domain boundaries. The success of OPUS-Dom suggests that the arrangement of protein domains is more a consequence of limited coordination patterns per domain arising from tertiary packing of secondary-structure segments, rather than sequence-specific constraints.",
keywords = "chain skeleton, domain boundary, protein folding, statistical scores, structure prediction",
author = "Yinghao Wu and Dousis, {Athanasios D.} and Mingzhi Chen and Jialin Li and Jianpeng Ma",
year = "2009",
month = "1",
day = "30",
doi = "10.1016/j.jmb.2008.10.093",
language = "English (US)",
volume = "385",
pages = "1314--1329",
journal = "Journal of Molecular Biology",
issn = "0022-2836",
publisher = "Academic Press Inc.",
number = "4",

}

TY - JOUR

T1 - OPUS-Dom

T2 - Applying the Folding-Based Method VECFOLD to Determine Protein Domain Boundaries

AU - Wu, Yinghao

AU - Dousis, Athanasios D.

AU - Chen, Mingzhi

AU - Li, Jialin

AU - Ma, Jianpeng

PY - 2009/1/30

Y1 - 2009/1/30

N2 - In this article, we present a de novo method for predicting protein domain boundaries, called OPUS-Dom. The core of the method is a novel coarse-grained folding method, VECFOLD, which constructs low-resolution structural models from a target sequence by folding a chain of vectors representing the predicted secondary-structure elements. OPUS-Dom generates a large ensemble of folded structure decoys by VECFOLD and labels the domain boundaries of each decoy by a domain parsing algorithm. Consensus domain boundaries are then derived from the statistical distribution of the putative boundaries and three empirical sequence-based domain profiles. OPUS-Dom generally outperformed several state-of-the-art domain prediction algorithms over various benchmark protein sets. Even though each VECFOLD-generated structure contains large errors, collectively these structures provide a more robust delineation of domain boundaries. The success of OPUS-Dom suggests that the arrangement of protein domains is more a consequence of limited coordination patterns per domain arising from tertiary packing of secondary-structure segments, rather than sequence-specific constraints.

AB - In this article, we present a de novo method for predicting protein domain boundaries, called OPUS-Dom. The core of the method is a novel coarse-grained folding method, VECFOLD, which constructs low-resolution structural models from a target sequence by folding a chain of vectors representing the predicted secondary-structure elements. OPUS-Dom generates a large ensemble of folded structure decoys by VECFOLD and labels the domain boundaries of each decoy by a domain parsing algorithm. Consensus domain boundaries are then derived from the statistical distribution of the putative boundaries and three empirical sequence-based domain profiles. OPUS-Dom generally outperformed several state-of-the-art domain prediction algorithms over various benchmark protein sets. Even though each VECFOLD-generated structure contains large errors, collectively these structures provide a more robust delineation of domain boundaries. The success of OPUS-Dom suggests that the arrangement of protein domains is more a consequence of limited coordination patterns per domain arising from tertiary packing of secondary-structure segments, rather than sequence-specific constraints.

KW - chain skeleton

KW - domain boundary

KW - protein folding

KW - statistical scores

KW - structure prediction

UR - http://www.scopus.com/inward/record.url?scp=58149336791&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=58149336791&partnerID=8YFLogxK

U2 - 10.1016/j.jmb.2008.10.093

DO - 10.1016/j.jmb.2008.10.093

M3 - Article

VL - 385

SP - 1314

EP - 1329

JO - Journal of Molecular Biology

JF - Journal of Molecular Biology

SN - 0022-2836

IS - 4

ER -