Comparative protein structure modeling of genes and genomes

Marc A. Martí-Renom, Ashley C. Stuart, Andras Fiser, Roberto Sánchez, Francisco Melo, Andrej Šali

Research output: Contribution to journalArticle

2261 Citations (Scopus)

Abstract

Comparative modeling predicts the three-dimensional structure of a given protein sequence (target) based primarily on its alignment to one or more proteins of known structure (templates). The prediction process consists of fold assignment, target-template alignment, model building, and model evaluation. The number of protein sequences that can be modeled and the accuracy of the predictions are increasing steadily because of the growth in the number of known protein structures and because of the improvements in the modeling software. Further advances are necessary in recognizing weak sequence-structure similarities, aligning sequences with structures, modeling of rigid body shifts, distortions, loops and side chains, as well as detecting errors in a model. Despite these problems, it is currently possible to model with useful accuracy significant parts of approximately one third of all known protein sequences. The use of individual comparative models in biology is already rewarding and increasingly widespread. A major new challenge for comparative modeling is the integration of it with the torrents of data from genome sequencing projects as well as from functional and structural genomics. In particular, there is a need to develop an automated, rapid, robust, sensitive, and accurate comparative modeling pipeline applicable to whole genomes. Such large-scale modeling is likely to encourage new kinds of applications for the many resulting models, based on their large number and completeness at the level of the family, organism, or functional network.

Original languageEnglish (US)
Pages (from-to)291-325
Number of pages35
JournalAnnual Review of Biophysics and Biomolecular Structure
Volume29
DOIs
StatePublished - 2000
Externally publishedYes

Fingerprint

Genes
Genome
Proteins
Genomics
Software
Pipelines
Growth

Keywords

  • Alignment
  • Fold assignment
  • Fully automated modeling
  • Homology modeling
  • Model evaluation
  • Protein structure prediction
  • Structural genomics

ASJC Scopus subject areas

  • Biophysics
  • Structural Biology

Cite this

Comparative protein structure modeling of genes and genomes. / Martí-Renom, Marc A.; Stuart, Ashley C.; Fiser, Andras; Sánchez, Roberto; Melo, Francisco; Šali, Andrej.

In: Annual Review of Biophysics and Biomolecular Structure, Vol. 29, 2000, p. 291-325.

Research output: Contribution to journalArticle

Martí-Renom, Marc A. ; Stuart, Ashley C. ; Fiser, Andras ; Sánchez, Roberto ; Melo, Francisco ; Šali, Andrej. / Comparative protein structure modeling of genes and genomes. In: Annual Review of Biophysics and Biomolecular Structure. 2000 ; Vol. 29. pp. 291-325.
@article{73c2f9840e4d4ec19268d5885a561f6f,
title = "Comparative protein structure modeling of genes and genomes",
abstract = "Comparative modeling predicts the three-dimensional structure of a given protein sequence (target) based primarily on its alignment to one or more proteins of known structure (templates). The prediction process consists of fold assignment, target-template alignment, model building, and model evaluation. The number of protein sequences that can be modeled and the accuracy of the predictions are increasing steadily because of the growth in the number of known protein structures and because of the improvements in the modeling software. Further advances are necessary in recognizing weak sequence-structure similarities, aligning sequences with structures, modeling of rigid body shifts, distortions, loops and side chains, as well as detecting errors in a model. Despite these problems, it is currently possible to model with useful accuracy significant parts of approximately one third of all known protein sequences. The use of individual comparative models in biology is already rewarding and increasingly widespread. A major new challenge for comparative modeling is the integration of it with the torrents of data from genome sequencing projects as well as from functional and structural genomics. In particular, there is a need to develop an automated, rapid, robust, sensitive, and accurate comparative modeling pipeline applicable to whole genomes. Such large-scale modeling is likely to encourage new kinds of applications for the many resulting models, based on their large number and completeness at the level of the family, organism, or functional network.",
keywords = "Alignment, Fold assignment, Fully automated modeling, Homology modeling, Model evaluation, Protein structure prediction, Structural genomics",
author = "Mart{\'i}-Renom, {Marc A.} and Stuart, {Ashley C.} and Andras Fiser and Roberto S{\'a}nchez and Francisco Melo and Andrej Šali",
year = "2000",
doi = "10.1146/annurev.biophys.29.1.291",
language = "English (US)",
volume = "29",
pages = "291--325",
journal = "Annual Review of Biophysics",
issn = "1936-122X",
publisher = "Annual Reviews Inc.",

}

TY - JOUR

T1 - Comparative protein structure modeling of genes and genomes

AU - Martí-Renom, Marc A.

AU - Stuart, Ashley C.

AU - Fiser, Andras

AU - Sánchez, Roberto

AU - Melo, Francisco

AU - Šali, Andrej

PY - 2000

Y1 - 2000

N2 - Comparative modeling predicts the three-dimensional structure of a given protein sequence (target) based primarily on its alignment to one or more proteins of known structure (templates). The prediction process consists of fold assignment, target-template alignment, model building, and model evaluation. The number of protein sequences that can be modeled and the accuracy of the predictions are increasing steadily because of the growth in the number of known protein structures and because of the improvements in the modeling software. Further advances are necessary in recognizing weak sequence-structure similarities, aligning sequences with structures, modeling of rigid body shifts, distortions, loops and side chains, as well as detecting errors in a model. Despite these problems, it is currently possible to model with useful accuracy significant parts of approximately one third of all known protein sequences. The use of individual comparative models in biology is already rewarding and increasingly widespread. A major new challenge for comparative modeling is the integration of it with the torrents of data from genome sequencing projects as well as from functional and structural genomics. In particular, there is a need to develop an automated, rapid, robust, sensitive, and accurate comparative modeling pipeline applicable to whole genomes. Such large-scale modeling is likely to encourage new kinds of applications for the many resulting models, based on their large number and completeness at the level of the family, organism, or functional network.

AB - Comparative modeling predicts the three-dimensional structure of a given protein sequence (target) based primarily on its alignment to one or more proteins of known structure (templates). The prediction process consists of fold assignment, target-template alignment, model building, and model evaluation. The number of protein sequences that can be modeled and the accuracy of the predictions are increasing steadily because of the growth in the number of known protein structures and because of the improvements in the modeling software. Further advances are necessary in recognizing weak sequence-structure similarities, aligning sequences with structures, modeling of rigid body shifts, distortions, loops and side chains, as well as detecting errors in a model. Despite these problems, it is currently possible to model with useful accuracy significant parts of approximately one third of all known protein sequences. The use of individual comparative models in biology is already rewarding and increasingly widespread. A major new challenge for comparative modeling is the integration of it with the torrents of data from genome sequencing projects as well as from functional and structural genomics. In particular, there is a need to develop an automated, rapid, robust, sensitive, and accurate comparative modeling pipeline applicable to whole genomes. Such large-scale modeling is likely to encourage new kinds of applications for the many resulting models, based on their large number and completeness at the level of the family, organism, or functional network.

KW - Alignment

KW - Fold assignment

KW - Fully automated modeling

KW - Homology modeling

KW - Model evaluation

KW - Protein structure prediction

KW - Structural genomics

UR - http://www.scopus.com/inward/record.url?scp=0033873929&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0033873929&partnerID=8YFLogxK

U2 - 10.1146/annurev.biophys.29.1.291

DO - 10.1146/annurev.biophys.29.1.291

M3 - Article

C2 - 10940251

AN - SCOPUS:0033873929

VL - 29

SP - 291

EP - 325

JO - Annual Review of Biophysics

JF - Annual Review of Biophysics

SN - 1936-122X

ER -