Structural genomics is the largest contributor of novel structural leverage

Rajesh Nair, Jinfeng Liu, Ta Tsen Soong, Thomas B. Acton, John K. Everett, Andrei Kouranov, Andras Fiser, Adam Godzik, Lukasz Jaroszewski, Christine Orengo, Gaetano T. Montelione, Burkhard Rost

Research output: Contribution to journal › Article

70 Citations (Scopus)

Abstract

The Protein Structure Initiative (PSI) at the US National Institutes of Health (NIH) is funding four large-scale centers for structural genomics (SG). These centers systematically target many large families without structural coverage, as well as very large families with inadequate structural coverage. Here, we report a few simple metrics that demonstrate how successfully these efforts optimize structural coverage: while PSI-2 (2005-now) contributed more than 8% of all structures deposited into the PDB, it contributed over 20% of all novel structures (i.e. structures for protein sequences with no structural representative in the PDB on the date of deposition). The structural coverage of the protein universe represented by today's UniProt (v12.8) has increased linearly from 1992 to 2008; structural genomics has contributed significantly to the maintenance of this growth rate. Success in increasing novel leverage (defined in Liu et al. in Nat Biotechnol 25:849-851, 2007) has resulted from systematic targeting of large families. PSI's per-structure contribution to novel leverage was over 4-fold higher than that of non-PSI structural biology efforts during the past 8 years. If the success of the PSI continues, it may take only another ∼15 years to cover most sequences in the current UniProt database.
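
The two percentages in the abstract can be turned into a rough enrichment figure. The sketch below is illustrative arithmetic only: the abstract gives lower bounds ("more than 8%", "over 20%"), so the point values 8 and 20 are assumptions, as is treating everything outside PSI-2 as a single "non-PSI" group. It counts novel structures, not novel leverage, so it is not expected to reproduce the 4-fold per-structure figure quoted above.

```python
# Illustrative arithmetic only. The inputs are assumed point values taken from
# the abstract's lower bounds, not reported measurements.

def enrichment(share_of_novel: float, share_of_all: float) -> float:
    """Ratio of a group's share of novel structures to its share of all deposits."""
    return share_of_novel / share_of_all

psi_share_all = 8.0      # assumed: PSI-2 share of all PDB deposits, in percent
psi_share_novel = 20.0   # assumed: PSI-2 share of novel structures, in percent

# Values above 1 mean the group is over-represented among novel structures.
psi_enrichment = enrichment(psi_share_novel, psi_share_all)

# Assumption: all remaining deposits are lumped together as "non-PSI".
other_enrichment = enrichment(100.0 - psi_share_novel, 100.0 - psi_share_all)

# How many times more likely a PSI-2 deposit is to be novel than a non-PSI one.
per_deposit_ratio = psi_enrichment / other_enrichment

print(f"PSI-2 enrichment among novel structures: {psi_enrichment:.2f}x")
print(f"Non-PSI enrichment among novel structures: {other_enrichment:.2f}x")
print(f"Novel structures per deposit, PSI-2 vs. non-PSI: {per_deposit_ratio:.2f}x")
```

Because both inputs are lower bounds, the resulting ratios are indicative rather than bounds in either direction.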

Original language: English (US)
Pages (from-to): 181-191
Number of pages: 11
Journal: Journal of Structural and Functional Genomics
Volume: 10
Issue number: 2
DOI: https://doi.org/10.1007/s10969-008-9055-6
State: Published - Apr 2009

Keywords

  • Evolution
  • Protein structure determination
  • Protein universe
  • Structural genomics

ASJC Scopus subject areas

  • Genetics
  • Structural Biology
  • Biochemistry

Cite this

Nair, R., Liu, J., Soong, T. T., Acton, T. B., Everett, J. K., Kouranov, A., ... Rost, B. (2009). Structural genomics is the largest contributor of novel structural leverage. Journal of Structural and Functional Genomics, 10(2), 181-191. https://doi.org/10.1007/s10969-008-9055-6

Structural genomics is the largest contributor of novel structural leverage. / Nair, Rajesh; Liu, Jinfeng; Soong, Ta Tsen; Acton, Thomas B.; Everett, John K.; Kouranov, Andrei; Fiser, Andras; Godzik, Adam; Jaroszewski, Lukasz; Orengo, Christine; Montelione, Gaetano T.; Rost, Burkhard.

In: Journal of Structural and Functional Genomics, Vol. 10, No. 2, 04.2009, p. 181-191.

Nair, R, Liu, J, Soong, TT, Acton, TB, Everett, JK, Kouranov, A, Fiser, A, Godzik, A, Jaroszewski, L, Orengo, C, Montelione, GT & Rost, B 2009, 'Structural genomics is the largest contributor of novel structural leverage', Journal of Structural and Functional Genomics, vol. 10, no. 2, pp. 181-191. https://doi.org/10.1007/s10969-008-9055-6

Nair, Rajesh ; Liu, Jinfeng ; Soong, Ta Tsen ; Acton, Thomas B. ; Everett, John K. ; Kouranov, Andrei ; Fiser, Andras ; Godzik, Adam ; Jaroszewski, Lukasz ; Orengo, Christine ; Montelione, Gaetano T. ; Rost, Burkhard. / Structural genomics is the largest contributor of novel structural leverage. In: Journal of Structural and Functional Genomics. 2009 ; Vol. 10, No. 2. pp. 181-191.

@article{a3752b914d034cabbba8fdf23a1c5ef1,
title = "Structural genomics is the largest contributor of novel structural leverage",
abstract = "The Protein Structural Initiative (PSI) at the US National Institutes of Health (NIH) is funding four large-scale centers for structural genomics (SG). These centers systematically target many large families without structural coverage, as well as very large families with inadequate structural coverage. Here, we report a few simple metrics that demonstrate how successfully these efforts optimize structural coverage: while the PSI-2 (2005-now) contributed more than 8{\%} of all structures deposited into the PDB, it contributed over 20{\%} of all novel structures (i.e. structures for protein sequences with no structural representative in the PDB on the date of deposition). The structural coverage of the protein universe represented by today's UniProt (v12.8) has increased linearly from 1992 to 2008; structural genomics has contributed significantly to the maintenance of this growth rate. Success in increasing novel leverage (defined in Liu et al. in Nat Biotechnol 25:849-851, 2007) has resulted from systematic targeting of large families. PSI's per structure contribution to novel leverage was over 4-fold higher than that for non-PSI structural biology efforts during the past 8 years. If the success of the PSI continues, it may just take another ∼15 years to cover most sequences in the current UniProt database.",
keywords = "Evolution, Protein structure determination, Protein universe, Structural genomics",
author = "Rajesh Nair and Jinfeng Liu and Soong, {Ta Tsen} and Acton, {Thomas B.} and Everett, {John K.} and Andrei Kouranov and Andras Fiser and Adam Godzik and Lukasz Jaroszewski and Christine Orengo and Montelione, {Gaetano T.} and Burkhard Rost",
year = "2009",
month = "4",
doi = "10.1007/s10969-008-9055-6",
language = "English (US)",
volume = "10",
pages = "181--191",
journal = "Journal of Structural and Functional Genomics",
issn = "1345-711X",
publisher = "Springer Netherlands",
number = "2",

}

TY - JOUR
T1 - Structural genomics is the largest contributor of novel structural leverage
AU - Nair, Rajesh
AU - Liu, Jinfeng
AU - Soong, Ta Tsen
AU - Acton, Thomas B.
AU - Everett, John K.
AU - Kouranov, Andrei
AU - Fiser, Andras
AU - Godzik, Adam
AU - Jaroszewski, Lukasz
AU - Orengo, Christine
AU - Montelione, Gaetano T.
AU - Rost, Burkhard
PY - 2009/4
Y1 - 2009/4
N2 - The Protein Structural Initiative (PSI) at the US National Institutes of Health (NIH) is funding four large-scale centers for structural genomics (SG). These centers systematically target many large families without structural coverage, as well as very large families with inadequate structural coverage. Here, we report a few simple metrics that demonstrate how successfully these efforts optimize structural coverage: while the PSI-2 (2005-now) contributed more than 8% of all structures deposited into the PDB, it contributed over 20% of all novel structures (i.e. structures for protein sequences with no structural representative in the PDB on the date of deposition). The structural coverage of the protein universe represented by today's UniProt (v12.8) has increased linearly from 1992 to 2008; structural genomics has contributed significantly to the maintenance of this growth rate. Success in increasing novel leverage (defined in Liu et al. in Nat Biotechnol 25:849-851, 2007) has resulted from systematic targeting of large families. PSI's per structure contribution to novel leverage was over 4-fold higher than that for non-PSI structural biology efforts during the past 8 years. If the success of the PSI continues, it may just take another ∼15 years to cover most sequences in the current UniProt database.
KW - Evolution
KW - Protein structure determination
KW - Protein universe
KW - Structural genomics
UR - http://www.scopus.com/inward/record.url?scp=62949239803&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=62949239803&partnerID=8YFLogxK
U2 - 10.1007/s10969-008-9055-6
DO - 10.1007/s10969-008-9055-6
M3 - Article
VL - 10
SP - 181
EP - 191
JO - Journal of Structural and Functional Genomics
JF - Journal of Structural and Functional Genomics
SN - 1345-711X
IS - 2
ER -