PSI-2

Structural Genomics to Cover Protein Domain Family Space

Benoît H. Dessailly, Rajesh Nair, Lukasz Jaroszewski, Jorge E. Fajardo, Andrei Kouranov, David Lee, Andras Fiser, Adam Godzik, Burkhard Rost, Christine Orengo

Research output: Contribution to journalArticle

103 Citations (Scopus)

Abstract

One major objective of structural genomics efforts, including the NIH-funded Protein Structure Initiative (PSI), has been to increase the structural coverage of protein sequence space. Here, we present the target selection strategy used during the second phase of PSI (PSI-2). This strategy, jointly devised by the bioinformatics groups associated with the PSI-2 large-scale production centers, targets representatives from large, structurally uncharacterized protein domain families, and from structurally uncharacterized subfamilies in very large and diverse families with incomplete structural coverage. These very large families are extremely diverse both structurally and functionally, and are highly overrepresented in known proteomes. On the basis of several metrics, we then discuss to what extent PSI-2, during its first 3 years, has increased the structural coverage of genomes, and contributed structural and functional novelty. Together, the results presented here suggest that PSI-2 is successfully meeting its objectives and provides useful insights into structural and functional space.

Original languageEnglish (US)
Pages (from-to)869-881
Number of pages13
JournalStructure
Volume17
Issue number6
DOIs
StatePublished - Jun 10 2009

Fingerprint

Genomics
Proteins
Proteome
Protein Domains
Computational Biology
Genome

Keywords

  • PROTEINS

ASJC Scopus subject areas

  • Molecular Biology
  • Structural Biology

Cite this

Dessailly, B. H., Nair, R., Jaroszewski, L., Fajardo, J. E., Kouranov, A., Lee, D., ... Orengo, C. (2009). PSI-2: Structural Genomics to Cover Protein Domain Family Space. Structure, 17(6), 869-881. https://doi.org/10.1016/j.str.2009.03.015

PSI-2 : Structural Genomics to Cover Protein Domain Family Space. / Dessailly, Benoît H.; Nair, Rajesh; Jaroszewski, Lukasz; Fajardo, Jorge E.; Kouranov, Andrei; Lee, David; Fiser, Andras; Godzik, Adam; Rost, Burkhard; Orengo, Christine.

In: Structure, Vol. 17, No. 6, 10.06.2009, p. 869-881.

Research output: Contribution to journalArticle

Dessailly, BH, Nair, R, Jaroszewski, L, Fajardo, JE, Kouranov, A, Lee, D, Fiser, A, Godzik, A, Rost, B & Orengo, C 2009, 'PSI-2: Structural Genomics to Cover Protein Domain Family Space', Structure, vol. 17, no. 6, pp. 869-881. https://doi.org/10.1016/j.str.2009.03.015
Dessailly BH, Nair R, Jaroszewski L, Fajardo JE, Kouranov A, Lee D et al. PSI-2: Structural Genomics to Cover Protein Domain Family Space. Structure. 2009 Jun 10;17(6):869-881. https://doi.org/10.1016/j.str.2009.03.015
Dessailly, Benoît H. ; Nair, Rajesh ; Jaroszewski, Lukasz ; Fajardo, Jorge E. ; Kouranov, Andrei ; Lee, David ; Fiser, Andras ; Godzik, Adam ; Rost, Burkhard ; Orengo, Christine. / PSI-2 : Structural Genomics to Cover Protein Domain Family Space. In: Structure. 2009 ; Vol. 17, No. 6. pp. 869-881.
@article{2751bfd117d349f397a4aeaf61ae2cfa,
title = "PSI-2: Structural Genomics to Cover Protein Domain Family Space",
abstract = "One major objective of structural genomics efforts, including the NIH-funded Protein Structure Initiative (PSI), has been to increase the structural coverage of protein sequence space. Here, we present the target selection strategy used during the second phase of PSI (PSI-2). This strategy, jointly devised by the bioinformatics groups associated with the PSI-2 large-scale production centers, targets representatives from large, structurally uncharacterized protein domain families, and from structurally uncharacterized subfamilies in very large and diverse families with incomplete structural coverage. These very large families are extremely diverse both structurally and functionally, and are highly overrepresented in known proteomes. On the basis of several metrics, we then discuss to what extent PSI-2, during its first 3 years, has increased the structural coverage of genomes, and contributed structural and functional novelty. Together, the results presented here suggest that PSI-2 is successfully meeting its objectives and provides useful insights into structural and functional space.",
keywords = "PROTEINS",
author = "Dessailly, {Beno{\^i}t H.} and Rajesh Nair and Lukasz Jaroszewski and Fajardo, {Jorge E.} and Andrei Kouranov and David Lee and Andras Fiser and Adam Godzik and Burkhard Rost and Christine Orengo",
year = "2009",
month = "6",
day = "10",
doi = "10.1016/j.str.2009.03.015",
language = "English (US)",
volume = "17",
pages = "869--881",
journal = "Structure with Folding & design",
issn = "0969-2126",
publisher = "Cell Press",
number = "6",

}

TY - JOUR

T1 - PSI-2

T2 - Structural Genomics to Cover Protein Domain Family Space

AU - Dessailly, Benoît H.

AU - Nair, Rajesh

AU - Jaroszewski, Lukasz

AU - Fajardo, Jorge E.

AU - Kouranov, Andrei

AU - Lee, David

AU - Fiser, Andras

AU - Godzik, Adam

AU - Rost, Burkhard

AU - Orengo, Christine

PY - 2009/6/10

Y1 - 2009/6/10

N2 - One major objective of structural genomics efforts, including the NIH-funded Protein Structure Initiative (PSI), has been to increase the structural coverage of protein sequence space. Here, we present the target selection strategy used during the second phase of PSI (PSI-2). This strategy, jointly devised by the bioinformatics groups associated with the PSI-2 large-scale production centers, targets representatives from large, structurally uncharacterized protein domain families, and from structurally uncharacterized subfamilies in very large and diverse families with incomplete structural coverage. These very large families are extremely diverse both structurally and functionally, and are highly overrepresented in known proteomes. On the basis of several metrics, we then discuss to what extent PSI-2, during its first 3 years, has increased the structural coverage of genomes, and contributed structural and functional novelty. Together, the results presented here suggest that PSI-2 is successfully meeting its objectives and provides useful insights into structural and functional space.

AB - One major objective of structural genomics efforts, including the NIH-funded Protein Structure Initiative (PSI), has been to increase the structural coverage of protein sequence space. Here, we present the target selection strategy used during the second phase of PSI (PSI-2). This strategy, jointly devised by the bioinformatics groups associated with the PSI-2 large-scale production centers, targets representatives from large, structurally uncharacterized protein domain families, and from structurally uncharacterized subfamilies in very large and diverse families with incomplete structural coverage. These very large families are extremely diverse both structurally and functionally, and are highly overrepresented in known proteomes. On the basis of several metrics, we then discuss to what extent PSI-2, during its first 3 years, has increased the structural coverage of genomes, and contributed structural and functional novelty. Together, the results presented here suggest that PSI-2 is successfully meeting its objectives and provides useful insights into structural and functional space.

KW - PROTEINS

UR - http://www.scopus.com/inward/record.url?scp=66349104770&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=66349104770&partnerID=8YFLogxK

U2 - 10.1016/j.str.2009.03.015

DO - 10.1016/j.str.2009.03.015

M3 - Article

VL - 17

SP - 869

EP - 881

JO - Structure with Folding & design

JF - Structure with Folding & design

SN - 0969-2126

IS - 6

ER -