TY - JOUR
T1 - PSI-2
T2 - Structural Genomics to Cover Protein Domain Family Space
AU - Dessailly, Benoît H.
AU - Nair, Rajesh
AU - Jaroszewski, Lukasz
AU - Fajardo, J. Eduardo
AU - Kouranov, Andrei
AU - Lee, David
AU - Fiser, Andras
AU - Godzik, Adam
AU - Rost, Burkhard
AU - Orengo, Christine
N1 - Funding Information:
This work was supported by a grant from the Protein Structure Initiative of the National Institute for General Medicine at the National Institutes of Health.
PY - 2009/6/10
Y1 - 2009/6/10
N2 - One major objective of structural genomics efforts, including the NIH-funded Protein Structure Initiative (PSI), has been to increase the structural coverage of protein sequence space. Here, we present the target selection strategy used during the second phase of PSI (PSI-2). This strategy, jointly devised by the bioinformatics groups associated with the PSI-2 large-scale production centers, targets representatives from large, structurally uncharacterized protein domain families, and from structurally uncharacterized subfamilies in very large and diverse families with incomplete structural coverage. These very large families are extremely diverse both structurally and functionally, and are highly overrepresented in known proteomes. On the basis of several metrics, we then discuss to what extent PSI-2, during its first 3 years, has increased the structural coverage of genomes, and contributed structural and functional novelty. Together, the results presented here suggest that PSI-2 is successfully meeting its objectives and provides useful insights into structural and functional space.
AB - One major objective of structural genomics efforts, including the NIH-funded Protein Structure Initiative (PSI), has been to increase the structural coverage of protein sequence space. Here, we present the target selection strategy used during the second phase of PSI (PSI-2). This strategy, jointly devised by the bioinformatics groups associated with the PSI-2 large-scale production centers, targets representatives from large, structurally uncharacterized protein domain families, and from structurally uncharacterized subfamilies in very large and diverse families with incomplete structural coverage. These very large families are extremely diverse both structurally and functionally, and are highly overrepresented in known proteomes. On the basis of several metrics, we then discuss to what extent PSI-2, during its first 3 years, has increased the structural coverage of genomes, and contributed structural and functional novelty. Together, the results presented here suggest that PSI-2 is successfully meeting its objectives and provides useful insights into structural and functional space.
KW - PROTEINS
UR - http://www.scopus.com/inward/record.url?scp=66349104770&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=66349104770&partnerID=8YFLogxK
U2 - 10.1016/j.str.2009.03.015
DO - 10.1016/j.str.2009.03.015
M3 - Article
C2 - 19523904
AN - SCOPUS:66349104770
SN - 0969-2126
VL - 17
SP - 869
EP - 881
JO - Structure
JF - Structure
IS - 6
ER -