TY - JOUR
T1 - EPIC-DB
T2 - A proteomics database for studying Apicomplexan organisms
AU - Madrid-Aliste, Carlos J.
AU - Dybas, Joseph M.
AU - Angeletti, Ruth Hogue
AU - Weiss, Louis M.
AU - Kim, Kami
AU - Simon, Istvan
AU - Fiser, Andras
N1 - Funding Information:
Financial support was provided by NIH-NIAID HHSN266200400054C. AF was supported by the HuBi (Hungarian Bioinformatics) project, in the framework of the European Community's "Structuring the European Research Area" programme.
PY - 2009/1/21
Y1 - 2009/1/21
N2 - Background: High throughput proteomics experiments are useful for analyzing the protein expression of an organism, identifying the correct gene structure of a genome, or locating possible post-translational modifications within proteins. High throughput methods necessitate publicly accessible and easily queried databases for efficiently and logically storing, displaying, and analyzing the large volume of data. Description: EPICDB is a publicly accessible, queryable, relational database that organizes and displays experimental, high throughput proteomics data for Toxoplasma gondii and Cryptosporidium parvum. Along with detailed information on mass spectrometry experiments, the database also provides antibody experimental results and analysis of functional annotations, comparative genomics, and aligned expressed sequence tag (EST) and genomic open reading frame (ORF) sequences. The database contains all available alternative gene datasets for each organism, which comprises a complete theoretical proteome for the respective organism, and all data is referenced to these sequences. The database is structured around clusters of protein sequences, which allows for the evaluation of redundancy, protein prediction discrepancies, and possible splice variants. The database can be expanded to include genomes of other organisms for which proteome-wide experimental data are available. Conclusion: EPICDB is a comprehensive database of genome-wide T. gondii and C. parvum proteomics data and incorporates many features that allow for the analysis of the entire proteomes and/or annotation of specific protein sequences. EPICDB is complementary to other genomics databases of these organisms by offering complete mass spectrometry analysis on a comprehensive set of all available protein sequences.
AB - Background: High throughput proteomics experiments are useful for analyzing the protein expression of an organism, identifying the correct gene structure of a genome, or locating possible post-translational modifications within proteins. High throughput methods necessitate publicly accessible and easily queried databases for efficiently and logically storing, displaying, and analyzing the large volume of data. Description: EPICDB is a publicly accessible, queryable, relational database that organizes and displays experimental, high throughput proteomics data for Toxoplasma gondii and Cryptosporidium parvum. Along with detailed information on mass spectrometry experiments, the database also provides antibody experimental results and analysis of functional annotations, comparative genomics, and aligned expressed sequence tag (EST) and genomic open reading frame (ORF) sequences. The database contains all available alternative gene datasets for each organism, which comprises a complete theoretical proteome for the respective organism, and all data is referenced to these sequences. The database is structured around clusters of protein sequences, which allows for the evaluation of redundancy, protein prediction discrepancies, and possible splice variants. The database can be expanded to include genomes of other organisms for which proteome-wide experimental data are available. Conclusion: EPICDB is a comprehensive database of genome-wide T. gondii and C. parvum proteomics data and incorporates many features that allow for the analysis of the entire proteomes and/or annotation of specific protein sequences. EPICDB is complementary to other genomics databases of these organisms by offering complete mass spectrometry analysis on a comprehensive set of all available protein sequences.
UR - http://www.scopus.com/inward/record.url?scp=61949346123&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=61949346123&partnerID=8YFLogxK
U2 - 10.1186/1471-2164-10-38
DO - 10.1186/1471-2164-10-38
M3 - Article
C2 - 19159464
AN - SCOPUS:61949346123
SN - 1471-2164
VL - 10
JO - BMC Genomics
JF - BMC Genomics
M1 - 38
ER -