TY - GEN
T1 - Prioritization of schizophrenia risk genes by a network-regularized logistic regression method
AU - Zhang, Wen
AU - Lin, Jhin Rong
AU - Nogales-Cadenas, Rubén
AU - Zhang, Quanwei
AU - Cai, Ying
AU - Zhang, Zhengdong D.
N1 - Funding Information:
This work was supported by the NIH Pathway to Independence Award from National Library of Medicine (5R00LM009770-06) and the American Heart Association Grant-in-Aid (13GRNT16850016) to Z.D.Z.
Publisher Copyright:
© Springer International Publishing Switzerland 2016.
PY - 2016
Y1 - 2016
N2 - Schizophrenia (SCZ) is a severe mental disorder with a large genetic component. While recent large-scale microarray- and sequencing-based genome wide association studies have made significant progress toward finding SCZ risk variants and genes of subtle effect, the interactions among them were not considered in those studies. Using a protein-protein interaction network both in our regression model and to generate a SCZ gene subnetwork, we developed an analytical framework with Logit-Lapnet, the graphical Laplacian-regularized logistic regression, for whole exome sequencing (WES) data analysis to detect SCZ gene subnetworks. Using simulated data from sequencing-based association study, we compared the performances of Logit-Lapnet with other logistic regression (LR)-based models. We use Logit-Lapnet to prioritize genes according to their coefficients and select top-ranked genes as seeds to generate the gene sub-network that is associated to SCZ. The comparison demonstrated not only the applicability but also better performance of Logit-Lapnet to score disease risk genes using sequencing-based association data. We applied our method to SCZ whole exome sequencing data and selected top-ranked risk genes, the majority of which are either known SCZ genes or genes potentially associated with SCZ. We then used the seed genes to construct SCZ gene subnetworks. This result demonstrates that by ranking gene according to their disease contributions our method scores and thus prioritizes disease risk genes for further investigation. An implementation of our approach in MATLAB is freely available for download at: http://zdzlab.einstein.yu.edu/1/publications/ LapNet-MATLAB.zip.
AB - Schizophrenia (SCZ) is a severe mental disorder with a large genetic component. While recent large-scale microarray- and sequencing-based genome wide association studies have made significant progress toward finding SCZ risk variants and genes of subtle effect, the interactions among them were not considered in those studies. Using a protein-protein interaction network both in our regression model and to generate a SCZ gene subnetwork, we developed an analytical framework with Logit-Lapnet, the graphical Laplacian-regularized logistic regression, for whole exome sequencing (WES) data analysis to detect SCZ gene subnetworks. Using simulated data from sequencing-based association study, we compared the performances of Logit-Lapnet with other logistic regression (LR)-based models. We use Logit-Lapnet to prioritize genes according to their coefficients and select top-ranked genes as seeds to generate the gene sub-network that is associated to SCZ. The comparison demonstrated not only the applicability but also better performance of Logit-Lapnet to score disease risk genes using sequencing-based association data. We applied our method to SCZ whole exome sequencing data and selected top-ranked risk genes, the majority of which are either known SCZ genes or genes potentially associated with SCZ. We then used the seed genes to construct SCZ gene subnetworks. This result demonstrates that by ranking gene according to their disease contributions our method scores and thus prioritizes disease risk genes for further investigation. An implementation of our approach in MATLAB is freely available for download at: http://zdzlab.einstein.yu.edu/1/publications/ LapNet-MATLAB.zip.
UR - http://www.scopus.com/inward/record.url?scp=84973902145&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84973902145&partnerID=8YFLogxK
U2 - 10.1007/978-3-319-31744-1_39
DO - 10.1007/978-3-319-31744-1_39
M3 - Conference contribution
AN - SCOPUS:84973902145
SN - 9783319317434
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 434
EP - 445
BT - Bioinformatics and Biomedical Engineering - 4th International Conference, IWBBIO 2016, Proceedings
A2 - Ortuno, Francisco
A2 - Rojas, Ignacio
PB - Springer Verlag
T2 - 4th International Work-Conference on Bioinformatics and Biomedical Engineering, IWBBIO 2016
Y2 - 20 April 2016 through 22 April 2016
ER -