TY - GEN
T1 - Network based subcellular localization prediction for multi-label proteins
AU - Mondal, Ananda Mohan
AU - Lin, Jhih Rong
AU - Hu, Jianjun
PY - 2011/12/1
Y1 - 2011/12/1
N2 - Many proteins are sorted to multiple subcellular localizations within the cell. However, computational prediction of multi-location proteins remains a challenging task. Here we applied a logistic regression and diffusion kernel based algorithm NetLoc for predicting multiplex proteins and explored its capability and limitations. Experiment shows that the overall and true success rates for physical protein-protein interaction network are 65% and 41% respectively, and for mixed PPI network these values are 88% and 75% respectively. Our study also showed that the performance of NetLoc in predicting protein localization is limited by the network characteristics such as ratio of the number of co-localized protein-protein interactions (coPPI) to the number of non-co-localized PPI (ncPPI) and the density of annotated coPPI in the network. For a given network with a specific number of proteins, NetLoc performance increases with higher coPPI/ncPPI ratio and higher density of annotated coPPI.
AB - Many proteins are sorted to multiple subcellular localizations within the cell. However, computational prediction of multi-location proteins remains a challenging task. Here we applied a logistic regression and diffusion kernel based algorithm NetLoc for predicting multiplex proteins and explored its capability and limitations. Experiment shows that the overall and true success rates for physical protein-protein interaction network are 65% and 41% respectively, and for mixed PPI network these values are 88% and 75% respectively. Our study also showed that the performance of NetLoc in predicting protein localization is limited by the network characteristics such as ratio of the number of co-localized protein-protein interactions (coPPI) to the number of non-co-localized PPI (ncPPI) and the density of annotated coPPI in the network. For a given network with a specific number of proteins, NetLoc performance increases with higher coPPI/ncPPI ratio and higher density of annotated coPPI.
KW - NetLoc
KW - bioinformatics
KW - data mining
KW - diffusion kernel
KW - multi-label proteins
KW - multiplex protein localization
KW - multiplex proteins
KW - network based protein localization
KW - protein localization
KW - protein subcellular localization
UR - http://www.scopus.com/inward/record.url?scp=84862929318&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84862929318&partnerID=8YFLogxK
U2 - 10.1109/BIBMW.2011.6112416
DO - 10.1109/BIBMW.2011.6112416
M3 - Conference contribution
AN - SCOPUS:84862929318
SN - 9781457716133
T3 - 2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops, BIBMW 2011
SP - 473
EP - 480
BT - 2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops, BIBMW 2011
T2 - 2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops, BIBMW 2011
Y2 - 12 November 2011 through 15 November 2011
ER -