TY - JOUR
T1 - Link prediction methods and their accuracy for different social networks and network metrics
AU - Gao, Fei
AU - Musial, Katarzyna
AU - Cooper, Colin
AU - Tsoka, Sophia
PY - 2015
Y1 - 2015
N2 - Currently, we are experiencing a rapid growth of the number of social-based online systems. The availability of the vast amounts of data gathered in those systems brings new challenges that we face when trying to analyse it. One of the intensively researched topics is the prediction of social connections between users. Although a lot of effort has been made to develop new prediction approaches, the existing methods are not comprehensively analysed. In this paper we investigate the correlation between network metrics and accuracy of different prediction methods.We selected six time-stamped real-world social networks and ten most widely used link prediction methods. The results of the experiments show that the performance of some methods has a strong correlation with certain network metrics. We managed to distinguish "prediction friendly" networks, for which most of the prediction methods give good performance, as well as "prediction unfriendly" networks, for which most of the methods result in high prediction error. Correlation analysis between networkmetrics and prediction accuracy of prediction methodsmay formthe basis of ametalearning system where based on network characteristics it will be able to recommend the right prediction method for a given network.
AB - Currently, we are experiencing a rapid growth of the number of social-based online systems. The availability of the vast amounts of data gathered in those systems brings new challenges that we face when trying to analyse it. One of the intensively researched topics is the prediction of social connections between users. Although a lot of effort has been made to develop new prediction approaches, the existing methods are not comprehensively analysed. In this paper we investigate the correlation between network metrics and accuracy of different prediction methods.We selected six time-stamped real-world social networks and ten most widely used link prediction methods. The results of the experiments show that the performance of some methods has a strong correlation with certain network metrics. We managed to distinguish "prediction friendly" networks, for which most of the prediction methods give good performance, as well as "prediction unfriendly" networks, for which most of the methods result in high prediction error. Correlation analysis between networkmetrics and prediction accuracy of prediction methodsmay formthe basis of ametalearning system where based on network characteristics it will be able to recommend the right prediction method for a given network.
UR - http://www.scopus.com/inward/record.url?scp=84934326301&partnerID=8YFLogxK
U2 - 10.1155/2015/172879
DO - 10.1155/2015/172879
M3 - Article
AN - SCOPUS:84934326301
SN - 1058-9244
VL - 2015
JO - Scientific Programming
JF - Scientific Programming
M1 - 172879
ER -