TY - GEN
T1 - Pseudo cold start link prediction with multiple sources in social networks
AU - Ge, Liang
AU - Zhang, Aidong
PY - 2012
Y1 - 2012
N2 - Link prediction is an important task in social networks and data mining for understanding the mechanisms by which the social networks form and evolve. In most link prediction researches, it is assumed either a snapshot of the social network or a social network with some missing links is available. Most existing researches therefore approach this problem by exploring the topological structure of the social network using only one source of information. However, in many application domains, in addition to the social network of interest, there are a number of auxiliary information available. In this work, we introduce the pseudo cold start link prediction with multiple sources as the problem of predicting the structure of a social network when only a small subgraph of the social network is known and multiple heterogeneous sources are available. We propose a two-phase supervised method: The first phase generates an efficient feature selec- Tion scheme to find the best feature from multiple sources that is used for predicting the structure in the social network. In the second phase, we propose a regularization method to control the risk of over-fitting induced by the first phase. We assess our method empirically over a large data collec- Tion obtained from Youtube. The extensive experimental evaluations confirm the effectiveness of our approach.
AB - Link prediction is an important task in social networks and data mining for understanding the mechanisms by which the social networks form and evolve. In most link prediction researches, it is assumed either a snapshot of the social network or a social network with some missing links is available. Most existing researches therefore approach this problem by exploring the topological structure of the social network using only one source of information. However, in many application domains, in addition to the social network of interest, there are a number of auxiliary information available. In this work, we introduce the pseudo cold start link prediction with multiple sources as the problem of predicting the structure of a social network when only a small subgraph of the social network is known and multiple heterogeneous sources are available. We propose a two-phase supervised method: The first phase generates an efficient feature selec- Tion scheme to find the best feature from multiple sources that is used for predicting the structure in the social network. In the second phase, we propose a regularization method to control the risk of over-fitting induced by the first phase. We assess our method empirically over a large data collec- Tion obtained from Youtube. The extensive experimental evaluations confirm the effectiveness of our approach.
UR - https://www.scopus.com/pages/publications/84880194028
U2 - 10.1137/1.9781611972825.66
DO - 10.1137/1.9781611972825.66
M3 - Conference contribution
AN - SCOPUS:84880194028
SN - 9781611972320
T3 - Proceedings of the 12th SIAM International Conference on Data Mining, SDM 2012
SP - 768
EP - 779
BT - Proceedings of the 12th SIAM International Conference on Data Mining, SDM 2012
PB - Society for Industrial and Applied Mathematics Publications
T2 - 12th SIAM International Conference on Data Mining, SDM 2012
Y2 - 26 April 2012 through 28 April 2012
ER -