Editing Machine Learning/Kaggle Social Network Contest/Features
Jump to navigation
Jump to search
The edit can be undone. Please check the comparison below to verify that this is what you want to do, and then publish the changes below to finish undoing the edit.
Latest revision | Your text | ||
Line 3: | Line 3: | ||
== Possible Features == | == Possible Features == | ||
*nodeid | |||
*nodetofollowid | |||
* | *median path length | ||
* | *shortest distance from nodeid to nodetofollowid | ||
* | *inbound edges | ||
* | *outbound edges | ||
**reciprocation | *clustering coefficient | ||
*reciprocation probability (num of edges returned / num of outbound edges) | |||
The response variable is the probability that the nodeid to nodetofollowid edge will be created in the future | |||
From the Backstrom and Leskovec, for a node s and a potential target c | |||
* Network features | * Network features | ||
** unweighted random walk score | ** unweighted random walk score | ||
** Adamic-Adar score | ** Adamic-Adar score | ||
*** see [http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.108.1370&rep=rep1&type=pdf original paper] | *** see [http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.108.1370&rep=rep1&type=pdf original paper] | ||
** | ** number of common friends | ||
** indegrees and outdegrees of s | |||
*** the indegree is the number of edges coming into node s | |||
*** the outdegree is the number of edges leaving node s | |||
** | ** indegrees and outdegrees of c | ||
* | |||
** | |||
* | |||
** | |||
** | |||