In complex networks, identifying influential nodes is of great importance for its wide applications. Traditional centrality methods are usually directly based on topological structures of networks, and different centrality methods consider different structural characteristics related to the functional importance. However, in many scenarios, it always exists a complex and nonlinear relationship between the functional importance of a node and its various features including local location, global location, etc., which is hard to be described by one centrality. In order to solve this problem, this paper proposes a framework based on machine learning to measure the importance of nodes in the propagation scenario. This framework first constructs the feature vector of each node based on the existing centrality methods which can reflect nodes' different topological structures and the infection rate which is an important factor in the propagation scenarios, then labels each node based on the real propagation ability obtained from simulated propagation experiments based on SIR model, last uses seven machine learning algorithms to learn the complex relationship between the real propagation ability of each node and its various structural features. The experimental results in real-world networks show that the classification accuracy of the model based on machine learning is generally higher than that of the traditional centrality methods based on one certain topology. INDEX TERMS Complex networks, influential nodes, machine learning, centrality.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.