Data sets and source code for Influential Users in Social Networks.

Flickr's dataset

The Flickr dataset is downloaded from one of the groups in Flickr that includes users, photos and interactions. A total of 30758 users have participated in the group where some of them can be popular by posting photos, while others only interact with other users. 1559 users have posted photos uploading 4991 uploaded photos. There are 46059 interactions between users by commenting on or favoring uploaded photos. The users IDs have been anonymized. The source code along with the read me file are also atatched below. It is written in java.



Digg's dataset

For the digg data set. Please visit the following website and use The votes data set. http://www.isi.edu/~lerman/downloads/digg2009.html

Please note that. You need to format the data file and generte two text files to be able to work in our source code. The first one is in following format: The first raw should include the total number of nodes in the network. Followed by the nodes and then the edges. While the second file contains the users who uploaded posts and how many posts they upload. To make it easy, we have created these data files.

If you use the source code, Flickr's dataset or both please cite us using: Almgren, K., Lee, J. (2015). Who Influences Whom: Content-based Approach for Predicting Influential Users in Social Networks:International Conference on Advances in Big Data Analytics (ABDA), (pp. 89-99), Las Vegas, USA


Last updated: 08/19/2015