Big Data is a generic term that involves the storing and processing of a large amount of data. This large amount of data has been promoted by technologies such as mobile applications, Internet of Things (IoT), and Geographic Information Systems (GIS). An example of GIS is a Spatio-Temporal Database (STDB). A complex problem to address in terms of processing time is pattern searching on STDB. Nowadays, high information processing capacity is available everywhere. Nevertheless, the pattern searching problem on STDB using traditional Data Mining techniques is complex because the data incorporate the temporal aspect. Traditional techniques of pattern searching, such as time series, do not incorporate the spatial aspect. For this reason, traditional algorithms based on association rules must be adapted to find these patterns. Most of the algorithms take exponential processing times. In this paper, a new efficient algorithm (named Minus-F1) to look for periodic patterns on STDB is presented. Our algorithm is compared with Apriori, Max-Subpattern, and PPA algorithms on synthetic and real STDB. Additionally, the computational complexities for each algorithm in the worst cases are presented. Empirical results show that Minus-F1 is not only more efficient than Apriori, Max-Subpattern, and PAA, but also it presents a polynomial behavior.
In this paper we outline important differences between (1) protein interaction networks and (2) social and other complex networks, in terms of fine-grained network community profiles. While these families of networks present some general similarities, they also have some stark differences in the way the communities are formed. Namely, we find that the sizes of the best communities in such biological networks are an order of magnitude smaller than in social and other complex networks. We furthermore find that the generative model describing biological networks is very different from the model describing social networks. While for latter the Forest-Fire model best approximates their network community profile, for biological networks it is a random rewiring model that generates networks with the observed profiles. Our study suggests that these families of networks should be treated differently when deriving results from network analysis, and a fine-grained analysis is needed to better understand their structure.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.