Abstract: In many data mining applications, a clustering algorithm is applied to a large amount of uncertain data. In this paper, we adapt an uncertain data clustering algorithm called fast density-based spatial clustering of applications with noise (FDBSCAN) to multicore systems in order to speed up processing. The new algorithm, which we call multicore FDBSCAN (M-FDBSCAN), splits the data domain into c rectangular regions, where c is the number of cores in the system. The FDBSCAN algorithm is then applied to all rectangular regions simultaneously. After the clustering operation is completed, semiclusters that arise from the splitting are detected and merged to construct the final clusters. M-FDBSCAN is tested for correctness and performance. The experiments show that M-FDBSCAN yields a significant performance increase that is not attributable to multicore usage alone.
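The splitting step described above can be illustrated with a minimal sketch. The abstract does not say how the c rectangular regions are chosen, so this example assumes equal-width vertical strips over the x-range of two-dimensional points; the function name `split_into_strips` is hypothetical and is not taken from the paper. The per-region clustering and the merging of semiclusters are not shown.

```python
from typing import List, Tuple

Point = Tuple[float, float]


def split_into_strips(points: List[Point], c: int) -> List[List[Point]]:
    """Partition points into c equal-width vertical strips (an assumed layout).

    Each strip is a rectangular region of the data domain. In a multicore
    setting, each returned list would be handed to one worker for clustering.
    """
    if c < 1:
        raise ValueError("c must be at least 1")
    regions: List[List[Point]] = [[] for _ in range(c)]
    if not points:
        return regions

    x_min = min(p[0] for p in points)
    x_max = max(p[0] for p in points)
    width = (x_max - x_min) / c

    for p in points:
        if width == 0:
            # All points share the same x coordinate; use the first strip.
            idx = 0
        else:
            # Clamp so that the point at x_max falls into the last strip.
            idx = min(int((p[0] - x_min) / width), c - 1)
        regions[idx].append(p)
    return regions


points = [(0.0, 0.0), (1.0, 5.0), (2.0, 1.0), (3.0, 3.0), (4.0, 0.0)]
regions = split_into_strips(points, c=2)
```

Points near a strip boundary may belong to a cluster that spans two regions, which is why a merge step for semiclusters is needed after the parallel clustering.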
The "Database as a service" paradigm has gained a lot of interest in recent years. This has raised questions about the security of data stored on the servers. Firms outsourcing their XML databases to untrusted parties have started to look for new ways to store data securely and query it efficiently. In this paper, encrypted XML documents, their crypto index structures, and query processing using these structures are investigated. A comparison of various algorithms in the literature is given.
Moving object databases (MOD) are being used in a wide range of location-based services that are of growing interest in many application areas. In the literature, several query types such as nearest neighbor, reverse nearest neighbor, k-nearest neighbor, and proximity queries have been considered in MOD. In this paper, we propose a novel operator called the assignment operator as a query type for MOD. The assignment operator is used in a query to solve the assignment problem (also known as the weighted bipartite graph-matching problem). It finds a perfect matching between two sets of objects that minimizes the total cost. For instance, a set of moving objects such as taxi cabs is assigned to a set of customers in a manner that minimizes the total travel cost for the taxis. A possible implementation of the assignment operator in MOD and its performance evaluation are given.
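The assignment problem in the taxi example can be illustrated with a small sketch. This is not the implementation from the paper: it uses exhaustive search over all matchings, which is only practical for very small inputs, and it assumes straight-line (Euclidean) distance as the travel cost. The function name `assign` and the coordinates are hypothetical.

```python
from itertools import permutations
from math import dist, inf
from typing import List, Tuple

Point = Tuple[float, float]


def assign(taxis: List[Point], customers: List[Point]) -> Tuple[List[int], float]:
    """Return a minimum-cost perfect matching of taxis to customers.

    The result is (matching, total_cost), where matching[i] is the index of
    the customer served by taxi i. Brute force: tries every permutation.
    """
    if len(taxis) != len(customers):
        raise ValueError("a perfect matching needs equally sized sets")

    best_matching: List[int] = []
    best_cost = inf
    for perm in permutations(range(len(customers))):
        # Total travel cost if taxi i drives to customer perm[i].
        cost = sum(dist(taxis[i], customers[j]) for i, j in enumerate(perm))
        if cost < best_cost:
            best_cost = cost
            best_matching = list(perm)
    return best_matching, best_cost


taxis = [(0.0, 0.0), (10.0, 0.0)]
customers = [(9.0, 0.0), (1.0, 0.0)]
matching, total = assign(taxis, customers)
# matching == [1, 0]: taxi 0 serves customer 1, taxi 1 serves customer 0
```

For realistic input sizes, a polynomial-time method such as the Hungarian algorithm would replace the exhaustive search; the brute-force version is used here only to make the objective explicit.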