Utilization data are defined as the time series data consisting of time fractions of busy periods in fixed time intervals and are practically used to represent server conditions, such as CPU utilization. In general, it is more challenging to estimate the model parameters from the utilization data since we do not know the exact job arrival time and the service time from the utilization data. In this paper, we consider an approach to estimate the model parameters from the utilization data by assuming a few model assumptions. In particular, we suppose an M t /M /1/K queueing system whose job arrival follows a Non-homogeneous Poisson Process (NHPP) and propose a parameter estimation method for the NHPP approximately from the utilization data based on the maximum likelihood estimation (MLE) via the expectation maximization (EM) algorithm. In numerical experiments, we generate the simulated utilization data of an M t /M /1/K queueing system and investigate the effectiveness of our method. Also, we use the real CPU utilization data to exhibit the performance evaluation. INDEX TERMS Queueing systems, time intervals, utilization data, EM algorithm.
Malicious software, called malware, can perform harmful actions on computer systems, which may cause economic damage and information leakage. Therefore, malware classification is meaningful and required to prevent malware attacks. Application programming interface (API) call sequences are easily observed and are good choices as features for malware classification. However, one of the main issues is how to generate a suitable feature for the algorithms of classification to achieve a high classification accuracy. Different malware sample brings API call sequence with different lengths, and these lengths may reach millions, which may cause computation cost and time complexities. Recurrent neural networks (RNNs) is one of the most versatile approaches to process time series data, which can be used to API call-based Malware calssification. In this paper, we propose a malware classification model with RNN, especially the long short-term memory (LSTM) and the gated recurrent unit (GRU), to classify variants of malware by using long-sequences of API calls. In numerical experiments, a benchmark dataset is used to illustrate the proposed approach and validate its accuracy. The numerical results show that the proposed RNN model works well on the malware classification.
Nowadays, the management and analyses of 'big data' are becoming indispensable for numerous organizations all over the world. In many cases, multiple organizations want to perform data analyses on their combined databases. Skyline query is one of the popular operations for selecting representative objects from a large database, where any other object within the database does not dominate each of the representative objects, called 'skyline'. Like other data analytics operations, the multi-party skyline query can provide benefits to the participating organizations by retrieving the skyline objects from their combined databases. Such a multi-party skyline query demands the disclosure of individual parties' objects to others during the computation. But, owing to the data privacy and security concern of the present IT era, such disclosure of the individual parties' databases is strictly prohibited. Considering this issue, we are proposing a new framework for the privacy-preserving multi-party skyline query, exploiting additive homomorphic encryption along with data anonymization, perturbation, and randomization techniques. The underlying protocols within our proposed framework ensure that every participating party can identify its multi-party skyline objects without revealing the objects to others during the multi-party skyline query. The detailed privacy and security analyses show that the proposed framework can achieve the desired computation goal without privacy leakage. Besides, the performance evaluation through complexity analyses, extensive simulations, and comprehensive comparison also demonstrate the utility and the efficiency of the proposed framework.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.