Energy fraud is a critical economical burden for electric power orga-nizations in Brazil. In this paper we present the application of novel MachineLearning algorithms to boost efficiency in detection of energy frauds. More-over, we also propose a generalized and unsupervised model for fraud detectionbased on consumption anomalies.
Detecting the early stages of failures is an old concern of petroleum industry. In order to tackle this problem, a novel sensor analysis methodology is proposed. The assessment of production sensors' behavior, individually or in a group, leads to a better understanding of failure modes during oil and gas production. Thus, Principal Components Analysis and Logistic Regression are incorporated as multivariate statistical modeling for studying the impact of different anomalies in production sensors. Therefore, a deep statistical analysis of these sensors can strengthen assumptions for supporting the modeling process of early fault detection systems. Based on a reliable public data set containing data from real wells, the application of the PCA approach combined with a Logistic Regression resulted in better visualization and understanding of some failures that occurred during petroleum production, such as the abrupt increase in BSW (Basic sediment and water), spurious closure of DHSV (Down hole Safety Valve), severe slugging, flow instability, productivity loss, quick restriction in PCK (production choke), scaling in PCK and hydrate formation in production lines. The two statistical approaches were used as a combined method to provide useful information regarding the failure modes in the dataset. Also, the dataset presented two classes that are important for anomaly detection in oil wells: "normal" and "abnormal", which allow detecting when production is outside its normal condition. Then, using the production sensors analysis with failure data can help to formulate better detection algorithms. By using PCA and Logistic Regression it was possible to identify which set of variables is better for detecting a specific type of problem. The application of these techniques boosts the modeling of early detection systems in oil and gas production. Besides, the assumptions led to conclusions about how to put groups of sensors and abnormalities together and how much time a well stands in a steady normal condition. Other conclusions showed the significance of transient information for fault detection modeling and the need for individual wells analyses. Hence, using PCA for treating and transforming the data brings important contributions for early fault detection modeling, once it allowed insight into how sensors and abnormal events can be related. Consequentially, the present paper has significant novelty contribution: it raises important assumptions that help to build solid knowledge about the anomalies behavior and help researchers to implement a better modeling strategy.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.