Stock price manipulation uses illegitimate means to artificially influence market prices of several stocks. It causes massive losses and undermines investors' confidence and the integrity of the stock market. Several existing research works focused on detecting a specific manipulation scheme using supervised learning but lacks the adaptive capability to capture different manipulative strategies. This begets the assumption of model parameter values specific to the underlying manipulation scheme. In addition, supervised learning requires the use of labelled data which is difficult to acquire due to confidentiality and the proprietary nature of trading data. The proposed research establishes a detection model based on unsupervised learning using Kernel Principal Component Analysis (KPCA) and applied increased variance of selected latent features in higher dimensions. A proposed Multidimensional Kernel Density Estimation (MKDE) clustering is then applied upon the selected components to identify abnormal patterns of manipulation in data. This research has an advantage over the existing methods in overcoming the ambiguity of assuming values of several parameters, reducing the high dimensions obtained from conventional KPCA and thereby reducing computational complexity. The robustness of the detection model has also been evaluated when two or more manipulative activities occur within a short duration of each other and by varying the window length of the dataset fed to the model. Validation on multiple datasets and a comprehensive assessment of the model performance has been conducted without providing any prior information about the location of the manipulation. The results show a significant performance enhancement in terms of the Fmeasure values and a significant reduction in false alarm rate (FAR) has been achieved.
Stock price manipulation, a major problem in capital markets surveillance, uses illegitimate means to influence the price of traded stocks in order to reap illicit profit. Most of the existing attempts to detect such manipulations have either relied upon annotated trading data, using supervised methods, or have been restricted to detecting a specific manipulation scheme. There have been a few unsupervised algorithms focusing on general detection yet none of them explored the innate affinity among the stock trades, be it normal or manipulative. This paper proposes a fully unsupervised model based on the idea of learning the relationship among stock prices in the form of an affinity matrix. The proposed affinity matrix based features are used to train an under-fitting autoencoder in order to learn an efficient representation of the normal stock prices. A kernel density estimate of the normal trading data is used as the reconstruction error of the autoencoder. During the detection phase, the normal dataset has been injected with synthetic manipulative trades. A kernel density estimation based clustering technique is then used to detect manipulative trades based on their autoencoder representation. The proposed approach is validated on benchmark stock price data from the LOBSTER project and the obtained results show dramatic improvements in the detection performance over existing price manipulation detection techniques.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.