“…Mixture models have a long history of being used to detect anomalies [30,12,31,15,14] for both numerical and categorical data [2]. Various techniques have been used to determine the number of components for a mixture model such as greedy EM-learning [32], minimal message length [20], stacking [22], holdout techniques [33], and assumptions on the components distribution [22].…”