Galling wear in sheet metal stamping processes can degrade the product quality and adversely affect mass production. Studies have shown that acoustic emission sensors can be used to measure galling. In the literature, attempts have been made to correlate the acoustic emission features and galling wear in the sheet metal stamping process. However, there is very little attempt made to implement machine learning techniques to detect acoustic emission features that can classify non-galling and galling wear as well as provide additional wear-state information in the form of strong visualisations. In the rst part of the paper time domain and frequency domain analysis are used to determine the acoustic emission features that can be used for unsupervised classi cation. Due to galling wear progression on the stamping tools, the behaviour of acoustic emission waveform changes from stationary to a nonstationary state. The initial change in acoustic emission waveform behaviour due to galling wear initiation is very di cult to observe due to the ratio of change against the large data size of the waveform. Therefore, a time-frequency technique "Hilbert Huang Transform" is applied to the acoustic emission waveform as that is sensitive to change of wear state, and is used for the classi cation of 'non galling' and the 'transition of galling'. Also, the unsupervised learning algorithm fuzzy clustering is used as comparison against the supervised learning techniques. Despite not knowing a priori the wear state labels, fuzzy clustering is able to de ne three relatively accurate distinct classes: "unworn", "transition to galling", and "severe galling". In the second part of the paper, the acoustic emission features are used as an input to the supervised machine learning algorithms to classify acoustic emission features related to non-galling and galling wear. An accuracy of 96% was observed for the prediction of non-galling and galling wear using Classi cation, Regression Tree (CART) and Neural Network techniques. In the last part, a reduced Short Time Fourier Transform of top 10 absolute maximum component acoustic emission feature sets that correlates to wear measurement data "pro le depth" is used to train and test supervised Neural Network and CART algorithms. The algorithms predicted the pro le depth of 530 unseen parts (530 unseen cases), which did not have any associated labelled depth data. This shows the power of using machine learning techniques that can use a small data training set to provide additional predicted wear-state on a much larger data set. Furthermore, the machine learning techniques presented in this paper can be used further to develop a real-time measurement system to detect the transition of galling wear from measured acoustic emission features.