In this paper, we propose a novel unsupervised learning approach for spatio-temporal pattern classification. We use a spike timing neural network with axonal conductance delays to learn the structure of spatio-temporal patterns from a small set of training samples and then use this network for classifying unseen patterns. The method also enables early classification of patterns, before they are completely observed. To transform a spatio-temporal pattern into a suitable form for the spiking network, we create a mapping process that transforms it into a spike train that can be used to train the network through spiketiming-dependent plasticity. Based on the trained network, we build models of the training samples as strings of characters in which each character represents a set of neurons that fire at a particular time step, in response to the pattern. For classification, we compute the longest common subsequence between the model strings corresponding to the input and training samples and choose the class of the training sample with highest similarity. This method is evaluated on a handwritten digits dataset with spatio-temporal information. We show that this method is robustly detecting the correct class early on. Comparison with one unsupervised and eight other supervised approaches shows that our proposed approach could compete or has better performance rather than all of them. Further analysis show that even for misclassified samples, our system can detect the correct class among the top three class labels.