Deep learning is a powerful tool for image classification. However, training deep learning models often requires large datasets that are not always available, especially in biological research. Here we leverage a small dataset to train an accurate classifier by employing various data augmentation regimes and Bayesian optimisation. We first generate a fine-grained morphological sub-stage profile of the Hamburger-Hamilton developmental stage 10 chick embryo, finding an asynchrony between somite number and brain morphology. We then leverage our domain knowledge to implement data-driven pre-processing steps, before training a deep convolutional neural network to classify our sub-stages. We find that augmenting our images with a combination of rotation, blur, shear, and cutout transformations is effective in training a high classification accuracy network on a small microscopy dataset. Ensuring reliability of our classifier with saliency analysis, we gain insight into the efficacy of different data augmentation techniques, and identify class-specific features. In summary, we apply domain expertise to design effective data augmentation regimes for training a neural network based image classifier on a small, specialised dataset, classifying the posited developmental sub-stages with up to 90.9% accuracy.