The mine production scheduling problem (MPSP) has been studied since the 1960s, and remains an active area of computational research. In extending the concepts of the MPSP, the automated mine may now be regarded as a hierarchical intelligent agent in which the bottom layer consists of distributed robotic equipment, while strategic functionality occupies the higher layers. Here we present a disambiguation of artificial intelligence, machine learning, computational optimization, and automation within the mining context. Specifically, the Q-learning algorithm has been adapted to generate the initial solutions for a high-performing strategic mine planning algorithm, originally developed by Lamghari, Dimitrakopoulos and Ferland, based on the variable neighbourhood descent (VND) metaheuristic. The hierarchical intelligent agent is presented as an integrative conceptual platform, defining the interaction between our new Q-learning adaptation and Lamghari's VND, and potentially other hierarchically controlled components of an artificially intelligent mine, having various degrees of automation. Sample computations involving Q-learning and VND are presented.