Early detection of within-field yield variability for high-value commodity crops, such as cotton (Gossypium spp.), offers growers potential to improve decision-making, optimize yields, and increase profits. Over recent years, publicly available datasets have become increasingly available and at a resolution where within-field yield prediction is possible. However, the viability of using these datasets with machine learning to predict within-field cotton lint yield at key growth stages are largely unknown. This study was conducted on two cotton fields, located near Mungindi, New South Wales, Australia. Three years of yield data, soil, elevation, rainfall, and Landsat imagery were collected from each field. A total of 12 models were created using: (a) two machine learning algorithms: random forest (RF) and gradient boosting machines (GBM); (b) three growth stages: squaring, flowering, and boll-fill; and(c) two different amounts of variables: all variables and the optimal variables determined by a recursive feature elimination (RFE). Results showed a strong agreement between predicted and observed yields at flowering and boll-fill when more information was available. At flowering and boll-fill, root mean square error (RMSE) ranged between 0.15 and 0.20 t ha −1 and Lin's concordance correlation coefficient (LCCC) ranged between 0.50 and 0.66, with RF providing superior results in most cases. Models created using the optimal variables determined by the RFE provided similar results compared to using all variables, allowing greater model accuracy and resolution with targeted sampling. Overall, these findings indicate significant potential of publicly available datasets to predict within-field cotton yield and guide decisionmaking in-season.