Limestone calcined clay cement (LC3) is a sustainable alternative to ordinary Portland cement, capable of reducing the binder’s carbon footprint by 40% while satisfying all key performance metrics. The inherent compositional heterogeneity in select components of LC3, combined with their convoluted chemical interactions, poses challenges to conventional analytical models when predicting mechanical properties. Although some studies have employed machine learning (ML) to predict the mechanical properties of LC3, many have overlooked the pivotal role of feature selection. Proper feature selection not only refines and simplifies the structure of ML models but also enhances these models’ prediction performance and interpretability. This research harnesses the power of the random forest (RF) model to predict the compressive strength of LC3. Three feature reduction methods—Pearson correlation, SHapley Additive exPlanations, and variable importance—are employed to analyze the influence of LC3 components and mixture design on compressive strength. Practical guidelines for utilizing these methods on cementitious materials are elucidated. Through the rigorous screening of insignificant variables from the database, the RF model conserves computational resources while also producing high-fidelity predictions. Additionally, a feature enhancement method is utilized, consolidating numerous input variables into a singular feature while feeding the RF model with richer information, resulting in a substantial improvement in prediction accuracy. Overall, this study provides a novel pathway to apply ML to LC3, emphasizing the need to tailor ML models to cement chemistry rather than employing them generically.