Soil organic carbon (SOC) is a crucial factor influencing soil quality and fertility. In this particular investigation, we aimed to explore the possibility of using diffuse reflectance infrared fourier transform spectroscopy (DRIFT-FTIR) in conjunction with machine-learning models, such as partial least squares regression (PLSR), artificial neural networks (ANN), support vector regression (SVR) and random forest (RF), to estimate SOC in Sohag, Egypt. To achieve this, we collected a total of ninety surface soil samples from various locations in Sohag and estimated the total organic carbon content using both the Walkley-Black method and DRIFT-FTIR spectroscopy. Subsequently, we used the spectral data to develop regression models using PLSR, ANN, SVR, and RF. To evaluate the performance of these models, we used several evaluation parameters, including root mean square error (RMSE), coefficient of determination (R2), and ratio of performance deviation (RPD). Our survey results revealed that the PLSR model had the most favorable performance, yielding an R2 value of 0.82 and an RMSE of 0.006%. In contrast, the ANN, SVR, and RF models demonstrated moderate to poor performance, with R2 values of 0.53, 0.27, and 0.18, respectively. Overall, our study highlights the potential of combining DRIFT-FTIR spectroscopy with multivariate analysis techniques to predict SOC in Sohag, Egypt. However, additional studies and research are needed to improve the accuracy or predictability of machine-learning models incorporated into DRIFT-FTIR analysis and to compare DRIFT-FTIR analysis techniques with conventional soil chemical measurements.