Shallow cumulus clouds exhibit highly three-dimensional (3-D) spatial structure leading to complex variability in the surface solar irradiance (SSI) beneath. This variability is captured by the typically bimodal shape of the SSI probability density function (PDF). Using large eddy simulation to generate well-resolved cloud fields and Monte Carlo 3-D radiative transfer to reproduce realistic associated SSI PDFs, we seek direct relationships between the cloud field properties and the SSI PDF shape. Applying both random forest and artificial neural network algorithms, we find variations in the two modes of the SSI PDF are well predicted by just a handful of cloud field properties. The two algorithms utilize cloud properties similarly, with indistinguishable performance despite their different architectures. These results offer a marked improvement in realism relative to one-dimensional radiative transfer while bypassing computationally expensive 3-D radiative transfer, with immediate application to renewable energy assessments, and potential for several other geophysical applications. Plain Language Summary When sunlight is intercepted by broken clouds, it is not straightforward to calculate how much of the sunlight reaches the ground in the cloud shadows or the gaps between them. To account for this properly in atmospheric models typically requires tracking the exact path of sunlight through the Earth-Atmosphere system, which can be a cumbersome task. We present an alternative approach to calculate how frequently different amounts of sunlight reach the surface under broken cloud scenes. This new approach uses just a few key pieces of information about the clouds to make predictions. Compared with existing approaches, the predicted surface sunlight is substantially more realistic. Results are consistent across two different algorithms tested and arrive at predictions in similar ways. The combined accuracy and efficiency of the new approach has several applications.