The systematic underestimation of band gaps is one of the most fundamental challenges in semilocal density functional theory (DFT). In addition to hindering the application of DFT to predicting electronic properties, the band gap problem is intimately related to self-interaction and delocalization errors, which make the study of charge transfer mechanisms with DFT difficult. To expand the range of available tools for addressing the band gap problem, we design an approach for machine learning density functionals based on Gaussian processes to explicitly fit single-particle energy levels. We also introduce nonlocal features of the density matrix that are expressive enough to fit these single-particle levels. Combining these developments, we train a machine-learned functional for the exact exchange energy that predicts molecular energy gaps and reaction energies of a wide range of molecules in excellent agreement with reference hybrid DFT calculations. In addition, while being trained solely on molecular data, our model predicts reasonable formation energies of polarons in solids, showcasing its transferability and robustness. We discuss how this approach can be generalized to full exchange-correlation functionals, thus paving the way to the design of stateof-the-art functionals for the prediction of electronic properties of molecules and materials.