Hyperbolic equations admit discontinuities in the solution, so their discretization requires adequate and physically sound numerical schemes. Second-order finite volume schemes are a popular choice for hyperbolic problems due to their simplicity. Despite the numerous advantages of higher-order schemes in smooth regions, they fail at strong discontinuities. Adequate and reliable limiting of the reconstructed slopes is therefore crucial for the accurate and stable simulation of flow problems with discontinuities. Numerous limiters have been developed to handle this task. However, they are either too dissipative in smooth regions or require empirical parameters that are globally defined and test-case specific. This paper therefore aims to develop a new slope limiter based on deep learning and reinforcement learning techniques. The proposed limiter is built on several admissibility constraints: positivity of the solution and a relaxed discrete maximum principle. This approach yields a slope limiter that is independent of a manually specified global parameter while providing an optimal slope with respect to the defined admissibility constraints. Applying the new limiter to several well-known shock tube problems illustrates its broad applicability and the potential of reinforcement learning in numerics.
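To make the role of slope limiting and of the admissibility constraints concrete, the following minimal sketch contrasts an unlimited central slope with the classical minmod limiter on a one-dimensional grid of cell averages. It is not the learned limiter proposed here. The exact form of the constraints, the unit cell width, the function names, and the relaxation tolerance `eps` are all assumptions made for illustration only.

```python
def minmod(a, b):
    """Return the argument of smaller magnitude if the signs agree, else 0."""
    if a * b <= 0.0:
        return 0.0
    return a if abs(a) < abs(b) else b


def limited_slopes(u):
    """Minmod-limited slope per cell for cell averages u (unit cell width).

    Boundary cells keep a zero slope (first-order fallback).
    """
    slopes = [0.0] * len(u)
    for i in range(1, len(u) - 1):
        slopes[i] = minmod(u[i] - u[i - 1], u[i + 1] - u[i])
    return slopes


def central_slopes(u):
    """Unlimited central-difference slope per cell (zero at the boundaries)."""
    slopes = [0.0] * len(u)
    for i in range(1, len(u) - 1):
        slopes[i] = 0.5 * (u[i + 1] - u[i - 1])
    return slopes


def face_values(u, slopes):
    """(left, right) face values of the linear reconstruction in each cell."""
    return [(ui - 0.5 * s, ui + 0.5 * s) for ui, s in zip(u, slopes)]


def is_admissible(u, faces, eps=0.0):
    """Check positivity and a local discrete maximum principle on face values.

    eps is a hypothetical relaxation tolerance (assumption); eps = 0 enforces
    the strict bound given by the neighbouring cell averages.
    """
    n = len(u)
    for i, cell_faces in enumerate(faces):
        neighbours = u[max(i - 1, 0):min(i + 2, n)]
        lower, upper = min(neighbours) - eps, max(neighbours) + eps
        for v in cell_faces:
            if v <= 0.0 or v < lower or v > upper:
                return False
    return True


# Density-like cell averages: a smooth ramp followed by a jump.
u = [1.0, 0.875, 0.75, 0.625, 0.125, 0.125, 0.125]
limited = limited_slopes(u)
unlimited = central_slopes(u)

print(is_admissible(u, face_values(u, limited)))
print(is_admissible(u, face_values(u, unlimited)))
```

In this sketch the minmod slopes keep all face values positive and within the bounds set by the neighbouring cell averages, whereas the unlimited central slopes overshoot next to the jump. The minmod limiter achieves this by flattening the slope near extrema and discontinuities, which is the kind of dissipation a less restrictive limiter tries to avoid.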