This paper considers an inexact primal-dual algorithm for semi-infinite programming (SIP) for which it provides general error bounds. To implement the dual variable update, we create a new prox function for nonnegative measures which turns out to be a generalization of the Kullback-Leibler divergence for probability distributions. We show that under suitable conditions on the error, this algorithm achieves an O(1/ √ K) rate of convergence in terms of the optimality gap and constraint violation. We then use our general error bounds to analyze the convergence and sample complexity of a specific primal-dual SIP algorithm based on Monte Carlo integration. Finally, we provide numerical experiments to demonstrate the performance of our algorithm. a certain computationally-cheap criterion. In [39], the earlier central cutting plane algorithm from [29] is extended to allow for nonlinear convex cuts.Randomized cutting plane algorithms have recently been developed for SIP in [6,7,14]. The idea is to input a probability distribution over the constraints, randomly sample a modest number of constraints, and then solve the resulting relaxed problem. Intuitively, as long as a sufficient number of samples of the constraints is drawn, the resulting randomized solution should violate only a small portion of the constraints and achieve near-optimality.Discretization methods: In the discretization approach, a sequence of relaxed problems with a finite number of constraints is solved according to a predefined or adaptively controlled grid generation scheme [46,49]. Discretization methods are generally computationally expensive. The convergence rate of the error between the solution of the SIP problem and the solution of the discretized program is investigated in [49].Local reduction methods: In the local reduction approach, an SIP problem is reduced to a problem with a finite number of constraints [21]. The reduced problem involves constraints which are defined only implicitly, and the resulting problem is solved via the Newton method which has good local convergence properties. However, local reduction methods require strong assumptions and are often conceptual.Dual methods: A wide class of SIP algorithms is based on directly solving the KKT conditions. In [28,33,34], the authors derive Wolfe's dual for an SIP and discuss numerical schemes for this problem. The KKT conditions often have some degree of smoothness, and so various Newton-type methods can be applied [45,30,40,44]. However, feasibility is not guaranteed under the all Newton-type methods. A new smoothing Newton-type method is proposed to overcome this drawback in [32].Applications: SIP is the basis of the approximate linear programming (ALP) approach for dynamic programming. In [16,3,13], the authors consider various schemes for solving ALPs. The idea of random constraint sampling in ALP is developed in [16,3]. In [31], an adaptive constraint sampling approach called 'ALP-Secant' is proposed which is based on solving a sequence of saddle-point problems. It is shown that A...