This paper makes several important contributions to the literature about nonparametric instrumental variables (NPIV) estimation and inference on a structural function h 0 and functionals of h 0 . First, we derive sup-norm convergence rates for computationally simple sieve NPIV (series two-stage least squares) estimators of h 0 and its derivatives. Second, we derive a lower bound that describes the best possible (minimax) sup-norm rates of estimating h 0 and its derivatives, and show that the sieve NPIV estimator can attain the minimax rates when h 0 is approximated via a spline or wavelet sieve. Our optimal sup-norm rates surprisingly coincide with the optimal root-mean-squared rates for severely ill-posed problems, and are only a logarithmic factor slower than the optimal root-meansquared rates for mildly ill-posed problems. Third, we use our sup-norm rates to establish the uniform Gaussian process strong approximations and the score bootstrap uniform confidence bands (UCBs) for collections of nonlinear functionals of h 0 under primitive conditions, allowing for mildly and severely ill-posed problems. Fourth, as applications, we obtain the first asymptotic pointwise and uniform inference results for plug-in sieve t-statistics of exact consumer surplus (CS) and deadweight loss (DL) welfare functionals under low-level conditions when demand is estimated via sieve NPIV. Our real data application of UCBs for exact CS and DL functionals of gasoline demand reveals interesting patterns and is applicable to other goods markets. dence bands, nonlinear welfare functionals, nonparametric demand with endogeneity.Assumption CS(i)-(iv) are standard even for series LS regression without endo-be the sieve variance of the plug-in sieve NPIV estimator f CS ( h 0 ). Then these assumptions imply that [σ n (f CS )] 2 J j=1 (a j /μ j ) 2 Jμ −2 J . Assumption CS(v) is sufficient for Remark 4.1(b ) for a fixed t. Our first result is for exact CS functionals, established by applying Theorem D.1 in → d N(0 1)Since μ j > 0 decreases as j increases, we could use the relationAssumption U-CS(i) is slightly stronger than Assumption CS(iii) (since δ = 1 in Assumption U-CS(i) is enough). Assumption U-CS(ii) is made for simplicity to verify Assumption 6(i); other sufficient conditions could also be used. Assumption U-CS(iii) and (iv)(a) strengthen Assumption CS(v) to ensure uniform Gaussian process strong approximation with an error rate of r n = (log J) −1/2 . Again, one could use bounds on σ n that are analogous to relation (21) to provide sufficient conditions for Assumption U-CS(iii) and (iv) that could be satisfied by mildly and severely ill-posed NPIV models. See Remark 5.1 below for one concrete set of such sufficient conditions. Remark 5.1. Let σ 2 n J j=1 (j a μ −2 j ) for a ≤ 0.(i) Mildly ill-posed case. Let μ j j −ς/2 for ς ≥ 0 and a + ς > −1. Let J 5∨(4+ς−a) (log n) 3 /n = o(1) and nJ −(p+a+ς+1) (log J) = o(1). Then Assumption U-CS(iii) and (iv) hold.(ii) Severely ill-posed case. Let μ J exp(− 1 2 j ς/2 ), ς > 0. Let J = (log(n/(log ...