Satellite retrievals of columnar nitrogen dioxide (NO2) are essential for the characterization of nitrogen oxides (NOx) processes and impacts. The requirements of modeled a priori profiles present an outstanding bottleneck in operational satellite NO2 retrievals. In this work, we instead use neural network (NN) models trained from over 360,000 radiative transfer (RT) simulations to translate satellite radiances across 390-495 nm to total NO2 vertical column (NO2C). Despite the wide variability of the many input parameters in the RT simulations, only a small number of key variables were found essential to the accurate prediction of NO2C, including observing angles, surface reflectivity and altitude, and several key principal component scores of the radiances. In addition to the NO2C, the NN training and cross-validation experiments show that the wider retrieval window allows some information about the vertical distribution to be retrieved (e.g., extending the rightmost wavelength from 465 to 495 nm decreases the root-mean-square-error by 0.75%) under high-NO2C conditions. Applying to four months of TROPOMI data, the trained NN model shows strong ability to reproduce the NO2C observed by the ground-based Pandonia Global Network. The coefficient of determination (R2, 0.75) and normalized mean bias (NMB, -33%) are competitive with the level 2 operational TROPOMI product (R2=0.77, NMB=−29%) over clear (geometric cloud fraction<0.2) and polluted (NO2C≥7.5×1015 molecules/cm2) regions. The NN retrieval approach is ~12 times faster than predictions using high spatial resolution (~3 km) a priori profiles from chemical transport modeling, which is especially attractive to the handling of large volume satellite data.