A nonvolatile lookup table (NV-LUT) circuit, which is a key component of a field programmable gate array (FPGA), is proposed for an energy-efficient yet high-performance binarized convolutional neural net-work (BCNN) accelerator. Since the active load is distributed to each configuration memory cell, the effect of the parasitic components is greatly reduced. Moreover, the use of a wired-OR logic-circuit style makes it possible to perform a high-speed logic operation. The proposed 6-input NV-LUT circuit using an active-load-localized single-ended circuit style is designed using a 45-nm CMOS technology and the delay is reduced by 30% with only 13% of hardware overhead compared to those of a conventional circuit. It is also demonstrated that the proposed NV-LUT circuit exhibits variation resilience against three process corners. The use of the proposed NV-LUT circuit also makes it possible to reduce 47% of energy consumption of a BCNN accelerator for digit recognition.