Standardized uptake values (SUVs) have been widely used in the diagnosis of malignant tumors and in clinical trials of tumor therapies as semiquantitative metrics of tumor 18 F-FDG uptake. However, SUVs for small lesions are liable to errors due to partialvolume effect and statistical noise. The purpose of this study was to evaluate the reproducibility and accuracy of maximum and peak SUV (SUV max and SUV peak , respectively) of small lesions in phantom experiments. Methods: We used a body phantom with 6 spheres in a quarter warm background. The PET data were acquired for 1,800 s in list-mode, from which data were extracted to generate 15 PET images for each of the 60-, 90-, 120-, 150-, and 180-s scanning times. The SUV max and SUV peak of the hot spheres in the 1,800-s scan were used as a reference (SUV ref,max and SUV ref,peak ). Coefficients of variation for both SUV max and SUV peak in hot spheres (CV max and CV peak ) were calculated to evaluate the variability of the SUVs. On the other hand, percentage differences between SUV max and SUV ref,max and between SUV peak and SUV ref,peak were calculated for evaluation of the accuracy of SUV. We additionally examined the coefficients of variation of background activity and the percentage background variability as parameters for the physical assessment of image quality. Results: Visibility of a 10-mm-diameter hot sphere was considerably different among scan frames. The CV max and CV peak increased as the sphere size became smaller and as the acquisition time became shorter. SUV max was generally overestimated as the scan time shortened and the sphere size increased. The SUV max and SUV peak of a 37-mm-diameter sphere for 60-s scans had average positive biases of 28.3% and 4.4%, compared with the reference. Conclusion: SUV max was variable and overestimated as the scan time decreased and the sphere size increased. In contrast, SUV peak was a more robust and accurate metric than SUV max . The measurements of SUV peak (or SUV peak normalized to lean body mass) in addition to SUV max are desirable for reproducible and accurate quantification in clinical situations.