PET can be used to monitor response during chemotherapy and assess biologic target volumes for radiotherapy. Previous simulation studies have shown that the performance of various automatic or semiautomatic tumor delineation methods depends on image characteristics. The purpose of this study was to assess test-retest variability of tumor delineation methods, with emphasis on the effects of several image characteristics (e.g., resolution and contrast). Methods: Baseline test-retest data from 19 non-small cell lung cancer patients were obtained using 18 F-FDG (n 5 10) and 39-deoxy-39-18 F-fluorothymidine ( 18 F-FLT) (n 5 9). Images were reconstructed with varying spatial resolution and contrast. Six different types of tumor delineation methods, based on various thresholds or on a gradient, were applied to all datasets. Test-retest variability of metabolic volume and standardized uptake value (SUV) was determined. Results: For both tracers, size of metabolic volume and testretest variability of both metabolic volume and SUV were affected by the image characteristics and tumor delineation method used. The median volume test-retest variability ranged from 8.3% to 23% and from 7.4% to 29% for 18 F-FDG and 18 F-FLT, respectively. For all image characteristics studied, larger differences (#10-fold higher) were seen in test-retest variability of metabolic volume than in SUV. Conclusion: Test-retest variability of both metabolic volume and SUV varied with tumor delineation method, radiotracer, and image characteristics. The results indicate that a careful optimization of imaging and delineation method parameters is needed when metabolic volume is used, for example, as a response assessment parameter.