Biometric testing should attempt to report unbiased, real-world system performance, especially when tested on limited databases. Though testing on a standard database, such as the Linguistic Data Consortiums's YOHO, allows comparison of speaker ver@cation systems, it is well known that certain procedures bias the results low. One such procedure concerns the use of cohort or reference speakers to pe'form veriJication, where the cohort speakers are removed as candidate impostors. A method of testing is proposed to remove this bias by modifying the cohort set for each false acceptance test. Results statistically differ for this mod$ed approach, which tries to "best" model the general population with a fixed random sample. Lastly, three techniques to bound the biometric perjbrmance, using both parametric and noli-parametric resampling is demonstrated.