Abstract. Controlled testing on standard tasks and within standard environments can provide meaningful performance comparisons between robots of heterogeneous design. But because field robots must perform practical tasks in unstructured, and therefore non-standard, environments, the benefits of this approach have barely begun to accrue for them. This work describes a desert trial of six student prototypes of astronaut-support robots using a set of standardized engineering tests developed by the US National Institute of Standards and Technology (NIST), along with three operational tests in natural Mars-like terrain. The results suggest that standards developed for emergency response robots are also applicable to the astronaut-support domain, yielding both useful insights into the differences in capabilities between robots and real design improvements. The exercise shows the value of combining repeatable engineering tests with task-specific application testing in the field.