“…A classic handling qualities experiment [50] showed that a few pilots evaluating for a longer period of time produced the same central tendency of the rating excursions as a larger group conducting shorter evaluations. What was lost with the larger group, however, was the quality, consistency, and meaningfulness of the pilot comment data.…”