“…In order to generate valid confidence intervals, sample sizes have to be estimated with statistical tools of sample size determination (Rosner, 2000) as we reported ; 5) Verifiable methods: Experimental methods are available to verify the data generated by the database mining ; and 6) A new working model/hypothesis: Through database mining, a new knowledge gap will be identified, and a new hypothesis will be proposed to test fewer, much more-focused genes in further experiments. The following sections will illustrate these principles in our own publications (Chen et al, 2010;Jan et al, 2010;Ng et al, 2004;Virtue, 2011;Yang et al, 2006a;Yang et al, 2006b;Yin et al, 2009). In our invited review, we pointed out that the identification and molecular characterization of self-antigens expressed by human malignancies, that are capable of elicitation of anti-tumor immune responses in patients, have been an active field in tumor immunology .…”