The iterative update of software leads to frequent continuous integration, so the testing in the continuous integration environment should also be fast and accurate. Reinforcement learning is often used in the research of continuous integration testing because of its sequential strategy and good robustness. Some existing methods use reinforcement learning to solve test case prioritization problem, which provides a good idea, but the experimental defect detection rates are relatively low. Therefore, based on the existing reinforcement learning framework, this article proposes a reward mechanism to provide additional rewards for newly emerging test cases in each integration cycle. Through experiments on three industrial datasets, it has been proven that this mechanism improves the defect detection rate, the recall rate of failed test cases, and the efficiency of test feedback in the testing process.