This paper presents the development of a model for performance evaluation of physical exercises with a robotic trainer system, with the specific aim of distinguishing between consistent & correct performance and non-consistent &/or incorrect performance by considering differences between repetitions. The performance evaluation is based on joint angles data extracted from the system's camera. The initial exploration of clustering for pattern recognition in an unlabeled dataset yielded insufficient performance, but it did highlight the importance of scaling features by exercise type as a means to distinguish between performance types. Following transitioning from unsupervised to supervised learning, classification by logistic regression models was evaluated, with different subsets of feature groups and scaling conditions. An evaluation of scaling conditions revealed their importance, particularly for training on a single dataset. Two datasets were acquired and analyzed, i.e., training of older adults (original dataset) and training with young adults (dedicated data set for evaluation). Training on the original dataset and the additional dedicated data set improved accuracy, suggesting a contribution from one dataset to the classification of the other. Logistic regression models, chosen for their simplicity and efficiency, provided explainable results, aligning with the operational capabilities of our robotic trainer. The selected performance evaluation model was able to classify between the performance types with an accuracy of 0.96 on the older adults data set and 0.97 on the other data set. Future research to enhance human-robot interaction should be aimed at exploring real-time evaluation models and implementing feedback based on this performance evaluation methodology.