Background: Dementia occurs through declining of cognitive abilities, and its early detection stands essential for effective preventive measures. However, mainstream diagnostic tests and screening tools, such as Cambridge Cognition Examination (CAMCOG) and Mini-Mental State Examination (MMSE), often fail to detect dementia accurately. Various graph-based or feature-dependent prediction and progression models have been proposed. Whenever these models exploit information in the patients’ electronic medical records, they represent promising options to identify the presence and severity of dementia more precisely. Methods: The methods presented in this paper aim to address two problems related to dementia: (a) Basic diagnosis: identifying the presence of dementia in individuals, and (b) Severity diagnosis: predicting the presence of dementia, as well as the severity of the disease. We formulate these two tasks as classification problems and address them using machine learning models based on random forests, analysing structured clinical data from an elderly population cohort. We perform a hybrid data curation method where a dementia expert is in the loop to ensure meaningful curation decisions. We then employ the machine learning algorithms that classify individual episodes into a specific dementia class. Decision trees are also used for enhancing the explainability of decisions made by prediction models, allowing medical experts to identify the most crucial patient features and their threshold values for the detection of dementia. Results: Our experiment results prove that clinical record features (such as age or comorbidities), along with baseline arithmetic or memory tests, can predict dementia and its severity with high accuracy. In specific, our prediction models have reached an average f1-score of 0.93 and 0.85 for problems (a) and (b), respectively. Moreover, the decision trees produced for the two issues empower the interpretability of the prediction models. Conclusions: This study proves that there can be an accurate estimation of the existence and severity of dementia disease by analysing various clinical features and cognitive tests from the past episodes of the elderly population. Moreover, a set of rules represent building blocks in the efficiently patient classification. Relevant clinical and screening test features (e.g., simple arithmetic or animal fluency task) represent precise predictors without calculating the scores of mainstream cognitive tests such as MMSE and CAMCOG. They identify not only meaningful features but also justifications of classifications. As a result, the predictive power of machine learning models over curated clinical data is demonstrated, paving the way for accurate diagnosis of dementia.