Aim: A high level of directly measured small dense low-density lipoprotein cholesterol (sdLDL-C) is a strong risk factor for atherosclerotic cardiovascular disease. A method for estimating sdLDL-C by using Sampson's equation that includes levels of total cholesterol, high-density lipoprotein cholesterol (HDL-C), non-HDL-C and triglycerides (TG) has recently been proposed. We investigated the validation and exploration of estimated sdLDL-C level. Methods: The associations between measured and estimated sdLDL-C levels were investigated in 605 Japanese subjects (men/women: 280/325; mean age: 65±15 years) who received annual health check-ups in the Tanno-Sobetsu Study, a population-based cohort. Results: Estimated sdLDL-C level was highly correlated with measured sdLDL-C level in all subjects (R 2 =0.701), nondiabetic subjects without any medication (n=254, R 2 =0.686) and subjects with diabetes mellitus (n=128, R 2 =0.721). Multivariable regression analysis showed that levels of non-HDL-C, TG and γ-glutamyl transpeptidase (γGTP) were independent predictors of measured sdLDL-C level. In a stratification of the LDL window, all of the subjects with a combination of high non-HDL-C (≥ 170 mg/dL) and high TG (≥ 150 mg/dL) had high levels of measured and estimated sdLDL-C (≥ 35 mg/dL). Furthermore, machine learning-based estimation of sdLDL-C level by artificial intelligence software, Prediction One, was substantially improved by using components of Sampson's equation (R 2 =0.803) and by using those components with the addition of γGTP and deletion of TC (R 2 =0.929). Conclusions: sdLDL-C level estimated by Sampson's equation can be used instead of measured sdLDL-C level in general practice. By building multiple machine learning models of artificial intelligence, a more accurate and practical estimation of sdLDL-C level might be possible.