The aim of this study was to develop and test models for training dental students in periodontal examinations using manikins that had distinct anatomical designs but were indistinguishable in external appearance. After four models were tested for inter-and intra-examiner reliability by two experienced dentists, 26 additional models were produced. The models were tested by 35 dental students at a dental school in Germany in 2014. The testing involved completing a periodontal examination that included probing depths, gingival recessions, and furcation involvements. The primary purpose of the study was to determine whether the models could be used as a tool for periodontal examination training by the students. Levels of agreement (students and dentists) and Kappa statistics (dentists) were calculated using absolute (±0 mm) and tolerable difference (±1 mm). Over the span of two weeks, the dentists' reliability with preset values for probing depths, gingival recessions, and furcation involvements ranged from 0.29 to 0.38, 0.52 to 0.61, and 0.54 to 0.57, respectively, under absolute difference and from 0.86 to 0.90, 0.96 to 0.99, and 0.62 to 0.73, respectively, under tolerable difference. The students' proportions of agreement for probing depths and gingival recessions under absolute vs. tolerable difference were 34.8% vs. 79.9% and 71.9% vs. 94.4%, respectively. The students frequently scored values higher than the preset values, overestimated furcation involvements, and failed to differentiate the levels of furcations. The models used did not pose any systematic or technical difficulties in the pilot study. Students were unable to measure furcation involvements with acceptable agreement. Thus, these models could be used for student periodontal examination training.