Four disease diagram sets to aid in assessments of rice brown spot severity were developed and evaluated with regard to accuracy and reliability of the estimates. These sets had severity increments in a linear (LIN) or logarithmic (LOG) fashion with the diagrams depicted in true colour (COL) or black and white (BW). Ten inexperienced raters evaluated one of the sets, totalling forty raters. On each group of raters, the assessment was made first unaided and then aided with the respective diagram set. The minimum and the maximum severities of the diagram sets were 0.5 and 36%, respectively, and the five intermediate severity values were 6, 12, 18, 24 and 32% for the LIN diagram sets, and 1.8, 3.3, 6, 11 and 20% for the LOG sets. Overall agreement, measured by the Lin's concordance correlation coefficient (ρc), increased considerably when using the aid (unaided mean ρc = 0.53, aided mean ρc = 0.87) due to a strong reduction in the systematic bias, measured by the bias correction factor Cb (unaided mean Cb = 0.60, aided mean Cb = 0.95). All diagrams led to similar accuracy and precision, but a consistent overestimation was still observed when using the LIN sets, and variability for the absolute errors was higher for the LOG sets, compared with the LIN sets. Estimates using the diagram sets were more reliable based on the intraclass correlation (mean ρ = 0.79–0.86) compared with unaided estimates (mean ρ = 0.51–0.67). Raters exhibited preference for specific values, such as the ‘knots’ (10, 20, 30%, etc.), and the severity values represented in the diagrams, especially when using the LIN sets. The diagram sets similarly helped to improve accuracy and reliability of estimates of rice brown spot epidemics.