Background. Scoring systems to evaluate children's human drawings for intellectual maturity have been found to have good intra- and inter-scorer reliability. However, there is some evidence (McCarthy, 1944) that such scores may not be stable over time. Aim. The aim of the study was to investigate raters and occasions as potential sources of error in children's Draw-a-Person scores using generalisability and classical test theory. Sample. The sample consisted of 85 school children (45 girls and 40 boys) aged between 8 years I month to 9 years and 7 months. Method. The Koppitz Draw-A-Person (1968) test was administered as a class test on two occasions with exactly a two-week interval between the occasions. All the children's drawings were scored by the same four raters. Results. Generalisability analyses of the Koppitz scores indicated that the variance components for raters and its interaction with both persons and occasions were very small, suggesting that very little measurement error was associated with the raters. However, the estimated variance component for the interaction of persons by occasions was substantial. With four raters and two occasions the generalisability coefficient was .47. These results were consistent with the classical test theory analysis which indicated generally high inter-rater reliabilities but a low test-retest reliability, based on a composite of the four raters. Conclusion. If satisfactory levels of reliability/generalisability are to be achieved with the Koppitz scoring system children have to be tested on several occasions.
|Journal||British Journal of Educational Psychology|
|Issue number||Part 3|
|Publication status||Published (in print/issue) - Sep 2001|