Generalisability and classical test theory analyses of Koppitz's Scoring System for human figure drawings

Gordon Rae, P Hyland

    Research output: Contribution to journalArticle

    Abstract

    Background. Scoring systems to evaluate children's human drawings for intellectual maturity have been found to have good intra- and inter-scorer reliability. However, there is some evidence (McCarthy, 1944) that such scores may not be stable over time. Aim. The aim of the study was to investigate raters and occasions as potential sources of error in children's Draw-a-Person scores using generalisability and classical test theory. Sample. The sample consisted of 85 school children (45 girls and 40 boys) aged between 8 years I month to 9 years and 7 months. Method. The Koppitz Draw-A-Person (1968) test was administered as a class test on two occasions with exactly a two-week interval between the occasions. All the children's drawings were scored by the same four raters. Results. Generalisability analyses of the Koppitz scores indicated that the variance components for raters and its interaction with both persons and occasions were very small, suggesting that very little measurement error was associated with the raters. However, the estimated variance component for the interaction of persons by occasions was substantial. With four raters and two occasions the generalisability coefficient was .47. These results were consistent with the classical test theory analysis which indicated generally high inter-rater reliabilities but a low test-retest reliability, based on a composite of the four raters. Conclusion. If satisfactory levels of reliability/generalisability are to be achieved with the Koppitz scoring system children have to be tested on several occasions.
    LanguageEnglish
    Pages369-382
    JournalBritish Journal of Educational Psychology
    Volume71
    Issue numberPart 3
    Publication statusPublished - Sep 2001

    Fingerprint

    Human Body
    Reproducibility of Results
    Research Design

    Cite this

    @article{38f5f82ed8bf4c7ab0d7e5d650c08049,
    title = "Generalisability and classical test theory analyses of Koppitz's Scoring System for human figure drawings",
    abstract = "Background. Scoring systems to evaluate children's human drawings for intellectual maturity have been found to have good intra- and inter-scorer reliability. However, there is some evidence (McCarthy, 1944) that such scores may not be stable over time. Aim. The aim of the study was to investigate raters and occasions as potential sources of error in children's Draw-a-Person scores using generalisability and classical test theory. Sample. The sample consisted of 85 school children (45 girls and 40 boys) aged between 8 years I month to 9 years and 7 months. Method. The Koppitz Draw-A-Person (1968) test was administered as a class test on two occasions with exactly a two-week interval between the occasions. All the children's drawings were scored by the same four raters. Results. Generalisability analyses of the Koppitz scores indicated that the variance components for raters and its interaction with both persons and occasions were very small, suggesting that very little measurement error was associated with the raters. However, the estimated variance component for the interaction of persons by occasions was substantial. With four raters and two occasions the generalisability coefficient was .47. These results were consistent with the classical test theory analysis which indicated generally high inter-rater reliabilities but a low test-retest reliability, based on a composite of the four raters. Conclusion. If satisfactory levels of reliability/generalisability are to be achieved with the Koppitz scoring system children have to be tested on several occasions.",
    author = "Gordon Rae and P Hyland",
    year = "2001",
    month = "9",
    language = "English",
    volume = "71",
    pages = "369--382",
    journal = "British Journal of Educational Psychology",
    issn = "0007-0998",
    number = "Part 3",

    }

    Generalisability and classical test theory analyses of Koppitz's Scoring System for human figure drawings. / Rae, Gordon; Hyland, P.

    In: British Journal of Educational Psychology, Vol. 71, No. Part 3, 09.2001, p. 369-382.

    Research output: Contribution to journalArticle

    TY - JOUR

    T1 - Generalisability and classical test theory analyses of Koppitz's Scoring System for human figure drawings

    AU - Rae, Gordon

    AU - Hyland, P

    PY - 2001/9

    Y1 - 2001/9

    N2 - Background. Scoring systems to evaluate children's human drawings for intellectual maturity have been found to have good intra- and inter-scorer reliability. However, there is some evidence (McCarthy, 1944) that such scores may not be stable over time. Aim. The aim of the study was to investigate raters and occasions as potential sources of error in children's Draw-a-Person scores using generalisability and classical test theory. Sample. The sample consisted of 85 school children (45 girls and 40 boys) aged between 8 years I month to 9 years and 7 months. Method. The Koppitz Draw-A-Person (1968) test was administered as a class test on two occasions with exactly a two-week interval between the occasions. All the children's drawings were scored by the same four raters. Results. Generalisability analyses of the Koppitz scores indicated that the variance components for raters and its interaction with both persons and occasions were very small, suggesting that very little measurement error was associated with the raters. However, the estimated variance component for the interaction of persons by occasions was substantial. With four raters and two occasions the generalisability coefficient was .47. These results were consistent with the classical test theory analysis which indicated generally high inter-rater reliabilities but a low test-retest reliability, based on a composite of the four raters. Conclusion. If satisfactory levels of reliability/generalisability are to be achieved with the Koppitz scoring system children have to be tested on several occasions.

    AB - Background. Scoring systems to evaluate children's human drawings for intellectual maturity have been found to have good intra- and inter-scorer reliability. However, there is some evidence (McCarthy, 1944) that such scores may not be stable over time. Aim. The aim of the study was to investigate raters and occasions as potential sources of error in children's Draw-a-Person scores using generalisability and classical test theory. Sample. The sample consisted of 85 school children (45 girls and 40 boys) aged between 8 years I month to 9 years and 7 months. Method. The Koppitz Draw-A-Person (1968) test was administered as a class test on two occasions with exactly a two-week interval between the occasions. All the children's drawings were scored by the same four raters. Results. Generalisability analyses of the Koppitz scores indicated that the variance components for raters and its interaction with both persons and occasions were very small, suggesting that very little measurement error was associated with the raters. However, the estimated variance component for the interaction of persons by occasions was substantial. With four raters and two occasions the generalisability coefficient was .47. These results were consistent with the classical test theory analysis which indicated generally high inter-rater reliabilities but a low test-retest reliability, based on a composite of the four raters. Conclusion. If satisfactory levels of reliability/generalisability are to be achieved with the Koppitz scoring system children have to be tested on several occasions.

    M3 - Article

    VL - 71

    SP - 369

    EP - 382

    JO - British Journal of Educational Psychology

    T2 - British Journal of Educational Psychology

    JF - British Journal of Educational Psychology

    SN - 0007-0998

    IS - Part 3

    ER -