a 2014

Reanalyzing the DISMAS Test Data: Comparing IRT and CTT Based Estimates of the Error of Measurement

CÍGLER, Hynek, Michal JABŮREK a Jan ŠIRŮČEK

Základní údaje

Originální název

Reanalyzing the DISMAS Test Data: Comparing IRT and CTT Based Estimates of the Error of Measurement

Autoři

CÍGLER, Hynek (203 Česká republika, garant, domácí), Michal JABŮREK (203 Česká republika, domácí) a Jan ŠIRŮČEK (203 Česká republika, domácí)

Vydání

The 9th Conference of The International Test Commission : Global and Local Challenges for Best Practices in Assessment, 2014

Další údaje

Jazyk

angličtina

Typ výsledku

Konferenční abstrakt

Obor

50100 5.1 Psychology and cognitive sciences

Stát vydavatele

Španělsko

Utajení

není předmětem státního či obchodního tajemství

Kód RIV

RIV/00216224:14230/14:00076015

Organizační jednotka

Fakulta sociálních studií

ISBN

978-84-9860-997-4

Klíčová slova česky

DISMAS; IRT; DIF; diagnostika matematických schopností

Klíčová slova anglicky

DISMAS; IRT; DIF; assessment of math ability

Příznaky

Mezinárodní význam
Změněno: 21. 7. 2014 16:39, Mgr. Michal Jabůrek, Ph.D.

Anotace

V originále

DISMAS test (Traspe & Skalkova, 2013) is an original Czech assessment tool for the diagnosis of the structure of math disability in children in grades 1–5. It consists of five main areas divided into 14 unidimensional subtests and it was developed and standardized using classical test theory. Authors verified validity. Subtests’ correlations with WISC-III were low, but significant, differences between normal and clinical population were high and significant for all grades. Regardless the high validity, reliability seemed to be poor – Cronbach alpha ranged from .44 to .95 on subtest level, on the test level from .69 to .76 depending on the grade level. The aim of this paper is to show why. We questioned the adequacy of CTT and internal consistency as a method for estimating reliability in this case. That is why we applied simple Rasch model and Partial Credit Rasch model to reanalyze the data. Consequently, the latent trait estimates as well as their error of measurement were transformed back to the raw score scale. We compared IRT and CTT based errors of measurement on the same scale for all observed raw scores according to the distribution of the standardization sample. In most subtests the error of estimate based on CTT was smaller around the average score but bigger in the extremes, especially at the low end of the scale (DISMAS is designed to test disability, so it has a high ceiling effect). We conclude that IRT analysis produces more accurate estimate for children with math disability, for whom the test is designed, so in this case Rasch model seems to be more appropriate method to estimate confidence intervals than CTT. Finally, we discuss other sources of relatively low reliability.