Comparison of Person-Fit Statistics for Polytomous Items in Different Test Conditions

Sengul Avsar, ASİYE

doi:10.21031/epod.525647

JOURNAL OF MEASUREMENT AND EVALUATION IN EDUCATION AND PSYCHOLOGY-EPOD, vol.10, no.4, pp.348-364, 2019 (ESCI)

Publication Type: Article / Full Article
Volume: 10 Issue: 4
Publication Date: 2019
DOI Number: 10.21031/epod.525647
Journal Name: JOURNAL OF MEASUREMENT AND EVALUATION IN EDUCATION AND PSYCHOLOGY-EPOD
Journal Indexes: Emerging Sources Citation Index (ESCI), Scopus, TR DİZİN (ULAKBİM)
Pages: pp.348-364
Keywords: Polytomous items, aberrant item response, person-fit statistics, ITEM RESPONSE THEORY, SCORE PATTERNS, MODELS
Affiliated with Recep Tayyip Erdoğan University: Yes

Abstract

The validity of individual test scores is an important issue in psychological and educational assessment. One important factor affecting this validity is aberrant item response behavior. Aberrant item scores may inflate or deflate individuals' scores, so that their ability is estimated above or below its true level. Person-fit statistics (PFS) are useful tools for detecting aberrant behavior, and a great number of parametric and nonparametric PFS exist in the literature. The general purpose of this study is to examine the effectiveness of parametric and nonparametric PFS in data sets consisting of polytomous items. It is fundamental research that uses simulated data sets to determine the effectiveness of PFS. As expected, detection rates (power) increased as the Type I error rate (significance level alpha) increased. In general, detection rates also increased as the number of misfitting item score vectors and the number of items increased. Generally, nonparametric PFS (N-PFS), especially G^p, detected more aberrant individuals than the parametric PFS (P-PFS) l_z^p. However, in some test conditions with longer tests, l_z^p detected more aberrant individuals than N-PFS. The results indicate that N-PFS outperformed P-PFS in most of the test conditions.