Adaptive Selection Algorithm and Standard Error Termination Rule in Comparative Judgement: An Application for Assessing Writing Skills

dc.contributor.authorGürel, Sungur
dc.contributor.authorŞahin, Murat Doğan
dc.contributor.authorUysal, İbrahim
dc.contributor.authorİbileme, Ali İhsan
dc.contributor.authorGündüz, Tuba
dc.date.accessioned2025-04-16T06:52:10Z
dc.date.available2025-04-16T06:52:10Z
dc.date.issued2025-01-01
dc.departmentFaculties, Faculty of Education, Department of Educational Sciences
dc.description.abstractThis study aims to examine the scoring reliability of comparative judgement under different sample sizes and standard error termination rule conditions. For this purpose, a Monte Carlo simulation study with 9 conditions and 82 iterations was conducted with sample sizes of 250, 500 and 1000 and standard error termination rules of 0.40, 0.35 and 0.30. In addition, an application for assessing writing skills was conducted with a sample of 50 students using a standard error termination rule of 0.40 and a maximum of 40 comparisons. In the simulation study, scoring reliability was determined by true reliability, rank order accuracy and scale separation reliability. In the application, the ability estimates obtained with adaptive comparative judgement were correlated with the scores obtained with a holistic rubric and with the scores obtained with an analytic rubric. In addition, scale separation reliability was calculated for the ability estimates obtained with adaptive comparative judgement. The simulation results showed a high level of reliability in all conditions, independent of the sample size. We conclude that stricter standard error termination rules lead to higher levels of reliability, but this requires performances to be subjected to a higher number of pairwise comparisons. The application results showed a high scale separation reliability of 0.89 and correlations of over 0.70 with the scores obtained using both the holistic and the analytic rubrics. Overall, the results of the study suggest that adaptive comparative judgement can be used in both classroom and large-scale assessment applications. In addition, adaptive comparative judgement is considered advantageous because it is easier to administer, does not require a change in the testing process, and places abilities on a continuous scale.
dc.identifier10.15390/EB.2025.14123
dc.identifier.doi10.15390/EB.2025.14123
dc.identifier.issn13001337
dc.identifier.other2-s2.0-105001182913
dc.identifier.scopus2-s2.0-105001182913
dc.identifier.scopusqualityQ3
dc.identifier.urihttps://doi.org/10.15390/EB.2025.14123
dc.identifier.urihttps://hdl.handle.net/20.500.12604/8605
dc.identifier.volume50
dc.identifier.wos001440919900005
dc.identifier.wosqualityQ4
dc.indekslendigikaynakScopus
dc.indekslendigikaynakWeb of Science
dc.institutionauthorGürel, Sungur
dc.relation.ispartofEgitim ve Bilim
dc.relation.ispartofseriesEgitim ve Bilim
dc.relation.publicationcategoryArticle - International Refereed Journal - Institutional Faculty Member
dc.rightsinfo:eu-repo/semantics/openAccess
dc.subjectComparative judgement
dc.subjectHolistic assessment
dc.subjectPairwise comparison
dc.subjectScale separation reliability
dc.titleAdaptive Selection Algorithm and Standard Error Termination Rule in Comparative Judgement: An Application for Assessing Writing Skills
dc.typeJournal
oaire.citation.volume50

Files

Original bundle
Name:
Sungur-Gürel-2025.pdf
Size:
701.66 KB
Format:
Adobe Portable Document Format
License bundle
Name:
license.txt
Size:
1.17 KB
Format:
Item-specific license agreed upon to submission