Direkt zum Inhalt

Kraus, Elisabeth Barbara ; Wild, Johannes ; Hilbert, Sven

Using Interpretable Machine Learning for Differential Item Functioning Detection in Psychometric Tests

Kraus, Elisabeth Barbara, Wild, Johannes und Hilbert, Sven (2024) Using Interpretable Machine Learning for Differential Item Functioning Detection in Psychometric Tests. Applied Psychological Measurement, S. 167-186.

Veröffentlichungsdatum dieses Volltextes: 17 Mai 2024 11:55
Artikel
DOI zum Zitieren dieses Dokuments: 10.5283/epub.58285


Zusammenfassung

This study presents a novel method to investigate test fairness and differential item functioning combining psychometrics and machine learning. Test unfairness manifests itself in systematic and demographically imbalanced influences of confounding constructs on residual variances in psychometric modeling. Our method aims to account for resulting complex relationships between response patterns and ...

This study presents a novel method to investigate test fairness and differential item functioning combining psychometrics and machine learning. Test unfairness manifests itself in systematic and demographically imbalanced influences of confounding constructs on residual variances in psychometric modeling. Our method aims to account for resulting complex relationships between response patterns and demographic attributes. Specifically, it measures the importance of individual test items, and latent ability scores in comparison to a random baseline variable when predicting demographic characteristics. We conducted a simulation study to examine the functionality of our method under various conditions such as linear and complex impact, unfairness and varying number of factors, unfair items, and varying test length. We found that our method detects unfair items as reliably as Mantel–Haenszel statistics or logistic regression analyses but generalizes to multidimensional scales in a straight forward manner. To apply the method, we used random forests to predict migration backgrounds from ability scores and single items of an elementary school reading comprehension test. One item was found to be unfair according to all proposed decision criteria. Further analysis of the item’s content provided plausible explanations for this finding. Analysis code is available at: https://osf.io/s57rw/?view_only=47a3564028d64758982730c6d9c6c547.



Beteiligte Einrichtungen


    Details

    DokumentenartArtikel
    Titel eines Journals oder einer ZeitschriftApplied Psychological Measurement
    Verlag:Sage
    Seitenbereich:S. 167-186
    Datum11 März 2024
    InstitutionenNicht ausgewählt
    Identifikationsnummer
    WertTyp
    10.1177/01466216241238744DOI
    Stichwörter / Keywordspsychometrics, machine learning, interpretable machine learning, random forest, test fairness, differential item functioning
    Dewey-Dezimal-Klassifikation100 Philosophie und Psychologie > 150 Psychologie
    StatusVeröffentlicht
    BegutachtetJa, diese Version wurde begutachtet
    An der Universität Regensburg entstandenJa
    URN der UB Regensburgurn:nbn:de:bvb:355-epub-582858
    Dokumenten-ID58285

    Bibliographische Daten exportieren

    Nur für Besitzer und Autoren: Kontrollseite des Eintrags

    nach oben