Digitala Vetenskapliga Arkivet

Endre søk
RefereraExporteraLink to record
Permanent link

Direct link
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annet format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annet språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Agreement of driving examiners' assessments: evaluating the reliability of the Swedish driving test
Umeå universitet, Samhällsvetenskapliga fakulteten, Institutionen för tillämpad utbildningsvetenskap.
Umeå universitet, Samhällsvetenskapliga fakulteten, Institutionen för psykologi.
2013 (engelsk)Inngår i: Transportation Research Part F: Traffic Psychology and Behaviour, ISSN 1369-8478, E-ISSN 1873-5517, Vol. 19, s. 22-30Artikkel i tidsskrift (Fagfellevurdert) Published
Abstract [en]

The purpose of this study was to examine the consistency of examiner assessments of test-takers' performance on the Swedish driving test. The study included 535 tests and was designed so that the ordinary examiner and a supervising examiner assessed the same test-taker. The assessment was done on a two-grade rating scale (pass/fail). Since the result can be affected by factors associated with the test-taker and the two examiners, questionnaires were developed and these were filled in by the test-takers and the examiners. Information about the administration of the test was collected via a specially designed form filled in by the supervising examiner. Using this form, the ordinary examiners' performance was rated on a number of aspects. The result from the study indicated that the agreement between the assessments was very good. For 93% of the tests the two examiners chose the same mark on the two-grade scale. In the cases where ratings differed, the analysis indicated only a few systematic differences among variables designed to provide possible explanations for differences in opinion. However, none of these was problematic with respect to consistency of assessment. Results indicated that most tests were carried out in a satisfactory manner.

sted, utgiver, år, opplag, sider
Elsevier, 2013. Vol. 19, s. 22-30
Emneord [en]
Driving test, Reliability, Driving examiner
HSV kategori
Identifikatorer
URN: urn:nbn:se:umu:diva-76245DOI: 10.1016/j.trf.2013.02.004ISI: 000319542100003OAI: oai:DiVA.org:umu-76245DiVA, id: diva2:636043
Tilgjengelig fra: 2013-07-08 Laget: 2013-07-08 Sist oppdatert: 2019-10-14bibliografisk kontrollert
Inngår i avhandling
1. Licence to drive: the importance of reliability for the validity of the Swedish driving licence test
Åpne denne publikasjonen i ny fane eller vindu >>Licence to drive: the importance of reliability for the validity of the Swedish driving licence test
2019 (engelsk)Licentiatavhandling, med artikler (Annet vitenskapelig)
Abstract [en]

Background: The Swedish driving licence test is a criterion-referenced test resulting in a pass or fail. It currently consists of two parts - a theory test with 65 multiple-choice items and a practical driving test where at least 25 minutes are spent driving in traffic. It is a high-stakes test in the sense that the results are used to determine whether the test-taker should be allowed to drive a car without supervision. As the only other requirements for obtaining a licence is a few hours of hazard education (and a short introduction if you intend to drive with a lay instructor) it is important that the test result, in terms of pass or fail, is reliable and valid. If this is not the case it could have detrimental effects on traffic safety. Examining all relevant aspects is beyond the scope of this licentiate thesis so I have focused on reliability.

Methods Reliability for both the theoretical and practical test results was examined. As these are very different types of tests the types of reliability examined also differed. In order to examine inter-rater reliability of the driving test 83 examiners were accompanied by one of five selected supervising examiners for a day of tests. All in all 535 tests were conducted with two examiners assessing the same performance. At the end of the day the examiners compared notes and tried to determine the reason for any inconsistencies. Both examiners and students also filled in questionnaires with questions about background and preparation. As for studying decision consistency and decision accuracy of the theory test, three test versions (a total of around 12,000 tests) were examined with the help of methods devised by Subkoviak (Subkoviak, 1976, 1988) and Hanson & Brennan (Brennan, 2004; Hanson & Brennan, 1990).

Results The results from two research studies concerning reliability were presented. Study I focused on inter-rater reliability in the driving test and in 93 per cent of cases the examiners made the same assessment. For the tests where their opinions differed there was no correlation to any of the background variables or other variables examined except for three, which had logical explanations and did not constitute a problem. Although there were cases where the differences were due to different stances on matters of interpretation the most common suggested cause was the placement in the car (back seat vs. front seat). Although the supervising examiners gave both praise and criticism as to how the test was carried out the study does not answer the question whether the tests were equal in terms of composition and difficulty.

In Study II the focus was on decision consistency and decision accuracy in the theory test. Three versions of the theory tests were examined and, on the whole, found to be fairly similar in terms of item difficulty and score distribution, but the mean was so close to the cut-score (i.e. the score required to pass) that the pass rate differed somewhat between versions. Agreement coefficients were around .80 for all test versions (between .79 and .82 depending on method). Classification accuracy indicated an .87 probability of a correct classification.

Conclusion It is important to examine the reliability and validity of the driving licence test since a misclassification can have serious consequences in terms of traffic safety. In the studies included here the rate of agreement between examiners is deemed as satisfactory. It would be preferable if the classification consistency and classification accuracy, as estimated by the methods used, were higher for the theory test, given its importance.

While reliability in terms of agreement between raters/examiners or consistency and accuracy of classification are routinely examined in other contexts, such as large-scale educational testing, this is not often done for the driving licence tests. At the same time, the methods used here can be transferred to contexts where such properties are generally not examined. Collecting information about test-takers and examiners, like in Study I, can provide evidence concerning possible bias.

Examining to what extent decisions are consistent is one important aspect of collecting evidence that shows that test results can be used to draw conclusions about driver competence. Still, regardless of outcome, validation is a process that never ends. There is always reason to examine various aspects and make further improvements. There are also many other relevant aspects to examine. A prerequisite for the validity of the score interpretation of a criterion-referenced test like this one is that the cut-score is appropriate and the content relevant. This should therefore be the subject of further research as the validation process continues.

sted, utgiver, år, opplag, sider
Umeå: Department of applied educational science, Educational measurement, Umeå university, 2019. s. 56
Serie
Academic dissertations at the department of Educational Measurement, ISSN 1652-9650 ; 12
Emneord
Driving licence tests, driver's licence, driving test, theory test, licensing test, interrater reliability, classification consistency, examiner agreement, classification accuracy, förarprov, körprov, kunskapsprov, reliabilitet, validitet, bedömare
HSV kategori
Forskningsprogram
beteendevetenskapliga mätningar
Identifikatorer
urn:nbn:se:umu:diva-163949 (URN)9789178551156 (ISBN)
Presentation
2019-10-25, Aulan, Vårdvetarhuset, Umeå, 10:00 (svensk)
Opponent
Veileder
Tilgjengelig fra: 2019-10-14 Laget: 2019-10-12 Sist oppdatert: 2019-10-14bibliografisk kontrollert

Open Access i DiVA

Fulltekst mangler i DiVA

Andre lenker

Forlagets fulltekst

Søk i DiVA

Av forfatter/redaktør
Alger, SusanneSundström, Anna
Av organisasjonen
I samme tidsskrift
Transportation Research Part F: Traffic Psychology and Behaviour

Søk utenfor DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric

doi
urn-nbn
Totalt: 585 treff
RefereraExporteraLink to record
Permanent link

Direct link
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annet format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annet språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf