Professional Documents
Culture Documents
Question formats
The AKT consists of two item types: 70 single best answer (SBA)
questions and 80 extended matching questions (EMQ). Single best
answer questions require the candidate to select the one correct
answer from five response options (Table 1); EMQ items require
candidates to select the most likely or best answer from a list of up
to 26 choices. Frequently, several EMQ items are presented together
with the same menu list and the same medical theme (eg. diagnosing
Reprinted from Australian Family Physician Vol. 37, No. 8, August 2008 659
EDUCATION Assessing general practice knowledge base – the applied knowledge test
660 Reprinted from Australian Family Physician Vol. 37, No. 8, August 2008
Assessing general practice knowledge base – the applied knowledge test EDUCATION
Table 2. Reliabilities of the AKT examination segment In addition, AKT scores typically correlate with those of the two
(1999.2–2006.2) other examination segments in the range of 0.55–0.65. This figure
indicates that the AKT has some variance in common with the other
Time of administration Cronbach’s α
segments. Candidate acceptance of a test format is also an important
1999.2 0.82
aspect of its performance as well as providing support for its validity.
2000.1 0.82
2000.2 0.83 Standard setting
2001.1 0.85 Standard setting for each of the three RACGP examination segments
2001.2 0.87 was introduced in 1999. A modified Angoff method6 has been applied
2002.1 0.90 throughout that period to the AKT. As part of the ongoing review
2002.2 0.88 of all aspects of the RACGP examination, AKT standard setting
2003.1 0.88 is currently the focus of attention in an attempt to improve its
2003.2 0.86 consistency. In general terms, each AKT paper is standard set by
a group of 20 experienced GP judges. They review each AKT item,
2004.1 0.86
determining a required level of performance and taking into account
2004.2 0.88
the characteristics of each item. Judgments are combined over items
2005.1 0.82
and judges to create a pass mark.
2005.2 0.82
2006.1 0.84 Conclusion
2006.2 0.80 The AKT was introduced as part of the RACGP’s commitment to
.1 = first examination, .2 = second examination maintaining relevance and quality of knowledge testing in the new
RACGP examination. The available evidence supports the validity of
testing, validity ‘... refers to the appropriateness, meaningfulness the AKT segment. Analysis of its performance over the past 7 years
and usefulness of the specific inferences made from test scores. has demonstrated that its reliability remains at a level that satisfies
Test validation is the process of accumulating evidence to support international expectations for a high stakes assessment.
such inferences’.4 Validity is a complex construct and four types are Maintaining content validity, blueprinting and standard setting
frequently considered in assessment: require ongoing attention to ensure ongoing performance of the AKT
• face validity estimates how well a test appears to measure a segment at international standards. Newer issues involve further
certain criterion; it does not guarantee that the test actually development of the standard setting processes and the testing of
measures that criterion their validity and reliability. Testing of concurrent or predictive validity
• content validity estimates how far your test samples from or would enhance knowledge about the value of the AKT in high stakes
represents all of the relevant content general practice testing.
• concurrent validity estimates the degree to which a test correlates
with a criterion measure given at the same time Conflict of interest: none declared.
• predictive validity estimates the degree to which a score on the test
References
predict scores on another criterion measure available in the future. 1. Farmer E, Hinchy J. Assessing general practice clinical decision making skills.
Face validity of the AKT is supported by the use of writers and Aust Fam Physician 2005;34:1059–61.
reviewers who are GPs from a wide range of Australian general 2. Britt H, Miller G, Knox S, et al. General practice activity in Australia 2004–05.
Canberra: Australian Institute of Health and Welfare (General Practice Series No
practice backgrounds. Content validity is achieved by selecting items
18); 2005 December 2005.
according to a blueprint, which represents the profile of complaints 3. The RACGP. Training Program Curriculum. South Melbourne: The RACGP, 1999.
presenting to Australian general practice.2 Systematic review of 4. American Educational Research Association, American Psychological Association,
National Council on Measurement in Education. Standards for educational and
relevant evidence ensures the validity of the answer key. However,
psychological testing. Washington: American Psychological Association, 1999.
formal testing of concurrent or predictive validity has not been done. 5. Cronbach L. Coefficient alpha and the internal structure of tests. Psychometrica
The AKT is characterised by a high level of internal 1951;6:297–334.
consistency. The internal consistency of a test is a measure 6. Angoff W. Scales, norms, and equivalent scores. Educational measurement.
Washington, DC: American Council on Education, 1971;508–600.
of the coherency of the test. Cronbach’s α is commonly used
to index internal consistency and ranges between 0–1; scores
above 0.8 are commonly accepted as a gold standard for high
stakes examinations.5 The reliabilities obtained over the past 15
administrations of the examination from 1999.2 to 2006.2 have
consistently met and surpassed that criterion (Table 2). correspondence afp@racgp.org.au
Reprinted from Australian Family Physician Vol. 37, No. 8, August 2008 661