Professional Documents
Culture Documents
In 1962, the Scholastic Aptitude Test replaced the essay test used by the Colleg e Entrance Examination Board.
This test, and the advent of machine scoring led to a rapid increase in the use of standardized achievement tests in the U.S.
1.
2.
3.
Assign Grades
4.
5.
6.
Curriculum Evaluation
7.
Accountability
?????? The information gained from a standardized test is dependent upon how the testing is incorporated into the learning material.
Summative Evaluation :? Testing is done at the end of the instructional unit.? T he test score is seen as the summation of all knowledge learned during a particu lar subject unit.
Formative Evaluation : Testing occurs constantly with learning so that teachers can evaluate the effectiveness of teaching methods along with the assessment of students' abilities.
Norm-Referenced
Criterion Referenced
In order to accomplish this goal, they make objective information concerning sch olastic performance available to educators and public policy officials.
They use a criterion-referenced approach to evaluating performance in ten subjec t areas, which are age stratified in four groups :? at age 9,13,17, and 25-35.?
The criterions they set can be used as guidelines to evaluate the effectiveness of the educational system within a particular area by comparing the performance to the national criterion levels. Types of Standardized Achievement Tests
4 major Categories
Survey Test Batteries : Commonly used to determine general standing with respect to group performance. He battery is a group of subject area tests, usually cont aining a fairly limited sample of questions with in each subject area. ?????? Test batteries usually have lower reliabilities than single subject surve y tests bc of the limited question sample of each subject area.
Single Subject Survey Tests : Longer and more detailed than batteries, but only one subject are is covered by the test. Greater sampling of questions means high er levels of reliability than survey batteries.
Diagnostic Tests? :? Allows for the identification of specific strengths and wea knesses within a subject area by subdividing the subject area into the underlyin g components. Diagnostic tests are common in the areas of reading, mathematics, spelling, and foreign languages are most common. ???
Prognostic Tests :? Aptitude tests which are designed to predict achievement in specific school subjects. Criticisms of College Entrance Examinations
1.
learning.
2.
?????? C. Concerned with only the answer, not how the person came up with the an swer.
ETS came under fire in the 1980's by Raplph Nader, a perpetual candidate for the presidency and a consumer advocate.
Nader criticized the SAT's and ACT's for not measuring imagination, idealism, de termination, and other abilities which he considered important in quantifying hu man qualities.
Allan Niarn (1980), a collegue of Nader's claimed that the SAT's measure Social Class, rather than educational aptitude.
Nairn claims the ETS is trying to suppress this information, bc the evil purpose behind the SAT is to maintain the status quo of society and to deny opportunity to those of lower socio economic status.
Nairn claims that since the SAT is a poor p[redicotr of college grades, a differ ent measure of assessment should be developed.
ETS responded to these highly publicized attacks by claiming the SAT does not de ny access to higher education for individuals from working class and poor famili es.
Continued the attacks made by Nairn by claiming the SAT is biased against minori ty groups and women, and therefore deny them an equal opportunity for higher edu cation.
They also criticized the use of "experimental" sections of the SAT which were no t used for grading purposes.
New York State has a "truth in testing" law which requires all test takers be gi ven copies of their own answer sheet, and informed how test scores will be compu ted. In addition, test developers must file information concerning the validity of the standardized measure with the State commission of Education
The ETS claims that careful internal review of all potential test items has resu lted in removing bias within an item which would adversely affect scores of wome n and/or minorities.
If scores are significantly affected by Kaplan seminars or other preparation met hods, how reliable are these standardized tests ?
To the degree that people with more disposable resources (higher SES) are more l ikely to take advantage of these programs, is a class bias being created that af fects the interpretation of the test scores ?
?????? Studies designed to measure the effectiveness of these testing seminars h as produced mixed results : College Entrance Examination Board (1971) : Claimed there was no evidence that ? short -term, intensive drilling on SAT type questions did not lead to significan tly higher scores on the verbal portion of the SAT.
However, studies done by Stanley Kaplan and the FTC (1979) showed significant ga ins after a 10 week coaching course.
ETS reanalyzed the FTC data and concluded the coaching sessions could add 20 - 3 5 points to both Math and verbal scores.
The SAT scores were renormed in 1996, to bring the mean back to 500 and the stan dard deviation back to 100.
Declines occurred both sexes, for all ethnic groups, and for both low and high p erformers.
Television
Increased permissiveness in society Autin & Garber 1982 analyzed the decline in SAT scores
1/2 of the variance in the decline in scores was do to difference's in the overa ll composition of students taking the exam.
As college became more accessible to students from middle and lower class backgr ounds, more and more students began taking the entrance exam.
As the population of students taking the SAT began to more closely resemble the population of all possible students who could take the exam, the scores dropped.
This is one qualification public schools in NYS make when cautioning parents whe n interpreting standardized test scores by school district.????? If the school district tries to maximize the proportion of students taking these exams, they will take pains to mention that fact when their "report cards" come out. ?A high level of performance at a school where only 20% of students take that ex am can not be meaningfully compared to test performance from a school which has
Gender :? Men score 37 points higher than females on SAT-Math section.? Testoste rone, hemispheric lateralization, differential reinforcement from math teachers, and differential cultural expectations are four hypothesized differences for th is discrepancy. Men score 7 points higher than women on the SAT verbal section.
Bob Shaffer from Fairtest :? Girls are more inclined to think through a problem, and weigh all the options, and that puts them at a strategic disadvantage in mu ltiple choice tests.
This gender gap means women are less likely to receive scholarships than men.
In college freshman and sophomore years, the women achieve a higher GPA than men , on average.
SAT scores for all other minority groups fall below the test score levels of Cau casian Americans.
Average SAT scores from large cities are typically lower than average.
Average SAT scores from suburban regions are typically higher than average.
To the degree primary education systems suffer from institutional racism as a re sult of funding policies, the class differences which produce error variation in SAT scores will still persist into the future.
Conclusion
Despite the many criticisms, the SAT is still the single best predictor of who w ill be seen a s academically successful during the first year of college.
The differences in education exposed by the SAT (gender, ethnicity, geographic) illustrates areas of improvement which must be made to ensure equal opportunity for higher education for all Americans, not simply those who are well off financ ially and live in educational districts where high levels of educational attainm ent is the norm, rather than the exception.
?????? Both 2000 presidential candidates have stated that education is one of th eir "top priorities" should they be elected.