Professional Documents
Culture Documents
This study examined the effects of students and professors sex on student
evaluations of professors teaching effectiveness. Ratings of over four hundred
faculty made by over nine thousand students were analyzed. After controlling for a
large number of variables, the main results showed that (a) male faculty were given
significantly higher evaluations on global teacher effectiveness and academic
competence than female faculty; (b) when controlling for extraneous variables,
female faculty were not found to be rated as more sensitive to student needs than
male faculty; and (c) when making overall, global judgments of faculty performance,
students seem to place more weight on academic competence for male faculty than
for female faculty.
Request for reprints should be sent to ProfessorJim Sidanius, University of California, Los
Angela, Department of Psychology, 405 Hilgard Avenue, Los Angela, CA 90024-1563.
174
topic (Goldberg, 1968) showed that college women who judged the quality
of articles (which one half thought were written by women and the other
half thought were written by men) gave higher ratings to the work attributed
to men (see also McKee & Sherriffs, 1957). Similarly, Fidell (1975) reports
that college and university faculty who evaluated descriptions of
hypothetical psychologists rated the men more favorably than the women.
Deaux and Emswiller (1974) showed that male performance on a perceptual
discrimination task was rated as more skillful than the equivalent female
performance. Taynor and Deaux (1973) revealed that raters evaluated the
response to an emergency situation made by a male as more appropriate and
logical than the females identical response.
Still other research reports more favorable evaluations of womens
performance than mens. Jacobson and Effertz (1974) conducted a study in
which some subjects (the leaders) instructed others how to complete a
task. They conclude that women in a leadership position are perceived, and
perceive themselves, as doing a better job than male leaders, by male and
female subjects, given that the actual performance of the male and female
leaders is equivalent. Hamner, Kim, Baird, and Bigoness (1974) carried out
an investigation in which subjects were asked to take on the role of a grocery
store manager and assess the performance of job applicants stocking
shelves. Female applicants were seen as having a higher overall level of task
performance than male applicants. Abramson, Goldberg, Greenberg, and
Abramson (1977), whose research produced similar findings (women
attorneys and legal assistants were judged to be more vocationally
competent than men), call this effect the talking platypus phenomenon.
The term describes an instance in which an individual achieves a level of
success not anticipated and the achievement is magnified rather than
diminished; it matters little what the platypus says, the wonder is that it can
say anything at all. Two studies of actual employees in actual organizational
settings find evidence of this phenomenon. Although there is evidence in
the literature which suggests that, at least in certain situations, women can
receive higher performance evaluations than men (see e.g., Peters et al.,
1984) there is also evidence to indicate that success for men will tend to be
attributed to competence while success for women will tend to be attributed
to luck (see Etaugh & Brown, 1975; Wiener et al., 1971).
Although some studies have revealed biases favoring males and others
have found biases favoring females, other studies tend to find no effect of
gender on evaluations at all (see e.g., Elmore & LaPointe, 1975; Harris,
1975). Hall and Hall (1976) conducted research in which student subjects
evaluated the performance of a hypothetical manager who effectively solved
a problem. They found that the gender of job incumbents did not affect
performance appraisals. Frank and Drucher (1977) trained undergraduate
176 SlDANlUS AND CRANE
or teachers they have read about rather than teachers they have had as
students (see also review by Marsh, 1984).
The aim of our research is to assess the effect of sex on performance
evaluations in a more comprehensive manner than has been done in the
past. We will attempt to examine the effects of teachers sex by correcting
for the specification errors inherent in earlier research and using several
thousand naturally occurring and formal performance evaluations. We will
ask four specific questions regarding gender effects on evaluations. The first
question addresses the possibility of an effect of teachers sex on evaluations,
when the effects of other determinants of teaching evaluations are
controlled for and a larger sample of evaluations are considered. The
second, and related question concerns the possibility of an interaction effect
of student and teachers sex. The third question concerns the effect of the
violation of gender stereotypes. This will be done by examining the
evaluations of women in departments which are overwhelmingly dominated
by men. Finally, we will compare the determinants of overall or global
evaluations for male and female professors. We will compare the impact of
control variables, gender-related variables, and specific aspects of teaching
effectiveness on global evaluations of male and female professors.
Method
Procedure
Subjects
Dependent Variables
Results
Indicator Loading
Competency
The instructor showed a scholarly grasp of the course material. $2
The instructor seemed well prepared for lecture or discussion. .78
The instructor showed confidence before the class. .74
The instructor used clear, relevant examples. .66
The instructor kept the lectures and class discussions focused
on the subject of the course. .69
Sensitivity To Students
The instructor seemed to be sensitive to the feelings and
needs of the students. .76
The instructor usually seemed to be aware of whether the class
was following the presentation with understanding. .75
The instructor made me feel free to ask questions, disagree,
and express my ideas. .75
Factor scores for [Competency] and [Sensitivity to Students] were calculated as weighted
sums of the factor loadings found in the measurement model.
%he classic experimental analytic approach was used where the main effects of instructor
and student sex each were evaluated while controlling for the effects of the other and the
interaction term is assessed after the main effects are controlled. No adjustment for interaction
is made while assessing main effects.
182 SlDANlUS AND CRANE
Multivariate Analyses
Global Evaluations
7The quadratic aspects were also considered because it has been found that evaluations are,
in part, a curvilinear function of the number of students in the class.
JOB EVALUATIONAND GENDER 183
Table 2
Entry
Step Variables Beta B R
Table 2 (Continued)
Hierarchical Regression of Global Evaluation on Independent Variables
and Their Interactions
Entry
Step Variables Beta B R
S Teachers sex X
Associate professor .M8 .m*
Teachers sex X
Assistant professor .008 .012
Teachers sex X
Lecturer - .149 -.240* * *
Teachers sex X
Instructor -.044 -.074**
Teachers sex X
Teaching assistant .046 .110
.293***
6 Competency .420 .349***
Sensitivity to students .392 .241** *
.742** *
7 Competency
X Students sex -.007 -.001
Sensitivity to students
X Students sex -.004 -.ooo
.742** *
8 Competency
X Teachers sex .446 .053** *
Sensitivity to students
X Teachers sex -.088 -.011
.743** *
higher Global Evaluation ratings than women faculty, even when controlling
for the covariates above.
Continued additions to the equation indicated no significant Instructor
sex x student sex or Instructor sex x Percent women faculty interactions
(stages 3 and 4 respectively). This is to say that there was no evidence that
the differences in the performance evaluations male and female instructors
received depended upon the sex of the student. Likewise, there was also no
evidence to indicate that the differences in the performance evaluations that
male vs. female instructors received depended upon the number of female
instructors within a given department. This factor was tested because it is
conceivable that female instructors teaching in traditional and
predominantly male-dominated departments might be judged significantly
more harshly than their male counterparts. Our results tend not to confirm
this hypothesis.
The fifth stage of the hierarchical analyses did, however, reveal significant
Instructor sex x Instructor rank interactions. The nature of these
interactions revealed that, although male Teaching Assistants and Assistant,
Associate and Full Professors were given about 9% higher global
evaluations (on the average) than their female counterparts, males with the
rank of Instructor were given only marginally higher evaluations than
their female counterparts. Furthermore, males with the rank of Lecturer
were given 9% lower ratings than female Lecturers. Therefore, the sex
effect noted above must be modified somewhat by adding that the Global
Evaluation received by male and female teachers seems, in part, to depend
on the teachers rank.
A second means of attacking this question would be by examining the
rates at which male and female faculty actually participate in the evaluation
process. If male faculty are avoiding the evaluation process to a larger extent
than female faculty, then the proportion of male faculty in the examined
sample should be significantly smaller than the proportion of male faculty
within the university at large. However, examination of these proportions
revealed that male faculty were not unrepresented in the sample at any rank.
If anything, the male faculty seemed to be slightly, although not
significantly, overrepresented in the sample compared to the university
population.
Finally, one could argue that the results are biased in favor of the male
faculty because the males will have a higher tendency to allow themselves to
be evaluated only in their really good courses whereas females will tend to
allow evaluation across all of their courses. Although this argument is
plausible, it seems somewhat unlikely. Unfortunately, however, the best way
to settle the question, i.e., empirical test, is not possible here because our
data set does not contain information concerning the number of courses
186 SlDANlUS AND CRANE
Competency
ratings than instructors within other academic areas; (e) instructors rank.
Full Professors, Associate Professors, Assistant Professors and Instructors
tended to get relatively high competency ratings while Lecturers and
Teaching Assistants tended to get relatively low competency ratings; ( f )
number of students in the class. Instructors of very small and very large
classes tended to get relatively high competency ratings while instructors of
moderate-sized classes tended to get relatively low competency ratings; and
(g) percent of women in faculty. The greater the percentage of women
faculty in the instructors department, the higher these instructors
competency ratings. With the exception of the last result, these relationships
have also been found in other studies (see Table 3).
The critical question for this study, however, is whether or not women
faculty will receive significantly lower competency ratings even when
controlling for all of the above covariates. The hierarchical analyses
disclosed that, as predicted, female faculty were indeed perceived as being
significantlyless competent than male faculty, even when controlling for the
covariates. Furthermore, step 3 of the analyses showed no significant
interaction effect between student sex and instructor sex. This is to say that
the adjudged superior competence of male vs. female faculty was not
affected by the students sex
The fourth step of these analyses did disclose a significant interaction
between percentage of female faculty in a given department and the facultys
sex. It was hypothesized that female faculty in non-traditional roles (as
operationalized by the percentage of the women in a given department)
would be perceived as less competent than their male counterparts.
Although this interaction effect was found to be statistically significant, the
pattern of competency ratings for different combinations of sex and
percentage of female faculty was much more complicated than expected. As
predicted, the relative adjudged competency of males vs. females was greater
in predominantly male-dominated departments as compared to departments
with a higher proportion of female faculty. However, in very
male-dominated departments female faculty were rated as being very
competent, even more so than their male counterparts in those
departments.
Finally, the last stage of these analyses showed a significant interaction
between the facultys sex and the facultys rank, showing junior female
faculty being judged as more competent than their male counterparts and
female senior faculty being judged less competent than their male
counterparts. All of the independent variables together showed a relatively
small yet statistically significant relationship with the instructors judged
competency (R = .267,p -c lo4).
188 SlDANlUSAND CRANE
Table 3
Entry
Step Variables Beta B R
1 Students sex -.049 -.054** *
Expected grade .125 .178** *
Grade point average -.035 -.035**
Student rank .017 .015
Elective vs. required
course .004 .012
Life Sciences -.024 -.176
Humanities - .0562 - .124** *
Social Sciences -.012 -.063
Associate Professor .089 .155***
Assistant Professor .025 .046
Lecturers -.130 -.248** *
Instructors .004 .007
Teaching assistants -.044 -.126
Number of students -.147 -.001**
Number of students2 .238 .001**
Percent women faculty .082 .ow** *
.237** *
2 Instructors sex .052 .065***
.242** *
3 Students sex X
Teachers sex ,012 .013
.242** *
4 Instructors sex X
Percent women faculty - ,035 -.003**
.244** *
5 Teachers sex X
Associate professor .016 .027*
Teachers sex X
Assistant professor .041 .075
Teachers sex X
Lecturer -.147 -.283***
Teachers sex X
Instructor -.043 -.085*
JOB EVALUATIONAND GENDER 189
Table 3 (Continued)
Entry
Step Variables Beta B R
Teachers sex X
Teaching assistant -.001 -.m
.267***
Sensitivity to Students
Despite the fact that the simpler analyses above showed women faculty
being perceived as more sensitive than male faculty, after controlling for all
of the covariates there were no significant differences in the degrees to
which male and female faculty were rated as sensitive to students. There
were, however, certain significant interactions between faculty sex and
faculty rank. These interactions tended to show that female Lecturers were
considered to be more sensitive to students than male Lecturers but that
male Assistant and Full Professors tended to be considered more sensitive
to students than female Assistant and Full Professors.
Altogether there was a moderate relationship between all the
exogenous variables and the students perceptions of the facultys sensitivity
to students (R = .368,p < lo-). However, the single factor having, by far,
the largest impact upon the students perceptions of the instructors
sensitivity to students was the number of students in the class. This
relationship consisted of significant linear and quadratic components.
The purpose of this study has been fourfold: (1) to determine whether
there are differences in job performance evaluations made of men and
women faculty; (2) to determine whether these evaluations are affected by
interactions between the sexes of evaluators and evaluatees; (3) to
determine whether any effects of the possible violation of traditional gender
190 SlDANlUS AND CRANE
Table 4
Entry
Step Variables Beta B R
Entry
Step Variables Beta B R
Teachers sex X
Teaching assistant .016 ,062
.267***
notion that the less appropriate a persons position in a given role, the less
adequate that persons performance will be judged. However, in extreme
circumstances, that is to say when women are found in very male-dominated
environments, the female instructors were judged to be more competent
than their male counterparts. This result is consistent with the talking
platypus hypothesis. This interpretation is also congruent with the recent
findings of Eagley and Steffen (1984). On the other hand, it is also possible
that in order for women to be accepted in very male-dominated roles, they
will in fact have to be much more competent than their male counterparts
and that the higher-perceived competency of these women is not simply a
function of perceptual distortion but rather due to the fact that they really
are more competent. Finally, of course, it is quite possible that all three
factors are operative at the same time. Needless to say, this particular
phenomenon seems quite complicated and will demand more intense
scrutiny using controls which are not possible for us here if it is to be well
understood.
Fourth, and perhaps most interestingly, we did find evidence indicating
that when students make global evaluations of instructor performance,
competency seems to be a more salient judgmental dimension when making
judgments of male instructors than when making judgments of female
instructors. Furthermore, this effect seems unaffected by the students sex
Therefore, the overall impression given by these data is that, not only are
male professors judged to be more intellectually competent than female
professors, but that when judging overall teacher performance, it is also
more important for males to be intellectually competent than it is for
females.
There are three factors which we feel allow us to view these findings with
a certain degree of relative confidence: (1) Instead of being generated from
some contrived experimental procedure using a small number of instructors
and students, the data used here were generated as part of the universitys
regular and long-standing teacher evaluation process and will actually be
used by this university in making critical personnel decisions; (2) a very large
number of students and teachers were examined, thereby increasing the
precision of parameter estimates; and (3) as far as we know, our study has
controlled for a larger number of possible contaminating variables than
have been considered in the past. Given these circumstances, we think it is
reasonably safe to conclude that gender really does seem to make a
difference in performance evaluation, a difference prejudicial to women
academics as a group.
However, there is also reason to be careful about how these results are to
be interpreted. To begin with, the fact that, in general, men were perceived
as being more competent than women need not be a function of gender
JOB EVALUATION AND GENDER 193
References
Abramson, P., Goldberg, P., Greenberg, J., and Abramson, L. (1977). The
talking platypus phenomenon: Competency ratings as a function of sex
and professional status. Psychology of Women Quarter&, 2,114-124.
Aleamoni, L. (1981). Student ratings of instruction. In J. Millman (Ed.),
Handbook of Teacher Evaluations (pp. 110-145). Beverly Hills, C A Sage
Publications.
Anderson, W. T., Jr., Alpert, M. I. 8t Golden, L. L. (1977). A comparative
analysis of student-teacher interpersonal similarity/dissimilarity and
teaching effectiveness. The Journal of Educational Research, 71,3644.
Bayer, A. F. and Astin, H. S. (1975). Sex differentials in the Academic
reward system. Science, 188,796-801.
Basow, S. and Silberg, N. T. (1987). Student evaluations of college
professors: Are female and male professors rated differently? Journal of
Educational PsychoZogy, 79,308-314.
Bennett, S. K (1982). Student perceptions of and expectations for male and
female instructors: Evidence relating to the question of gender bias in
teaching evaluations.Journal of Educational Psychology, 74,170-179.
Benokratis, N. and Feagin, J. (1986). Modern Sexzkm. Englewood Cliffs, NJ:
Prentice-Hall.
Bigoness, W. J. (1976). Effect of applicants sex, race and performance on
employers performance ratings: Some additional findings. Journal of
Applied Psychology, 61,8044.
Bray, J. and Howard, G. (1980). Interaction of Teacher and student sex and
sex role orientations and student evaluations of college instruction.
Contemporay Educational Psychology, A5,241-248.
Carter, A. M. and Ruther, W. E. (1975). The Disappearance of Sex
Discrimination in the First Job Placement of New PhD.3. Los Angela:
Higher Education Research Institute.
Centra, J. A. (1973a). Effectiveness of student feedback in modifying college
instruction. Journal of Educational PsychoZogy, 65,395401.
Centra, J. A. (1973b). Self-ratings of college teachers: A comparison with
student ratings. Journal of Educational Measurement, 10,287-295.
JOB EVALUATIONAND GENDER 195
Peters, L., OConnor, E., Weekley, J., Pooyan, A, Frank, B., and
Erenkrantz, B. (1984). Sex bias and managerial evaluations: A replication
and extension.Journal of Applied Psychology, 69,349-352.
Pheterson, G., Kiesler, S. and Goldberg, P. (1971). Evaluation of the
performance of women as a function of their sex, achievement, and
personal history. Journal of Personality and Social Psychology, 19,
114-118.
Rosen, B. & Jerdee, T. H. (1973). The influence of sex-role stereotypes on
evaluations of male and female supervisory behavior. Journal of Applied
Psychology, 57,4448.
Schrank, W. E. (1977). Sex discrimination in faculty salaries: A case study.
Canadian Journal of Economics, X,411-433.
Schrank, W. E. (1985, April). Sex digerences in faculty salaries at Memorial
University.+Adecade later. Report submitted to the President of Memorial
University and the Executive Committee of the Memorial University of
Newfoundland Faculty Association.
Sohn, D. (1982). Sex differences in achievement self-attributions: An
effect-size analysis. Sex Roles, 8,345-357.
Taynor, J. and Deaux, K (1973). When women are more deserving than
men: Equity, attribution and perceived sex differences. Journal of
PersonaZity, 28,360-367.
Taynor, J. and Deaux, K. (1975). Equity and perceived sex differences: Role
behavior as defined by the task, the mode and the actor. Journal of
Personality and Social Psychology, 32,381-390.
Terborg, J. and Ilgen, D. (1975). A theoretical approach to sex
discrimination in traditionally masculine occupations. Organizational
Behavior and Human Performance, 13,352-376.
Tuchman, H. P. (1975). Publication, teaching and the academic reward
structure. Lexington, MA. Lexington Books.
Unger, R. (1979). Sexism in teacher evaluation: The comparability of real
life and laboratory analogs. Academic Psychology Bulletin, 1,163-170.
Wiener, B., Frieze, I., Kukla, A., Reed, L., Rest, S. A & Rosenbaum, R. M.
(1971). Perceiving the causes of success and failure. New York: General
Learning Press.
Wilson, D. & Doyle, K. 0. (1976). Student ratings of instructors. Journal of
Higher Education, 47,465-470.