
2010 1 ( 136 )

: 20 80
, , , ,

,
, ,
20 50, , ,

, ,
: ; ; ; ;
Abstract: Large-scale standardized language tests have developed with great momentum in China since the 1980s.
Most of these tests are high-stakes, exerting significant washback effects on teaching and learning in the country.
Therefore, whether these tests are scientifically designed and whether they are appropriately used have become
important concerns. If we establish a set of standards that cater to the specific circumstances and needs of language
testing in China, and that direct and guide the practices of test developers and users throughout the testing process,
we can make these tests more scientific, more objective, and fairer to stakeholders. Since the 1950s, and especially in the past 20
years, language testers around the world have collaborated with educationalists to establish many important standards.
This paper reviews the achievements in the field of standard setting and reflects on these standard-setting initiatives,
which, hopefully, can provide inspiration for our own standard-setting endeavor.
Key words: language testing; test design; test administration; test use; standard setting
CLC number: H319   Document code: B   Article ID: 1004-5112(2010)01-0082-10

1.



, (Shohamy 2001b: 374),
(high-stakes test) ,
, , / , 0
( , 2007: 372),
, , ( 1999,
2000) ,
,
, ,
(Avermaet, Kuijper & Saville
2004; Bachman 2000; Boyd & Davies 2002; Davies 1997a, 1997b, 2008)

* / 0,

Foreign Language World No. 1 2010 (General Serial No. 136)

20 80,
, , CET ( ) PETS
( ) TEM (
) ,
, ( Cheng 2008; 2007; 2000;
2004 ) ,

,


,

( 2007: 372)
, / 5 6,
- . 0,
Alderson ,
, , (Alderson, Clapham & Wall 1995:
237) , : ( 1)
? ( 2) ?

2.

20 40, ,

(Davies 1997a, 2008) 20 20


(Spolsky 1995), , ,
(Bachman 2000; Davies 1997b; Kunnan 1999
) , ?
2. 1
, (ALTE 2001; Avermaet et al 2004)
( APA ) 20 40,
1954, 5 6,
, 5
6 (AERA, APA, NCME 1999)
( ETS) 2002 5ETS 6 5
6, ETS ( ETS 2002: 2) , ETS
( ALTE ) 1994 5
6 ( ALTE 1994) , 2001, 5
6 ( ALTE 2001) ( ILTA ) 2000
Alan Davies 5 6,
5 6 (Bachman 2000; Boyd & Davies 2002; ILTA 2000)
, , , ,


5 6 ,
5 6


1 10

1 10

No.  Issuing body      Year(s)     Title
1    ETS               1981, 2002  ETS Standards for Quality and Fairness
2    AERA, APA, NCME   1985, 1999  Standards for Educational and Psychological Testing
3    JCTP              1988, 2004  Code of Fair Testing Practices in Education
4    ALTE              1994        ALTE Code of Practice
5    NCME              1995        Code of Professional Responsibilities in Educational Measurement
6    JCTP              1998        Rights and Responsibilities of Test Takers: Guidelines and Expectations
7    APA               2000        Report of Task Force on Test User Specifications
8    ILTA              2000        Code of Ethics for ILTA
9    AACE              2003        Responsibilities of Users of Standardized Tests
10   EALTA             2006        EALTA Guidelines of Good Practice in Language Testing and Assessment

1: ( 1) , , ; ( 2 )
, ,
; ( 3) , , 10


2. 2
, 1
: 5ETS 65 65
6 5 6,
: , ,
; ,

,

2. 2. 1 5ETS 6
,
ETS Kurt M. Landgraf, ETS /
, 0 (ETS 2002: V),
Landgraf / 0 (ETS 2002: V) ETS 1981
, , 2002,
/
, 0 (ETS 2002: 1) ETS
13, ETS,



, ETS,
: ,
(ETS 2002: 3)
, ETS
: ,
5 6 (ETS 2002: 3), ETS
, ETS, ,
ETS, (Audit Requirements)
/ , ETS 0 (ETS 2002:
5) , ,
, , ,
2. 2. 2 5 6
, 5APA 6 (Alderson
et al 1995: 237) (AERA)
( APA )
( NCME ) 1954,
1985, 1999 1985,
, , ,
, 15, 264: ( 1)
, ( 2) , ( 3)
;

;


, /

0 ( AERA, APA, NCME 1999: 2)
, /
,
, 0 ( AERA, APA,
NCME 1999: 2) , ,
,

,
, (Bachman 2000;
Shohamy 2001a, 2001b), ,
, ( Kunnan 2000) ,

, ,
, ( AERA,
APA, NCME 1999: 4) ,
, 20 50
,
2. 2. 3 5 6
( ALTE ) 1990, 27, 24
, /
0 (Avermaet et al 2004: 138) ,
,
(Avermaet et al 2004: 139) (JCTP) 1988
5 6 (Alderson et al 1995: 245) , /
, ,
, 0 (Avermaet et al 2004: 138-139)
, : ( )
( )

,
(ALTE 2001; Avermaet et al 2004)

, ,
,
, , ,
, (Boyd & Davies 2002;
Davies 2008) , , ) ) ) 5
6 ( ALTE 2001) , ,
,
, ( JCTP ) 1988

5 6,
(Alderson & Banerjee 2002; Bachman 1990, 2000; Kunnan 1999)
2. 2. 4 5 6
( ILTA ) 1992,
,
, (Boyd & Davies 2002; Davies
1997a, 1997b, 2008) ,
, ,
(Boyd & Davies 2002) Alan Davies, 2000 3
/ , 0 ( ILTA 2000)
, 5 6
, ,
, ,

( ), ( ) ,
( ),
( ) , ,
( ) , ( ) ,
, ( ),
( ) , ,
, , Davies (1997b: 328) ,
,
, , (Davies
2008: 441)

3. :

, , Alderson (1995: 255)


, , ,
,
5 6, / 0 ( ILTA
2000: 1) ,
5ETS 6 /

,
0 ( ETS 2002: 1), , ?
, : ( 1)
; ( 2) ; ( 3)
, ; ( 4)

, ,
, , ,

, ,
Bachman (1990: 279) : /
; 0
, 5ETS
6 5 6, , ETS
, , ,
, ,
, ,
, ; ,
, (Alderson &
Buck 1993; Alderson et al 1995) ,
,

,
,
, ,
,
( AERA, APA, NCME 1999: 4) , 5
6
Seliger & Shohamy (1989: 17) , ,
, ,
,
,
,
, ,
, ,
,


,
,

, ,
, ,
,
, 5 6,
5 6

( ALTE 2001) ETS 5ETS


6 ( ETS 2002: 5), ,
; , ,
(Avermaet et al 2004) ,

, ,
, , 5 6
, ,
,
,
, ,

4.

,
, ,

, , ;
,
,
, ,

:
[1] AACE: Association for Assessment in Counseling and Education (http://www.theaaceonline.com/)
[2] AERA: American Educational Research Association (http://www.aera.net/)
[3] ALTE: Association of Language Testers in Europe (http://www.alte.org/)
[4] APA: American Psychological Association (http://www.apa.org/)
[5] EALTA: European Association for Language Testing and Assessment (http://www.ealta.eu.org/)
[6] ETS: Educational Testing Service (www.ets.org/)
[7] ILTA: International Language Testing Association (www.iltaonline.com/)
[8] JCTP: Joint Committee on Testing Practices (http://www.apa.org/science/jctpweb.html)
[9] NCME: National Council on Measurement in Education (http://www.ncme.org/)

[1] AACE. Responsibilities of Users of Standardized Tests [EB/OL]. http://aace.ncat.edu/Resources/documents/RUST2003%20v11%20Final.pdf, 2003.
[2] AERA, APA, NCME. Standards for Educational and Psychological Testing [Z]. Washington, DC: Author, 1985.
[3] AERA, APA, NCME. Standards for Educational and Psychological Testing [Z]. Washington, DC: AERA, 1999.
[4] Alderson J C & Banerjee J. Language testing and assessment (Part Two) [J]. Language Teaching, 2002, (35).
[5] Alderson J C & Buck G. Standards in testing: A study of the practice of UK examination boards in EFL/ESL testing [J]. Language Testing, 1993, 10(1).
[6] Alderson J C, Clapham C M & Wall D M. Language Test Construction and Evaluation [M]. Cambridge: Cambridge University Press, 1995.
[7] ALTE. ALTE Code of Practice: The Code of Practice for the Association of Language Testers in Europe (Version 1) [EB/OL]. http://www.testdaf.de/institut/pdf/ALTE/ALTE code of Practice Einleitung EN.pdf, 1994.
[8] ALTE. Principles of Good Practice for ALTE Examinations [EB/OL]. http://www.testdaf.de/institut/pdf/ALTE/ALTE good practice.pdf, 2001.
[9] APA. Technical recommendations for psychological tests and diagnostic techniques [J]. Psychological Bulletin, 1954, 51(2, Part 2, Supplement).
[10] APA. Report of the Task Force on Test User Specifications [EB/OL]. http://www.apa.org/science/tuq.pdf, 2000.
[11] Avermaet P V, Kuijper H & Saville N. A code of practice and quality management system for international language examinations [J]. Language Assessment Quarterly, 2004, 1(2&3).
[12] Bachman L F. Fundamental Considerations in Language Testing [M]. Oxford: Oxford University Press, 1990.
[13] Bachman L F. Modern language testing at the turn of the century: Assuring that what we count counts [J]. Language Testing, 2000, 17(1).
[14] Boyd K & Davies A. Doctors' orders for language testers: The origin and purpose of ethical codes [J]. Language Testing, 2002, 19(3).
[15] Cheng L. The key to success: English language testing in China [J]. Language Testing, 2008, 25(1).
[16] Davies A. Demands of being professional in language testing [J]. Language Testing, 1997a, 14(3).
[17] Davies A (ed.). Special issue: Ethics in language testing [J]. Language Testing, 1997b, 14(3).
[18] Davies A. Ethics, professionalism, rights and codes [A]. In Shohamy E & Hornberger N H (eds.). Encyclopedia of Language and Education (Volume 7: Language Assessment) [Z]. New York: Springer, 2008. 429-433.
[19] ETS. ETS Standards for Quality and Fairness [Z]. Princeton, New Jersey: Author, 2002.
[20] ILTA. Code of Ethics for ILTA [EB/OL]. http://www.iltaonline.com/code.pdf, 2000.
[21] JCTP. Code of Fair Testing Practices in Education [EB/OL]. http://www.apa.org/science/fairtestcode.html, 1988/2004.
[22] JCTP. Rights and Responsibilities of Test Takers: Guidelines and Expectations [EB/OL]. http://www.theaaceonline.com/rights.pdf, 1998.
[23] Kunnan A. Recent developments in language testing [J]. Annual Review of Applied Linguistics, 1999, (19).
[24] Kunnan A (ed.). Fairness and Validation in Language Assessment [M]. Cambridge: Cambridge University Press, 2000.
[25] NCME. Code of Professional Responsibilities in Educational Measurement [EB/OL]. http://www.natd.org/Code of Professional Responsibilities.html, 1995.
[26] Seliger H W & Shohamy E. Second Language Research Methods [M]. Oxford: Oxford University Press, 1989.
[27] Shohamy E. The Power of Tests: A Critical Perspective of the Uses of Language Tests [M]. London: Pearson Education, 2001a.
[28] Shohamy E. Democratic assessment as an alternative [J]. Language Testing, 2001b, (4).
[29] Spolsky B. Measured Words [M]. Oxford: Oxford University Press, 1995.
[ 30] . ) ) ) [ M ]. :
, 2007.
[ 31] .
[ J]. , 2000, ( 4).

[ 32] . : [M ]. :
, 2004.
[ 33] , ( ). [ M ]. : , 2003.
[ 34] . [ J]. , 1999, ( 1).
[ 35] . [ A ]. A lderson J C, C lapham C M, W allD M . [M ]. :
, 2000.
[ 36] , . [ J]. , 2007, ( 4).

: , 200240

