
Foreign Languages and Their Teaching, 2007, No. 7 (Serial No. 220)

*

( , 210097)
: , ( RST )
, , 385 ,

:
, , RST
: ; ;
Abstract: Compared with other types of annotation, discourse-structure annotation is a newcomer. However, considerable achievements have been made in this new area. A discourse-tagged corpus containing 385 documents selected from the Penn Treebank has been built in the framework of Rhetorical Structure Theory by a research group from the Information Sciences Institute, University of Southern California. It is a large-scale, hierarchically annotated reference corpus with high quality and consistency. Building such a corpus not only establishes a theoretical system for discourse-structure annotation, but also opens up new fields in which discourse structures can be applied. This article reviews the major achievements in establishing such a corpus.

Key Words: Rhetorical Structure Theory; discourse structure; annotation

CLC number: H059; Document code: A; Article ID: 1004-6038(2007)07-0009-02

( 1987) ( RST )
0. ( RST ) :
, ;
,
, , ;
, ,

,
, ( RST )
, 2.
,
385 , , Givon ( 1983)
, Sacks( 1974)
, Polanyi( 1988)
, Grosz Sidner (1986)
, , ,
,
,
1. , , Marcu
Carlson (2001) , , :
Grosz Sidner (1986) Mann Thompson (1987) , ;
, , ;
, ;
, , ,
, Carlson (2001) Mann Thompson in spite of ( ) , according to ( )

*
: , , , , :
Received: 2006-10-20; revised: 2007-04-10

Marcu Grosz Sidner (1986) Mann ,


(1987) Thompson (1987) , 5.2
,
, ,
3. Kappa ( Kappa
, Mann Thompson (1987) 20 ,
, , ) Kappa 0.8, ;
, 0.6 0.8 ,
Marcu (2000) 53 25
, 78 16 , 6.
RST ,
, ,
,
, 6. 1

4. , ,
, Carlson ( 2001)
,
,
, 3
, , since as
, , ,
, , 1/ 3,
,
, , as since,
, ,
, 6. 2
, , Lancaster OBC (Garside , 1987; Biber ,
, , 1998, Carlson, 2001) , TDT
( Wayne, 2000, Carlson, 2001) ,
, , RST
,
, , , ,
, , , ,

,
5. , ,
,
,
, , , ,
, ,
5. 1 , ,
, ,
, 6. 3
,
, , /
, , , - 0,
,
( 16 )

, , ; ,
,
,
, ,
8.
/ 0 ,
,
, ,
, ,
, ,
/ 0 ,

,
, ,
, :
, [ 1] . [ J] . , 2000( 4) : 291- 297.
, , [ 2] . [ J] . , 1991
, , ( 1) .
[ 3] . / 0 [ J] . , 2001( 3) .
,
[ 4] . [ J] . , 2000( 1) .
,
[ 5] . [ Z] .
: , 2002.
, , , [ 6] . - [ J] .
, , , 1988( 2) .

( 10 ) 2001 , Daniel Marcu RST


385
, , , 31 2, 124 ,
, , / 0 176, 000, 458 ,
,
, Halliday Hasan :
( 1976) , RST
6. 4
:
[1] Carlson, L., Marcu, D. & Okurowski, M. Building a Discourse-tagged Corpus in the Framework of Rhetorical Structure Theory. Proceedings of the First Annual Meeting of the North American Chapter of the Association for Computational Linguistics, Seattle, WA, 2001: 9-17.
[2] Grosz, B. & Sidner, C. Attention, Intentions, and the Structure of Discourse [J]. Computational Linguistics, 1986, 12(3): 175-204.
Talmy Givon, 1983.
[3] Halliday, M. A. K. & R. Hasan. Cohesion in English [M]. London: Longman, 1976.
[4] Mann, W. & S. Thompson. Rhetorical Structure Theory: A Theory of Text Organization. USC Information Sciences Institute. Technical Report (ISI/RS-87-190), 1987.
[5] Marcu, D. The Theory and Practice of Discourse Parsing and Summarization [M]. Cambridge, Massachusetts: MIT Press, 2000.
7.