You are on page 1of 15

g:\proceduresdox\bf&accent&fieldcodes.

doc updated 11/19/1


ACCENT CODES
An accent code appears immediately after the letter it is supposed to appear on in print, with no spaces around it. These codes all begin
with a $ (dollar sign) and, if they are in the title, author, or contributor lines, are enclosed in <...> (angle brackets) and preceded by ^
(caret). There are four vertical levels for the placement of accent codes, determined by the shape of the letter and the position of the
accent (see below for a fifth group of accents):
capital tall descender short
1 At the top 2 At the top Name 3 At the bottom 4 At the bottom of Name
of capital of lower case of of regular letters with of
or tall letters: or short letters: accent: letters: descenders: accent:
= A$OCL = a$FOC overcircle e = e$FCL

= j$UCL undercircle
= C$DOT = c$FDT overdot c = e$FUD g

= g$UDT underdot
= A$MAC = a$FMA macron e = e$FUN j

= j$UMA underline
= C$ACT = c$FAC acute = c$FLC g

= g$LCD left cedilla


= O$GRA = o$FGR grave e = e$FLH g

= g$LHK left hook


= O$DAC = o$FDA double acute e = e$FRH g

= g$RHK right hook


= G$CIR g = g$FCI circumflex
= C$HAK c = g$FHA hacek
= A$UMT a = a$FUM umlaut
= E$BRE e = e$FBR brevis
= A$TIL = a$FTL tilde
Through the middle of letters (tall/short, capital/lower-case):
5 At the left, the Name Same as old
right, or through of accent codes
the middle: accent: (DO NOT USE)
d

= d$FCM strike-through ( = D$CMA use $FCM instead for all)


d

= d$FMC strike-through (D

= D$CMC use $FMC instead for all)


= d$FMB strike-through (D = D$CMB because D$FMB seems to differ in print)
= l$FVR slash ( = L$VIR use $FVR instead for all)
In the case of multiple accents on a single letter, multiple codes can be used; be sure to avoid overlapping accents in print (ie, for e use
$FTL for the tilde because it is close to the letter but use $ACT for the acute so that it doesn't overlap the tilde). Note that if you use
multiple accent codes on a single letter, the order does matter; the best print results seem to be obtained with the order (by the
groupings established above) 2 > 5 > 1 > 3 > 4 (only 2 > 1 and 2 > 3 have been encountered and confirmed so far). Use this example
for reference:
e" # $ would be e$FMA$ACT$FCL.
g:\proceduresdox\bf&accent&fieldcodes.doc updated 11/19/1
WARNING: All accent, field, and BF codes get stripped out of the electronic version of LLBA. Be aware of the consequences of this
for the appearance and accuracy of our records. For accent codes, this leaves the bare letter, so that what appears as dda in print will
appear as dada online. A recent real example of a grievous error with accent codes that rendered a title completely inaccurate is this
one:
Prosodic Stem =^<$FVR> Prosodic Word in Bantu
This will appear just fine in print as
Prosodic Stem Prosodic Word in Bantu
but it will be exactly opposite in meaning online because the accent code will be removed:
Prosodic Stem = Prosodic Word in Bantu
This title should have appeared as follows in order to avoid this error:
Prosodic Stem (Does Not Equal) Prosodic Word in Bantu
because there is no "" symbol available and the intention of the author must not be misrepresented.
g:\proceduresdox\bf&accent&fieldcodes.doc updated 11/19/1
FONT FORMATTING FIELD CODES
Our database uses only regular Roman font, but we have the need for font formatting. Note that all field codes are enclosed in <...>
(angle brackets) and all fields are closed with the code <RF>. Also note that field codes in the title line are preceded by a ^ (caret):
Times Roman is used in the general note line to give page numbers of do-as-ones and in the title line to give descriptive information
(this visually offsets the information because these two fields are bolded in the database):
TI: Linguistics (A. Remy's Tr from German) = Linguistics ^<FTMR>(A. Remy's Tr from German)^<RF>
GN: Comments, 11-29; Reply, 30-40. = Comments, <FTMR>11-29<RF>; Reply, <FTMR>30-40<RF>.
Times Roman Italics is generally used for book titles and linguistic examples:
AB: German Apfel apple = German <FTRI>Apfel<RF> "apple'
Times Roman Bold is used in the abstracts of do-as-ones for article titles:
AB: N. Chomsky's The Author Responds = N. Chomsky's <FTRB>The Author Responds<RF>
Times Roman Bold Italics is used in the title line and the general note line instead of <FTRI> (this visually blends the information
because these two fields are bolded in the database):
TI: Review Article: Sprachen = Review Article ^<FTBI>Sprachen^<RF>
GN: A. Remy's English translation of Sprachen. = A. Remy's English translation of <FTBI>Sprachen<RF>.
TIMES ROMAN SMALL is generally used for dates and phonetics:
AB: in the 3rd century BC = in the 3rd century <FTRS>BC<RF>
AB: the lax high front vowel /I/ = the lax high front vowel /<FTRS>I<RF>/
WARNING: All accent, field, and BF codes get stripped out of the electronic version of LLBA. Be aware of the consequences of this
for the appearance and accuracy of our records. For field codes, this leaves the unformatted text, so that what appears as dada in print
will appear as dada online. No errors seem to be introduced by the removal of field codes, but it is still something to bear in mind.
OTHER FONT FORMATTING CODES AND SPECIAL KEYBOARD CHARACTERS
Following are the rules for superscripting and subscripting:
Like other font formatting, superscripts and subscripts have a beginning and ending code; however, like accent codes, these codes all
begin with a $ (dollar sign) and, if they are in the title, author, or contributor lines, are enclosed by <...> (angle brackets) and preceded
by ^ (caret).
Superscript
and
Subscript
are generally used in linguistic examples:
AB: Proto-Algonquin *k
w
eh
2
anka = Proto-Algonquin <FTRI>k$SUPw$ESUeh$SUB2$ESUanka<RF>
TI: Proto-Algonquin *k
w
eh
2
anka = Proto-Algonquin ^<FTBI>k^<$SUP>w^<$ESU>eh^<$SUB>2^<$ESU>anka^<RF>
Following are the rules for quotation marks:
Smart quotes differentiate between open and close quotation marks. The STAR database does not recognize them but our print product
uses them. Therefore,
for (a single open smart quote) in print, type " (the double quote on your keyboard)
for (a single close smart quote or apostrophe) in print, type ' (the single quote on your keyboard)
for (a double open smart quote) in print, type "" (two of the double quotes on your keyboard)
for (a double close smart quote) in print, type ' ' (two of the single quotes on your keyboard)
Note that quotation marks are enclosed in <...> (angle brackets) and preceded by ^ (caret) in the title line only when they come in
pairs:
TI: Das Bedeutung von gaalik Hund = Das ^<"">Bedeutung^<''> von ^<FTBI>ga'alik^<RF> ^<">Hund^<'>
TT: The Meaning of gaalik dog = The ""Meaning'' of <FTRI>ga'alik<RF> "dog'
For
c
or

and
o
or

in Semitic words and names, substitute single open and single close quotes, respectively (and do not enclose them
in angle brackets in the title because they are not paired):
Be
c
afumet = Be"afumet AND Be

afumet = Be"afumet
ela
o
esh = ela'esh AND ela

esh = ela'esh
Other characters:
For (em-dash) in print, type { (the open curly bracket):
AB: The nounnot the verbdetermines person agreement. = The noun{not the verb{determines person agreement.
g:\proceduresdox\bf&accent&fieldcodes.doc updated 11/19/1
In titles whose first character is not the letter to be used for alphabetization, use ^A ("caret-A" or "alpha-marker") to mark where
alphabetization begins:
TI: The Beginning of the End = The ^ABeginning of the End
TI: Random Word Order = ^<"">^ARandom^<''> Word Order
TI: Top und andere neue Wrter = ^<FTBI>^ATop^<RF> und andere neue Wo^<$FUM>rter
WARNING: All accent, field, and BF codes get stripped out of the electronic version of LLBA. Be aware of the consequences of this
for the appearance and accuracy of our records.
g:\proceduresdox\bf&accent&fieldcodes.doc updated 11/19/1
SPECIAL SYMBOLS (BF-CODES)
STAR recognizes only the basic Roman alphabet; all other alphabetic symbols are generated, where needed*, by BF-codes, which
follow a strict format: backslash + BF + 3-place alphanumeric set identifier + 2-place (column, row) alphanumeric symbol codes +
pipe (above the backslash) + "END". See the symbol sets provided.**
= \BFGR1A8A4A6968988|END
Multiple symbols can be produced, if they are of the same set, within one BF-code string by using a string of 2-place alphanumeric
symbol codes following the set identifier and preceding the pipe. Note that BF-codes are enclosed in caret-brackets (^<...>) in the title
field; never appear inside field codes (ie, BF-codes are never superscripted or subscripted, and font formatting codes must be ended
before and started again after BF-codes); are never used in the author or contributor lines; and are never interrupted by other
characters or coding (ie, the BF-code string must be ended before and started again after any accent codes, characters, or BF-codes
from other sets that appear inside a string of special symbols):
= \BFGR1A8A4A696|END\BFPHRA4|END\BFGR18988|END
w = \BFGR1A8A4A696|ENDw\BFGR18988|END
= \BFGR1A8A4A696|END$FAC\BFGR18988|END
anema = <FTRI>an<RF>\BFPHR52|END<FTRI>ema<RF>
(*Except for the title line or in examples that require the use of symbols, transliterate whenever possible to reduce the use of BF-
codes, eg, transliterate Greek unless the abstract is specifically addressing the orthography and its symbols. Note that )
(**The symbol sets PHB and GR2 are bolded versions of PHR and GR1, respectively, and are to be used in titles and general note
lines. Note that PHB and PHR are organized identically, while GR2 and GR1 are not.)
WARNING: All accent, field, and BF codes get stripped out of the electronic version of LLBA. Be aware of the consequences of this
for the appearance and accuracy of our records. For BF codes, the code is translated into a plain text equivalent, so that what appears
as dad in print will appear as dadae online.
A common error with BF codes is the confusion of Greek symbols with phonetic symbols:
A ^<\BFGR288|END>-Theoretical Take on Passives
This will appear just fine in print as
A -Theoretical Take on Passives
but it will be wrong online because Greek codes are transliterated:
A th-Theoretical Take on Passives
This title should be coded as follows:
A ^<\BFPHB99|END>-Theoretical Take on Passives
because the correct symbol would appear in print AND the correct description would appear online:
A (theta)-Theoretical Take on Passives
Another common error with BF codes is the incorrect use of symbols that visually resemble what is seen in print, but are unrelated to
the intended representation:
The V^<\BFM2X66|END> Level
This will appear just fine in print as
The V Level
but it will be wrong online because of the plain text equivalent of the symbol used:
g:\proceduresdox\bf&accent&fieldcodes.doc updated 11/19/1
The V(degrees) Level
This title should be coded as follows:
The V-Zero-Bar Level
because "zero-bar" is the meaning of the "" notation, and there is no appropriate symbol available.
g:\proceduresdox\bf&accent&fieldcodes.doc updated 11/19/1
PHR GR1
1 2 3 4 5 6 7 8 9 A B C D E F 8 9 A B C D E
0 1
1 2
2 3
3 4
4 _ 5
5

6
6 + 7
7 4 8
8 +

9
9 ' !
l
y
A
`
! '
B :
C
D e
PHB GR2
1 2 3 4 5 6 7 8 9 A B C D E F 8 9 A B C D E
0 1
1 2
2 3
3 4
4 _ 5
5

6
6 + 7
7 4 8
8 +

9
9 ' !
l
y D
A
`
! '
B :
C
D e
M1X M2X M3X SPX TRX
4 6 7 8 C D 4 5 6 4 5 7 8 9 C 6
1 ~ 1 @ 6 1 9 #
2 > 5 _ 8 2
5 6 9
6 { B $ B
7 }
8 <
g:\proceduresdox\bf&accent&fieldcodes.doc updated 11/19/1
PLAIN TEXT EQUIVALENTS TO THE BF-CODES
BF-CODE ONLINE TEXT PRINT TEXT COMMENTS
GR181 a
GR182 b
GR183 s
GR184 d
GR185 e
GR186 f
GR187 g
GR188 e
GR189 i
GR191 x
GR192 k
GR193 l
GR194 m
GR195 n
GR196 o
GR197 p
GR198 th
GR199 r
GR1A2 s
GR1A3 t
GR1A4 u
GR1A6 o
GR1A7 kh
GR1A8 ps
GR1A9 z
GR1C1 A
GR1C2 B
GR1C4 D
GR1C5 E
GR1C6 F
GR1C7 G
GR1C8 E
GR1C9 I
GR1D1 X
GR1D2 K
GR1D3 L
GR1D4 M
GR1D5 N
GR1D6 O
GR1D7 P
GR1D8 Th
GR1D9 R
GR1E2 S
GR1E3 T
GR1E4 U
GR1E6 O
GR1E7 Kh
GR1E8 Ps
GR1E9 Z
GR281 a
GR282 b
GR283 g
GR284 d
GR285 e
g:\proceduresdox\bf&accent&fieldcodes.doc updated 11/19/1
GR286 z
GR287 e
GR288 th
GR289 i
GR291 k
GR292 l
GR293 m
GR294 n
GR295 x
GR296 o
GR297 p
GR298 r
GR2A2 s
GR2A3 t
GR2A4 u
GR2A5 f
GR2A6 kh
GR2A7 ps
GR2A8 o
GR2C1 A
GR2C2 B
GR2C3 G
GR2C4 D
GR2C5 E
GR2C6 Z
GR2C7 E
GR2C8 Th
GR2C9 I
GR2D1 K
GR2D2 L
GR2D3 M
GR2D4 N
GR2D5 X
GR2D6 O
GR2D7 P
GR2D8 R
GR2D9 S
GR2E2 T
GR2E3 U
GR2E4 F
GR2E5 Kh
GR2E6 Ps
GR2E7 O
GR2ED s
M1X41 ~ ~
M1X66 { {
M1X77 } }
M1X81 -->
M1X82 +/-
M1XC8 < <
M1XD2 > >
M1XD5 X
M2X41 @ @
M2X4B $ $
M2X55 _ _
M2X66 (degrees) do not use for "zero-
bar" in syntax
g:\proceduresdox\bf&accent&fieldcodes.doc updated 11/19/1
M3X46
M3X48
M3X56 .
PHB11 (soft yer)

do not use for
transliterating
PHB12 (hard yer)

do not use for
transliterating
PHB19 (rising falling tone)
%
PHB1A (falling rising tone)
` `` `
&
PHB20 (extra high tone)

PHB21 (high tone)

PHB22 (mid tone)

PHB23 (low tone)

PHB24 (extra low tone)

PHB25 (rising tone)
'
PHB26 (falling tone)
(
PHB27 (high rising tone)
)
PHB28 (low falling tone)
*
PHB29 (low rising tone)
&
PHB2A (high falling tone)
%
PHB2B (length mark)
+
PHB2C (half length mark)
,
PHB2D (root)

PHB31 (alpha)

PHB32 (lambda)

PHB33 (pi)

PHB36 (l-yogh ligature)

PHB39 (hooktop k)

PHB40 (reversed epsilon)

PHB41 (epsilon)

PHB42 (turned y)

PHB43 (esh)

PHB44 ae

PHB45 (schwa)

PHB46 (script a)

PHB47 (right-tail l)

PHB48 AE

PHB4A OE

PHB4C (Iota)

PHB51 (sigma)

PHB52 (thorn)

PHB54 (chi)
-
PHB55 (dotless j)

PHB56 (turned ?)

PHB58 (left-tail o)

PHB5A (script v)

g:\proceduresdox\bf&accent&fieldcodes.doc updated 11/19/1
PHB5B (turned a)

PHB5D (dotless i)

PHB67 (hooktop p)

PHB6D (open o-e ligature)
e e e e
PHB70 (turned small capital U)

PHB72 (reversed e)

PHB73 (closed epsilon)

PHB76 (closed reversed epsilon)

PHB78 (right-tail curvy turned h)


PHB79 (curvy turned h)

l ll l
PHB7A (hooktop g)

PHB7B (pipe)

PHB7C (curly-tail j)

PHB82 (long-leg n)

PHB83 (bent-tail yogh)

PHB84 (right-tail s)

PHB85 (right-tail z)

PHB86 (crossed h)

PHB87 (curly-tail z)

PHB89 (gamma)
.
PHB8A (baby gamma)

mid-back-unrounded
vowel
PHB8B (double pipe)

PHB8C (double-barred pipe)

PHB8D (turned t)

PHB92 (ram's horns)

PHB93 (phi)

PHB94 (beta)

not the same as the
double-s in German
PHB95 (right-hook schwa)

PHB97 (eng)

PHB98 (turned v)

PHB99 (theta)

PHB9A (barred o)

PHBA0 (bilabial click)

PHBA1 (turned k)

PHBA2 (yogh)

PHBA3 (left-hook n)

PHBA4 (eth)

PHBA6 (turned w)

PHBA7 (omega)

PHBA8 (small capital omega)

PHBA9 (open o)

PHBAA (glottal stop)

PHBAC (hooktop q)

g:\proceduresdox\bf&accent&fieldcodes.doc updated 11/19/1
PHBB4 (hooktop heng)

PHBB7 (right-tail turned r)

PHBBC (long-leg r)

PHBC1 (turned long-leg r)

PHBC2 (right-tail q)

PHBC3 (hooktop h)

PHBC4 (belted l)

PHBC5 (right-tail d)

PHBC6 (right-tail t)

PHBC9 (closed omega)

PHBD0 (upsilon)

PHBD3 (stretched c)

PHBD4 (inverted glottal stop)

PHBD5 (long-leg turned m)

PHBD6 (curly-tail esh)

PHBD7 (curly-tail yogh)

PHBD8 (hooktop b)

PHBD9 (hooktop d)

PHBDA (hooktop barred dotless j)

PHBDC (t-esh ligature)

PHBE0 (barred reversed glottal
stop)

PHBE1 (barred glottal stop)

PHBE2 (right-tail n)

PHBE3 (glottal stop)

PHBE4 (reversed glottal stop)

PHBE6 (turned m)

PHBE7 (curly-tail c)

PHBE8 (fish-hook r)

PHBE9 (right-tail r)

PHBF0 (left-tail m)

PHBF1 (turned h)

PHBF2 (barred u)

PHBF3 (slashed o)

PHBF4 oe

PHBF5 (turned script a)

PHBF6 (turned r)

PHBF8 (inverted capital R)

PHBF9 (barred dotless j)

PHR11 (soft yer)

do not use for
transliterating
PHR12 (hard yer)

do not use for
transliterating
PHR19 (rising falling tone)
'
PHR1A (falling rising tone)
`
!
g:\proceduresdox\bf&accent&fieldcodes.doc updated 11/19/1
PHR20 (extra high tone)

PHR21 (high tone)

PHR22 (mid tone)

PHR23 (low tone)

PHR24 (extra low tone)

PHR25 (rising tone)

PHR26 (falling tone)
+
PHR27 (high rising tone)
4
PHR28 (low falling tone)
+
PHR29 (low rising tone)
!
PHR2A (high falling tone)
'
PHR2B (length mark)
:
PHR2C (half length mark)

PHR2D (root)

PHR31 (alpha)

PHR32 (lambda)

PHR33 (pi)

PHR36 (l-yogh ligature)

PHR39 (hooktop k)

PHR40 (reversed epsilon)

PHR41 (epsilon)

PHR42 (turned y)

PHR43 (esh)

PHR44 ae

PHR45 (schwa)

PHR46 (script a)

PHR47 (right-tail l)

PHR48 AE

PHR4A OE

PHR4C (Iota)

PHR51 (sigma)

PHR52 (thorn)

PHR54 (chi)
_
PHR55 (dotless j)

PHR56 (turned ?)

PHR58 (left-tail o)

PHR5A (script v)

PHR5B (turned a)

PHR5D (dotless i)

PHR67 (hooktop p)

PHR6D (open o-e ligature)
e
PHR70 (turned small capital U)

PHR72 (reversed e)

PHR73 (closed epsilon)

g:\proceduresdox\bf&accent&fieldcodes.doc updated 11/19/1
PHR76 (closed reversed epsilon)

PHR78 (right-tail curvy turned h)

PHR79 (curvy turned h)



l
PHR7A (hooktop g)

PHR7B (pipe)

PHR7C (curly-tail j)

PHR82 (long-leg n)

PHR83 (bent-tail yogh)

PHR84 (right-tail s)

PHR85 (right-tail z)

PHR86 (crossed h)

PHR87 (curly-tail z)

PHR89 (gamma)
y
PHR8A (baby gamma)

mid-back-unrounded
vowel
PHR8B (double pipe)

PHR8C (double-barred pipe)

PHR8D (turned t)

PHR92 (ram's horns)

PHR93 (phi)

PHR94 (beta)

not the same as the


double-s in German
PHR95 (right-hook schwa)

PHR97 (eng)

PHR98 (turned v)

PHR99 (theta)

PHR9A (barred o)

PHRA0 (bilabial click)

PHRA1 (turned k)

PHRA2 (yogh)

PHRA3 (left-hook n)

PHRA4 (eth)

PHRA6 (turned w)

PHRA7 (omega)

PHRA8 (small capital omega)

PHRA9 (open o)

PHRAA (glottal stop)

PHRAC (hooktop q)

PHRB4 (hooktop heng)

PHRB7 (right-tail turned r)

PHRBC (long-leg r)

PHRC1 (turned long-leg r)

PHRC2 (right-tail q)

PHRC3 (hooktop h)

PHRC4 (belted l)

g:\proceduresdox\bf&accent&fieldcodes.doc updated 11/19/1
PHRC5 (right-tail d)

PHRC6 (right-tail t)

PHRC9 (closed omega)

PHRD0 (upsilon)

PHRD3 (stretched c)

PHRD4 (inverted glottal stop)

PHRD5 (long-leg turned m)

PHRD6 (curly-tail esh)

PHRD7 (curly-tail yogh)

PHRD8 (hooktop b)

PHRD9 (hooktop d)

PHRDA (hooktop barred dotless j)

PHRDC (t-esh ligature)

PHRE0 (barred reversed glottal
stop)

PHRE1 (barred glottal stop)

PHRE2 (right-tail n)

PHRE3 (glottal stop)

PHRE4 (reversed glottal stop)

PHRE6 (turned m)

PHRE7 (curly-tail c)

PHRE8 (fish-hook r)

PHRE9 (right-tail r)

PHRF0 (left-tail m)

PHRF1 (turned h)

PHRF2 (barred u)

PHRF3 (slashed o)

PHRF4 oe

PHRF5 (turned script a)



PHRF6 (turned r)

PHRF8 (inverted capital R)

PHRF9 (barred dotless j)

SPX7B <--
SPX89 <
SPX92 >

SPXC1 <--->
TRX69 # #

You might also like