Professional Documents
Culture Documents
k
i=1
v
i
log
z
(
v
i
). H(V) is zeio if theie is a maximum bias towaids a single level, e.g. with
v
1
= 1,
v
z
= c, and
v
I
= c, H(V) ieaches its maximumin unifoimdistiibutions, e.g. with
v
1
=
1
I
,
v
z
=
1
I
, and
v
I
=
1
I
.
l estimate using the Maximum Likelihood method, i.e. fiom the empiiical fiequencies.
Tis has been noted by Bickel & Nichols (zcc) foi the co-exponence of othei categoiies as well.
DRAFT Maich 11, zc11
1c Distributional biases
the development in eithei diiection has a veiy low piobability (as 1b would piedict). loi othei
vaiiables in the iange .c1 .1c, the pictuie is similai to what is illustiated by Table 1 they
tend to be heavily biased and ieect a rara vs. universalia distiibution. Such biases aie likely
to iesult fiom stiong aieal diusion oi univeisal piessuie so stiong in fact that the ielevant
choice is likely to establish itself veiy quickly, and that once the choice is made, languages
iefiain fiom undoing it and families look almost completely homogenous. Tis ieects the
scenaiio hypothesized in (1a) and is not consistent with (1b). Tus, iathei than suggesting
faithful ieplication, extiemely low numbeis of known changes seem to point to veiy stiong
eects of some diiving factoi (pace Paikvall zccs, Wichmann & Holman zccv, oi Bakkei et al.
zccv).
Vaiiable Changes Oppoitunities lntiopy Ratio
(and data souice) min(C
F
) min(O
F
) of values
lnteiiog./decl. distinction (Diyei zccc) 1 sv c.c1 s111
lndep. subject pionouns (Daniel zcc) c I1 c.c zsz
Tonal case (~U1o1vv and Diyei zccd) I v1 c.c evse
Stem exivity condit. by NlG (~U1o1vv) c 1c c.1z 11111
Have-peifect (Dahl & Velupillai zcc) 1 1 c.I 1c1
Co-exponent type of NlG (~U1o1vv) 1 zI1 c.ec 1sI1111111
Table 1 Vaiiables foi which the minimum numbei of changes does not exceed what is
expected undei .c1 (in incieasing oidei of entiopy)
Teie is one fuithei piece of evidence against Hypothesis (1b) alieady with .c and cei-
tainly with .1c, it is viitually impossible foi typological distiibutions to peisist ovei deep
time in such a way that what one obseives now is similai to what was theie many geneiations
ago. Tis can be shown by computei simulations. l set up datasets with 1,Icc fake languages
(appioximating the size of the laigest available ieal databases) with fake codings foi a binaiy
typological vaiiable. Te codings iepiesent distiibutions ianging fiom 1vv to zcsc to
1cec. lach such distiibution was then sent thiough a numbei of geneiations. ln each gen-
eiation theie was a ceitain piobability thieshold (ianging fiom .c1 to .1c, at inciements
of .c1) below which a iandom subset of languages would change fiom one state to the othei,
with no piefeiied diiection of change. Choosing iandom subsets below iathei than at is
motivated by the assumption that language change is constiained by maximum piobabilities
but does not opeiate at a constant iate, in addition, the method favois Hypothesis (1b) since
change does not always opeiate at full speed as it weie. Afei the distiibutions went thiough
all geneiations, l tested whethei the initial distiibution was still detectable using a two-sided
binomial test. Tis pioceduie was iepeated 1,ccc times, allowing to compute the piopoition of
simulations in which the oiiginal distiibution was still detectable, and fiom this an estimate of
the oveiall piobability of successful detection.
liguie Ia iepoits the iesults foi 1cc geneiations, and liguie Ib foi c geneiations. lf we
assume an aveiage lifespan of languages of about 1,ccc yeais, 1cc geneiations ieect a low
estimate of the age of human language, i.e. a time when majoi innovations that aie likely to
depend on language use, such as oinamentation, pigment piocessing, and long-distance tiading,
DRAFT Maich 11, zc11
Distributional biases 11
become well auested in the aicheological iecoid (McBieaity & Biooks zccc). c geneiations
ieects an uniealistically low estimate, viz. a time when modein symbolic behavioi has spiead
even well outside Afiica.
0.01 0.02 0.03 0.04 0.05 0.06 0.07 0.08 0.09 0.10
0
.
0
0
.
2
0
.
4
0
.
6
0
.
8
1
.
0
Maximum probability of random change
P
r
o
p
o
r
t
i
o
n
d
i
s
t
r
i
b
u
t
i
o
n
s
(a) Afei 1cc geneiations of change
0.01 0.02 0.03 0.04 0.05 0.06 0.07 0.08 0.09 0.10
0
.
0
0
.
2
0
.
4
0
.
6
0
.
8
1
.
0
Maximum probability of random change
(b) Afei c geneiations of change
liguie I Piopoition of 1,ccc simulated distiibutions that aie still detected afei 1cc (a) oi c
(b) geneiations of iandom change at given maximum piobabilities (solid line initial
distiibution 1vv, dashed line zcsc, doued line 1cec, giey line .c piobability
thieshold of detecting a an initial distiibution)
Te ndings suggest that alieady at .c, the piobability of detecting even the most
heavily biased initial distiibution (1vv, ploued as a solid line) staits to no longei exceed
.c afei 1cc geneiations (as indicated by the giey hoiizontal line in the guie), even undei
the shoitei scenaiio of c geneiations, the most heavily biased distiibution ieaches the .c
piobability thieshold just befoie .1c, the level that was noted above as the minimumat which
an appieciable piopoition of vaiiables (about z) begins to show numbeis of change that aie
statistically expected. Tus, even undei an uniealistically shoit life span of human language,
the minimum values of that begin to be consistent with the numbei of known changes is fai
too high foi allowing the long-time peisistence of typological distiibutions iequiied by (1b). ln
othei woids, iealistic values of aie so high that no synchionic distiibution can be accounted
foi by faithful ieplication ovei deep time.
To summaiize, theie aie thiee pieces of evidence against the hypothesis of extiemely low
piobabilities of change in typological vaiiables (i.e. Hypothesis 1b) ist, we tend to nd many
moie cases of change than what extiemely low piobabilities of change would lead us to expect.
Second, those few vaiiables foi which extiemely low piobabilities of change would in piin-
ciple match the obseived numbei of changes, tend to display an extieme rara vs. universalia
distiibution, and this ts beuei with veiy unequal piobabilities of change (1a) than with equal
piobabilities (1b). Tiid, even if piobabilities of change weie as low as .1c, they would still be
too high foi typological distiibutions to iemain stable ovei the entiie lifespan of the human
DRAFT Maich 11, zc11
1z Distributional biases
language faculty.
Taken togethei, these thiee pieces of evidence make the hypothesis in (1b) an unlikely ex-
planation of diiectional family biases. Tis suppoits the alteinative in (1a) and theieby the
coie claim of the lamily Bias Teoiy that diiectional family biases ieect the piessuie of some
diiving factoi and that high degiees of genealogical stability can only explain non-diiectional
family bias but not also diiectional family bias.
5 e problem of small families
Te lamily Bias Teoiy piovides a systematic diachionic inteipietation of synchionic typo-
logical distiibutions. Howevei, like all diachionic inteipietations it has a natuial limit when
confionted with small families it is dicult to estimate a bias oi ieconstiuct foims if one
knows only, say, two oi thiee membeis. lt becomes almost impossible if one only knows a sin-
gle membei. Tis pioblemis substantial because even foi laige databases, such as the genealogy
databases in ~U1o1vv (N z,esc, Nichols & Bickel zccv) oi in the World Atlas of Language Struc-
tures (N z,11, Haspelmath et al. zcc), about half of the stocks aie iepiesented by only one
membei (and this is so iegaidless of which of the two taxonomies is applied and even when one
excludes, as l do heie, cieoles and sign languages, which could all be analyzed as single-membei
families in theii own iight since they iepiesent the biith of new families).
Te pioblem has both epistemological and statistical consequences. lpistemologically, the
pioblem has the eect that many vaiiables of typological inteiest cannot ieally be investigated
when these vaiiables happen to be best iepiesented in less-well documented oi isolated families.
Statistically, the pioblem is one of powei in detecting signals any limitation to laige families
seveiely ieduces the size of datasets, and this has the eect that statistical tests do no longei
have enough powei to detect signals.
ln some sense, one could say that these aie just the ieal limits on what one can possibly
know about diachiony and the histoiical foices shaping typological distiibutions. Howevei, to
the extent that one can tiust the iesults fiom examining biases in laige families, it is possible to
extiapolate fiom laige to small families i.e., foi concieteness sake, fiom families with at least
membeis (cf. Section I) to families with less than membeis. Tis iequiies two assumptions
(1) a. Normal Diarony Assumption:
Te membeis of small families aie the sole suivivois of laigei families.
b. Uniform Development Assumption:
Unknown families aie subject to the same developmental piinciples as known fam-
ilies.
Te iationale behind the Noimal Diachiony Assumption is the following if we dont know
what othei languages a language oi a small gioup of languages is ielated to, i.e. if we aie
dealing with an isolate oi a small gioup, this is only an epistemological issue, not an ontological
one (a point also emphasized by Maslova (zcccb)). Ontologically, isolates and small gioups
aie still membeis of laigei families, its only that we dont know them because they became
extinguished, in most cases because theii speakeis shifed to othei, unielated languages. Tis
assumption is motivated by the fact that just like any othei language, isolates and small families
DRAFT Maich 11, zc11
Distributional biases 1I
must come fiom somewheie, i.e. they aie the iesult of noimal diachionic tiansmission (in the
sense of Tomason & Kaufman 1vss). Obviously, the assumption does not hold foi most cieoles
and sign languages because they did not aiise fiom noimal diachionic tiansmission, and it
would indeed be incoiiect to extiapolate insights fiom known laige families to the genesis of
cieoles and sign languages. (As a iesult, cieoles and sign languages piovide a dieient window
on univeisal piessuie shaping language than languages with a noimal diachiony behind them
and ieseaich on this iequiies othei methods than what l discuss heie.)
Te Unifoim Development Assumption assumption is again based on the insight that the
status of languages as isolates oi membeis of small gioups is an epistemological and not an
ontological fact the extent to which we know genealogical ielationship of a gioup has no
piincipled consequences on the kinds of diachionic developments that the gioup went thiough.
loi example, just because we dont know any sistei languages of Basque does not mean that
the kind of diachionic piocesses that iesulted in modein Basque was iadically dieient fiom
the kind of piocesses that iesulted in the development of Hindi fiom Pioto-lndo-luiopean
we expect the same kind of complex mix of spontaneous (iandom) change, contact eects and
univeisal piessuie.
Taken togethei, the two assumptions in (1) allow us to extiapolate fiom laige to small fam-
ilies. loi this, we ist compute the piopoition of biased vs. diveise families in a suivey of laige
families and then use this piopoition as an estimate of the extent to which small families aie
biased. loi example, in the suivey of A-befoie-P oideis in Section I we found that ev laige
families aie biased in some way and I1 aie diveise. Based on (1), we can now make the as-
sumption that the extent to which families diveisify theii oideiing of A and P aiguments, and,
conveisely, the extent to which families keep whatevei oidei they have, does not only hold foi
laige families but also foi small families. ln othei woids, we assume that oui extiemely ieduced
knowledge of Basques ancestiy has no consequences on howBasque developed (viz. by noimal
diachionic tiansmission, as pei 1a) and to what extent the language was aected by univeisal
piessuie in language change (as pei 1b). Teiefoie, we assume that about c of small families
aie the sole suivivois of laige families with a bias and about Ic of small families aie the sole
suivivois of laige families without a bias, i.e. to be diveise.
Teie is one piobabilistic detail that we need to take caie of befoie pioceeding fuithei,
howevei if we happen to nd 1cc laige families to be biased (in whatevei diiection), it would
not be coiiect to estimate a piobability of 1 that small families aie biased as well, i.e. theie
cannot be absolute ceitainty that all small families aie biased and it is always possible that
they iepiesent laigei diveise families. A well-established way of avoiding this is by estimating
piobabilities using Laplaces Rule of Succession if k out of n laige families aie biased, we
estimate the piobability of small families to be biased as
k+1
n+z
. ln oui example of A-befoie-P
oideis, this would be
11+1
v+z
.esv. Tis is veiy close to the estimate based on the iawpiopoitions
(.ev) but foi smallei samples, the dieience can be moie substantial if we had obseived 1c
out 1c families, the estimate would not be (biased) 1, but (biased)
11
1z
.vz. Te key idea
behind the foimula is this the a priori assumption that families can in piinciple be eithei biased
oi diveise is equivalent to having obseived one biased and one diveise family, and these as if
DRAFT Maich 11, zc11
11 Distributional biases
obseivations aie added to the obseived fiequencies.
Using the estimated piobability of being biased, we then iandomly declaie a coiiesponding
piopoition of small families to be the solve suivivois of biased families. ln oui example of A-
befoie-P oideiing, we would declaie a iandom selection of ev small families to be biased. Te
iemaining small families (I1) aie declaied to be the sole suivivois of diveise laigei families.
Howevei, by viitue of being statistical estimates, biases aie giadual and allow deviations foi
example, one of the laige families in the suivey, Austionesian, is signicantly biased towaids
A-befoie-P oideiing. Despite this bias, zv out of the 11 (zc) iepiesentatives of the family in
the database deviate fiom this, paitly by having VPA (VOS) oidei (such as Kiiibatese), paitly
by having vaiiable woid oidei (such as Acehnese). When assuming that a small family is the
sole suivivoi of a biased family, the question theiefoie aiises to what extent the small gioup
we know iepiesents the oveiall bias of the family, oi deviates fiom this tiend, just like the zc
of Austionesian languages that deviate fiom the oveiall tiend in Austionesian.
ln iesponse to this, we ist estimate the piobability that a small gioup iepiesents the family
bias fiom the extent to which the bias is found in laige families. As noted, in Austionesian this
extent is .sc, in othei laige families (e.g. Diavidian) it is 1. Laige families can of couise be biased
in the opposite diiection. ln oui example, we obseived this foi Algic and lioquoian, and heie
the bias (against A-befoie-P) is in each case complete (i.e. 1) in the database. Taken all these bias
estimates togethei suggests that, on aveiage, if a laige family is biased on the aigument oidei
vaiiable in whichevei diiection (A-befoie-P oi the opposite), it is so biased to v., and theie
aie on aveiage 1. deviates inside the family. (Austionesian, with as many as zc deviates, is
theiefoie quite exceptional.) When estimating the piobability having a bias vs. being diveise,
we coiiected these estimates by the Laplace Rule of Succession because a priori it is always
possible foi families to be biased to some degiee oi to be diveise. loi estimating the deviation
piobability, howevei, l suggest to iely on the baie piopoitions, i.e. if all biased families aie
completely biased, with no deviations, l suggest to assume a geneial deviation piobability of c.
Te ieason is as follows. Postulating deviations is the same as postulating instances of language
change (unlike postulating a bias, which may oi may not imply language change, depending
on the extent of the bias). Now, fiom geneial paisimony constiaints on histoiical linguistics
(Occams Razoi), it follows that one postulates language change only in the piesence of positive
evidence. Teiefoie, a piioii i.e. unless theie is any evidence to the contiaiy we assume
that an isolated language oi small language gioup iepiesents its ancestois faithfully, with no
change, no deviation.
Given these consideiations, we can estimate the piobability to which the membeis of a
what we estimate is a biased small family iepiesent indeed the family bias (heie, v.) and
the piobability to which membeis aie likely to be deviating exceptions (heie, 1.). Based on
this, we iandomly declaie some piopoition of the estimated biased families to be obseived
with iepiesentative membeis and some piopoition to be obseived with deviating membeis. ln
those small families wheie membeis aie estimated to iepiesent theii family bias, we declaie the
lt is a mauei of fuithei ieseaich to establish whethei
1
z
is indeed an appiopiiate paiametei value of the a priori
bias piobability heie.
Technically, this is done via a iandomly geneiated binomial distiibution with the estimated bias piobability.
DRAFT Maich 11, zc11
Distributional biases 1
family to biased towaids whatevei happens to be its sole type oi what appeais to be its most
likely type given the geneial bias estimate and the fiequency distiibution within the family.
loi example, a small family will be declaied to be biased towaids A-befoie-P oidei if all oi
most of the small gioup have A-befoie-P oidei, if theie is a tie (e.g. two languages with A-
befoie-P and two languages with othei oideis), we iandomly pick one as iepiesentative. ln
the small families wheie membeis aie estimated to iepiesent deviating exceptions, we declaie
them as suivivois of a family that had a bias in an alteinative diiection (iandomly chosen but
weighted by the piobability of diiections given by the geneial bias estimate and the fiequency
distiibution within the family). loi example, if all oi most languages in the small family have
A-befoie-P oidei (oi if indeed theie is only a single language and it happens to have A-befoie-P
oidei), we estimate that these languages come fiom a laigei family with the opposite bias (i.e.
no A-befoie-P oidei), if theie is a tie, we again iandomly select one of them.
ln the oveiall extiapolation piocess theie aie thiee situation wheie we make iandom selec-
tions ist, when declaiing a piopoition of small families to be the sole suivivoi of diveise vs.
biased families, second, when bieaking ties foi deteimining what kind of bias a small biased
family iepiesents, and thiid, when assigning an alteinative type to those small biased families
that we estimate as iepiesenting deviating exceptions within laigei families. Tese iandom
choices induce statistical eiioi but because the eiioi is iandom, it can be assumed to be noi-
mally distiibuted. Teiefoie, we can peifoim the extiapolations with all iandom selections
many times (say, zccc oi 1c,ccc times) and then compute the mean of all extiapolation iesults.
loi example, a single extiapolation might suggest 11 small families with an A-befoie-P
bias, zv with the opposite bias and ev to be diveise, the next extiapolation might suggest 11e
cases of A-befoie-P bias, I1 opposite biases and e diveise families etc. lf we take the mean
of these fiequencies ovei zccc extiapolations, we aiiive at estimated fiequencies of 11.c A-
befoie-P bias, II.cv opposite bias and ee.sI diveise. Added to the estimates fiom laige families,
this iesults in an oveiall estimate of 11.c A-befoie-P biases, I.cv opposite biases and s1.sI
diveise. Tis conims the iesult fiom Section I that theie is a signicant tiend foi families to
be biased towaids A-befoie-P oidei as against P-befoie-A oi fiee oideis (exact binomial test, p
.cc1, .sz).
ln Section z l dened the stiength of the univeisal piessuie by the piopoition of biased as
opposed to diveise families. Since when extiapolating to small families, we use this piopoition
foi estimating to what extent small families aie the sole suivivois of families with a bias, we
can no longei ie-compute this piopoition fiom the extiapolation iesults. ln othei woids, as fai
as l can see, estimates of the stiength of univeisals can only be taken fiom laige families. Te
estimate of the stiength can theiefoie be dened as the Laplace estimatoi of biases discussed
above, i.e.
() s =
k + 1
n + z
,
wheie k is the numbei of biased families out of a total of n families. Note that because of this
equality, extiapolations will not be of help when testing foi what l called Scenaiio B (non-
A ieady-to-use function foi computing family biases in this way is available in an R package wiiuen by Taias
Zakhaiko at http://www.uni-leipzig.de/~autotyp/familybias.R.
DRAFT Maich 11, zc11
1e Distributional biases
diiectional family bias) in Section z the piopoition of biased families is by denition the same
befoie and afei the extiapolation. What can usefully be done, howevei, is to examine whethei
the bias is still undiiected afei extiapolation to small families.
Applying the extiapolation method to the non-diiectional family bias of piedicative adjec-
tive encoding (cf. Section I) suggests that this is the case. ln Section I we found no statistical
piefeience foi any type. Afei extiapolation, we can estimate I1.eI families to be biased to-
waids veibal, Ic.1s towaids nonveibal and z.ev towaids mixed encoding. Tese fiequency
estimates aie still not signicantly dieient fiom what one would expect undei a unifoim (
each) distiibution (
z
1.I1, p .1). Tis suggests that the encoding of piopeity teims in
piedicative function is a genealogically stable piopeity, not subject to any known univeisal oi
laige-scale aieal piessuie.
ln the two examples ieviewed so fai, the iesults of signicance tests weie not aected by the
numbei of datapoints befoie vs. afei extiapolation to isolates and small families. Howevei, this
can be quite dieient because (a) isolates and small families may biing in ciitical evidence and
(b) because a statistical test may only be able to detect a signal if the dataset ieaches a ceitain
minimal size. Tis can be exemplied by examining the distiibution of (some kind of) gendei
in independent pionouns, based on Siewieiskas (zcc) WALS dataset (N Is1, afei adding a
few data to which l had easy access in oidei to inciease the numbei of stocks with moie than
ve membeis'). Without extiapolations, the data fiom laige families suggests non-diiectional
family bias of 1 laige families, 1z have a bias, of these, 1 aie biased towaids having gendei
and s against. A piopoition of 1z families out of 1 to be biased is boideiline signicant undei
a binomial test (p .c, .1). But the 1s iatio in the diiection of the bias does not suggest a
signicant piefeience (p .1v1, .ee), and so the data seemto suggest Scenaiio B fiomSection
z daughtei languages seem to maintain whatevei the pioto-language was like if it had gendei,
gendei is pieseived, if it didnt have gendei, it doesnt develop it. Howevei, the absence of a
statistical signal could also just ieect the fact that the total numbei of laige families is veiy
small (N s) and statistical tests dont have enough powei to detect tiends. ln addition, theie
could be a possible diiection in the bias specically among small families and isolates. To nd
out, we can use the piopoition of biases among laige families (1z out of 1) and extiapolate to
small families and isolates.
Te mean extiapolations suggests that 1z.z1 (z1) families aie biased towaids and sc.
(1) against having gendei distinctions in independent pionouns. Tis dieience matches the
1s iatio among the laige families, but the laigei numbei now allows detecting a statistically
signicant signal (p .cc1, .ee). Te iesult seems to suggest a univeisal bias against gendei
in pionouns.
Howevei, in this case the extiapolations aie based on only 1e families, and the iesult should
not be taken as establishing a woildwide tiend against gendei. Te only way to put the iesults
on imei giounds is to develop databases that collect moie datapoints pei family and theieby
puisue a data-collection stiategy that is the exact opposite of how most typologists have col-
lected data in the past.
At any iate, as tentative as they aie, the iesults on pionominal gendei t with Nicholss
' l added Tobelo, Galela, and Somali as languages with pionominal gendei.
DRAFT Maich 11, zc11
Distributional biases 1
(1vvz, zccI) hypothesis that gendei in geneial is disfavoied univeisally. lt does not seem to be
paiticulaily pione to inheiitance unless theie is suppoit fiom neighboiing families that have
gendei (like in luiope oi Afiica). Suppoit fiom neighboiing languages is an issue of aieal
conditions deteimining family biases, which is one of the topics in the following section.
6 Extending the Family Bias approa to multivariate distributions
ln the pieceding l have limited my auention to the distiibution of a single vaiiable. But the
distiibution of one vaiiable may be conditioned by othei vaiiables, foi example othei stiuctuial
vaiiables (such as woid oidei) oi aieal oi social vaiiables (such as Spiachbund membeiship oi
the piesence of ceitain kinship systems).'' l subsume all kinds of conditional eects undei the
teim conditional piessuie.
Just like in univaiiate designs, conditional piessuie can be stiong oi weak, and the lowei
bounds of this stiength can be estimated by the piopoition of biased vs. diveise families. Unlike
in univaiiate designs, howevei, these stiengths need not be unifoim but can diei acioss condi-
tions. loi stiuctuial factois, piessuie stiength can be expected to be unifoim acioss conditions
in the case of bi-diiectional univeisals. An example is the classical hypothesis that OV stiuc-
tuies favoi postpositions and, conveisely, that VO stiuctuies favoi piepositions (Gieenbeig
1veI, Diyei 1vvz). ln the case of uni-diiectional univeisals (e.g. post-nominal ielative clauses
undei non-veib-nal woid oidei conditions, but no tiend towaids pie-nominal ielative clauses
undei veib-nal conditions), one expects stiong piessuie in one condition (heie, undei the
non-veib-nal condition) but undei the othei condition, theie can be many diveise families, oi
families can be biased in iandomways. Te univeisals is suppoited as long as the tiend towaids
biases is stiongei undei one than undei the othei condition.
loi aieal factois, such dieiences aie in fact expected when compaiing the distiibution of
featuies inside vs. outside an aiea, one expects stiong piessuie towaids some featuie F (less
diveisity) inside the aiea but only weak piessuie against F (moie diveisity) outside the aiea.
Aieal diusion leads to the widespiead adoption of F, iesulting in an incieased fiequency of
F. Tis is in contiast to the woild outside the aiea, wheie nothing is suspected to aect the
distiibution of F it can tend to be diveise within families oi families can be biased in iandom
ways. All that maueis foi aieality is that theie is signicantly highei piopoition of families
with an F-bias inside than outside the aiea.
Testing bivaiiate hypotheses like these is complicated by the fact that the ielevant condi-
tion may not hold foi entiie families foi example, stocks like lndo-luiopean oi Sino-Tibetan
contain both VO and OV bianches. A solution to this pioblem comes fiom the fact that the
lamily Bias Teoiy makes no assumptions about the taxonomic level oi time-depth at which
biases can be found, it is not even iequiied that biases aie always found on the same taxonomic
level acioss families (cf. Section z). Tis suggests that family biases can be estimated in what-
evei is the highest taxonomic level at which subgioups aie not split with iegaid to the ielevant
condition. ln lndo-luiopean foi example, one can estimate biases within OV and VO bianches.
'' oi many such vaiiables togethei and inteiacting with each othei. Heie l concentiate on simple cases. loi a
discussion of inteiactions between conditioning vaiiables, see Bickel (zccsa) and Cysouw (zc1ca).
DRAFT Maich 11, zc11
1s Distributional biases
Teie is one fuithei complication, though given the ofen sketchy knowledge that is avail-
able on subgiouping, it is ofen impossible to nd plausible subgioups, oi, even if the taxonomy
is well established, subgioups may be diveise with iegaid to some condition of inteiest. ln both
these cases, l piopose to posit pseudo-gioups, based on the dieience in the ielevant condi-
tions, e.g. an OV pseudo-gioup vs. a VO pseudo-gioup. lmpoitantly, these pseudo-gioups aie
posited solely foi the puiposes of testing whethei dieiences in the condition have an eect on
family biases. Tey aie not evidence foi ieal subgioups because changes in typological piopei-
ties can be due to factois that aie entiiely dieient fiom the kind of aibitiaiy and idiosynciatic
innovations that dene genealogical tiees. Howevei, since some change must have split the
family, it is a legitimate isogloss foi testing puiposes undei the lamily Bias Teoiy, the ques-
tion is only whethei the isogloss is associated with dieient iesponses to such an extent that
the pseudo-gioups aie now biased in a piedictable diiection.
ln the following l ist exemplify this appioach with hypotheses on stiuctuial and then on
aieal factois.
6.1 Example 1: relative clause position and word order
Te ist example conceins the hypothesis that the odds foi ielative clauses to be post-nominal
aie highei undei non-veib-nal than undei veib-nal conditions (Gieenbeig 1veI, Diyei 1vvz,
Hawkins 1vv1, among otheis). To examine this hypothesis, l combined Diyeis (zcca) v~is
dataset on ielative clause position with his dataset on dominant main clause veib positions,
excluding languages with exible oideis (Diyei zccb), but adding moie ciitical data on Sinitic
(Yue zccI). Both small and laige families can be homogenous oi split on the ielevant condition,
i.e. can contain both non-veib-nal and veib-nal languages. ln the dataset (N 1I languages),
theie aie zz laige stocks (i.e. with at least membeis) and ve small families. Of the zz stocks,
11 (oi e1) aie homogeneously veib-nal oi non-veib-nal. ln 1 (1s) stocks (lndo-luiopean,
Sino-Tibetan, Cushitic, and Austioasiatic), homogenous bianches can be found at the majoi
bianch level.
ln some cases, deteimining family biases at lowei levels leaves small families oi even single
languages stianded as the sole iepiesentatives of theii bianch, e.g. Albanian and Gieek in lndo-
luiopean, oi Lolo-Buimese (with I iepiesentatives), two Naga languages and few otheis in
Sino-Tibetan. ln some cases, an entiie bianch ends up with single-membei gioups the Westein
Oceanic gioup of Austionesian, foi example, is iepiesented in the database by Tolai and Tawala.
Since the two languages diei in basic woid oidei, they aie assumed heie to iepiesent theii own
single-membei gioups.
ln I of the zz stocks (1I), homogenous gioups can be found only by positing pseudo-
subgioups. One example is Uto-Aztecan. While the Noithein bianch is homogeneously veib-
nal, and the Aztecan gioup of the Southein bianch is homogeneously non-veib-nal, the
Sonoian gioup of the Southein bianch is mixed. Teie aie two non-veib-nal and ve veib-
nal languages and the distinction does not match any subgiouping iepiesented in the ~U1o1vv
taxonomy assumed heie (although it might of couise t othei possible subgiouping hypothe-
ses). ln this case, l posit two pseudo-subgioups foi computing family biases, a non-veib-nal
one and a veib-nal one. Anothei example is found in the Bantu bianch of Benue-Congo
all but one Bantu language in the database aie non-veib-nal. Heie l posit a laige non-veib-
DRAFT Maich 11, zc11
Distributional biases 1v
nal non-nal Sum
diveise z c z
Rel-N e 1
N-Rel 1 1 1s
Sum v 1s z
(a) laige families only
nal non-nal Sum
diveise zz.v .sc zs.v
Rel-N zv.I 1.v I1.es
N-Rel Iz.1s 1z.z 1v.I
Sum s.cc 1I.cc zzc.cc
(b) with extiapolation to small families
Table z lamily biases in ielative clause position dependent on main clause veib position
nal pseudo-gioup and a small veib-nal pseudo-gioup which is iepiesented only by a single
language in the database (viz. Tunen Mous 1vv). Te thiid case wheie pseudo-gioups aie
necessaiy is Aiawakan. Tis stock is iepiesented in the database with only single iepiesenta-
tives fiom each bianch, with ve non-veib-nal and one veib-nal bianch. Heie l assume a
non-veib-nal pseudo-gioup with ve membeis and a small single-membei gioup.'
With this, we aiiive at a total of z laige families, including I pseudo-gioups. Of the z
families, 11 (z) aie at the stock level, v (II) at the highest (majoi) bianch level, and 1 (1)
at lowei levels. Table za cioss-tabulates family biases against main clause veib oidei. Te hy-
pothesis is that biases towaids N-Rel (Noun-Relative Clause) sequences aie much moie likely in
non-veib-nal than in veib-nal families. Tis can be tested with a lishei lxact Test compaiing
the odds foi families with N-Rel biases against families with the opposite bias undei veib-nal
vs. non-veib-nal conditions. Te iesult suggests a signicant and stiong eect (one-sided p
.cc1, estimated odds iatio'
e1.e1). Tis is obviously caused by the fact that only one non-
veib-nal family in the database (viz. Sinitic) is biased towaids pie-nominal ielative clauses.
Te stiength of the univeisal can be estimated by the Laplace estimatoi of the piobability of
biases (cf. ), suggesting a piessuie of .v undei the ciitical condition of non-veib-nal oidei,
i.e. a faiily stiong univeisal.
Undei the othei condition, veib-nal oidei, the bias piobability is .I. Tis suggests
that undei veib-nal conditions, ielative clause position is genealogically stable (Scenaiio B in
Section z) oi, alteinatively, that theie is a univeisal tiend favoiing pie-nominal clauses (Scenaiio
A). Te e1 iatio in Table za is suggestive of a diiectional tiend, but the counts aie small and
exclude data fiom small families and isolates.
ln iesponse to this, l peifoimed extiapolations following the same pioceduie as desciibed in
Section , sepaiately foi each condition. Using the bias piobability estimates of .v foi non-nal
and .I foi nal woid oidei, this iesults in the mean estimates summaiized in Table zb.
' Te single veib-nal Aiawakan language in Diyeis (zccb) database is Taiiana, but this language would in fact
seem to be moie accuiately coded as lacking a dominant oidei (Aikhenvald zccI). On eithei analysis, Aiawakan
iequiies pseudo-gioups until possible subgioupings aie iobustly established.
' Although not commonly used in typology, the odds iatio () is a standaid and useful statistic to compaie piopoi-
tions acioss conditions. lt is dened as the iatio between the odds, and so an odds iatio of about ee means that
the odds foi biases towaids post-nominal ielative clauses aie ee times highei in non-veib-nal than in veib-nal
families.
DRAFT Maich 11, zc11
zc Distributional biases
Te extiapolations conim the ist nding fiom the laige families theie is a signicant
tiend foi families to be biased towaids post-nominal ielative clauses undei non-veib-nal con-
ditions (lishei lxact test, p .cc1,
.vs). Te mean estimated numbei of non-veib-nal
families with pie-nominal ielative clauses is 1.v. Tis guie iesults fiom the fact apait fiom
Sinitic, in 1sv out of zccc extiapolations (i.e. in v), an additional non-veib-nal language
was estimated to iepiesent a family with an oiiginal bias towaids pie-nominal ielative clauses.
Tis language is the Sino-Tibetan language Bai (Wieisma zccI), which has SVO main clauses
and pie-nominal ielative clauses. Te exact position of Bai within Sino-Tibetan is contiovei-
sial, and Nichols & Bickels (zccv) taxonomy tieats Bai as a stock-level isolate. lt is possible
that Bai comes fiom a bianch that oiiginally had post-nominal ielative clauses and changed
to pie-nominal oidei undei Sinitic inuence, i.e. that Bai comes fiom a diveise bianch. lt is
also possible, howevei, that Bai inheiited pie-nominal ielative clauses fiom one of its ancestois
(which might have changed to pie-nominal oidei eailiei, again possibly undei Sinitic inuence
oi even identity with pioto-Sinitic). Without fuithei ieconstiuction and detailed compaiative
woik, it is impossible to decide between these scenaiios. All that we know foi good is that
Bai now has pie-nominal ielative clauses and that, woildwide, the position of ielative clauses
in non-veib-nal languages is ielatively stable ovei time (estimated at .v). Tis favois a
scenaiio wheieby Bai inheiited its oideiing piinciples fiom its bianch ancestoi and does not
ieect iecent change undei Sinitic inuence. ln the extiapolations, this high bias estimate of
.v iesults in Bai being taken to ieect a bias towaids pie-nominal ielative clauses in v of the
zccc extiapolations, pushing the mean numbei up to 1.v.
Te second nding fiom the laige families was that theie is a possible piefeience foi pie-
nominal ielative clauses undei veib-nal conditions. Table za suggests odds of e1 foi this. But
this is not conimed by the extiapolations in Table zb, wheie the odds (zv.IIz.z .vz) go in a
dieient diiection but aie not signicantly dieient fiom1 anyway (p .1s, .e). Tis makes
it likely that theie is no diiectional bias and that instead the bias stiength of .I ieects a
faii degiee of genealogical stability (cf. Scenaiio B in Section z).
6.2 Example 2: hotbeds of pronominal gender
At the end of Section we obseived tentative evidence foi univeisal piessuie against pionom-
inal gendei. Howevei, as Nichols (1vvz, zccI) notes foi gendei in geneial, pionominal gen-
dei tends to be beuei ietained in families when they clustei togethei with similai families in
hotbeds while the phenomenon does not appeai to spiead easily, its ietention seems to be
favoied in specic iegions.
ln oidei to test this hypothesis, l classied the data fiomSiewieiska (zcc) into thiee hotbeds
based on Nicholss (1vvz) suggestions Afiica (including adjacent Semitic languages), (Westein)
luiope (up to a line fiom the Caipathians following the Wisa to the Baltic see, cf. Nichols &
Bickel zccv) and the Sahul aiea (including neai islands up to the Wallace line and collapsing the
stiata postulated by Nichols 1vvb, see cf. Map 1). l then computed family biases within and
outside the hotbeds.
Te dataset contains 1 laige families (with at least ve membeis), I small families, and
1z isolates. Most families aie located completely within oi outside hotbeds, but I of the z
(e) families aie split lndo-luiopean, Uialic, and Austionesian. Within lndo-luiopean, non-
DRAFT Maich 11, zc11
Distributional biases z1
Map 1 Te distiibution of pionominal gendei acioss hotbeds (Siewieiska zcc, with some ad-
ditions in South Ameiica and Papua New Guinea). / Afiica, / (Westein) luiope, /
Sahul, / iest of the woild, lled symbols denote piesence, empty symbols absence of
pionominal gendei
split taxa can be found at the majoi bianch level except foi Balto-Slavic which accoiding to
Nichols & Bickels (zccv) naiiow denition of luiope splits into subbianches within (West
Slavic) and subbianches (last Slavic and Baltic) outside the luiopean hotbed. Te same is tiue
of Uialic, wheie non-split taxa can only be found at the lowest taxonomic levels since even
linno-Ugiic is split by the naiiow denition (leaving Hungaiian inside the luiopean hotbed
and linnish outside). Te situation is again similai in Austionesian wheie the Sahul hotbed
boundaiy ciosscuts the Oceanic and Cential Malayo-Polynesian subgioups. As a iesult, non-
split gioups can only be found at ielatively shallow taxonomic levels, each with small numbeis
of membeis (below ). Te iest of Malayo-Polynesian (the Westein Malayo-Polynesian non-
clade of Nichols &Bickel zccv) is again split and foi lack of established subgiouping, it is divided
heie foi statistical puiposes into a laige pseudo-gioup (N s) outside and smallei pseudo-gioup
(N 1) inside the Sahul hotbed.
Te splits leave a total of 1s laige unsplit gioups (with moie than ve membeis), tabulated
in Table Ia. Tis is a small numbei to base statistical estimates on. loi the Laplace estimatoi (cf.
) this means lost of piecision and moie iandom guessing on the extent to which small families
iepiesent biases. To some extent this is compensated by the iesampling stiategy desciibed in
Section , but the iesults must cleaily be taken as pieliminaiy. On the basis of Table Ia the
bias estimatoi is .c inside and .s outside the hotbeds as expected, it is a bit moie
likely foi families to be biased (in some diiection) inside than outside the hotbeds. Te bias
estimatois iesult in a mean extiapolation to small families as given in Table Ib. An analysis
of the extiapolations shows that the odds foi biases towaids gendei aie
z.I times highei
inside than outside the hotbeds, which is signicant undei a lishei lxact Test (p .czv). Tis
suppoits the hypothesis that gendei is beuei pieseived in families when they clustei togethei
in hotbeds.
DRAFT Maich 11, zc11
zz Distributional biases
inside outside Sum
diveise z 1 e
with gendei 1 c 1
without gendei z e s
Sum s 1c 1s
(a) laige families only
inside outside Sum
diveise zv.cI 1.I e.e
with gendei zv.1c 1.cs 1.zv
without gendei Iv.s 1e.1v vc.v
Sum vs.cc 11.cc z1I.cc
(b) with extiapolation to small families
Table I lamily biases in pionominal gendei inside vs. outside hotbeds
7 Discussion
Te family bias estimates iepoited heie aie pieliminaiy and cleaily need fai moie densei sam-
pling of families. Tis is a sampling stiategy that is the exact opposite of what has been iec-
ommended in the past, wheie typologists have emphasized that samples should avoid picking
many iepiesentatives fiom the same families, i.e. that they should be genealogically balanced.
lt is instiuctive to compaie the iesults iepoited heie to the conclusions that one might diaw on
the basis of genealogically balanced sampling.
Since Diyei (1vsv), the standaid in the eld has been to cieate samples in which each family
contiibutes one single datapoint. When families aie diveise, e.g. some membeis have gendei
and otheis dont, they aie sometimes counted as contiibuting seveial datapoints (and theie
aie moie oi less iened methods foi dealing with this, cf. Bickel zccsb). Te key point of the
method, howevei, is that families aie always tieated statistically in the same way as isolates
and that any tiends oi biases within families is ignoied. Te iesult of such a pei-family count
(using Bickels (zccsb) algoiithm) is given in Table 1. Unlike in Table Ib, the odds iatio of this
table is not neai signicance undei a lishei lxact test (