Professional Documents
Culture Documents
DEEP BACKGROUND
Massimo Piattelli-Palmarini
University of Arizona
Spring 2002
Introduction:
Goodmans chapter is called "The new riddle of induction". What was the
old riddle? It was Humes problem of justifying induction. Justifying induction,
that is, on deductive grounds. But that proved to be impossible. Any induction
can only be "justified" on the basis of another induction (that there is uniformity
in Nature, that like effects are derived from like causes, that one phenomenon is
sufficiently similar to another phenomenon etc.). Hume concluded that the only
justification is the formation of habits of the mind that lead to useful predictions
more often than not.
Why deal with Goodmans Paradox (GP) in the study of linguistics?
Because it concerns induction and therefore learning. As Fodor has rightly
stressed ever since 1974 (in Psychological Explanation, and again in The Language
of Thought, and in the debate with Piaget, and in Concepts), the only theory of
learning that we remotely understand is a theory of how hypotheses are
confirmed or refuted by experience. This applies, with bells on, to the learning of
concepts, and/as the learning of lexical meanings. Everyone (I really mean
everyone, in any field) who tries to explain a cognitive competence as the
product of "learning", has to solve GP for that competence. Notably, he/she has
to offer explanations for (at least) the following:
(1) a: Where do the hypotheses that the learner puts to test (submits to
the "tribunal of experience", in Quines words) come from?
b: Which ones are tested first, and why?
c: Which ones are never even tried out, and why?
d: What counts as "confirmation" of a tested hypothesis, and why?
e: How many confirming data are needed, for the hypothesis to be
finally adopted?
GP in essence:
Closer to us:
The empiricists theory of the learning of the lexicon by the child invokes a
"pairing" between a sound and an external stimulus. (The pairing of a word to a
referent)2. A series of episodes of conspicuous ostensions accompanied by the
simultaneous utterance of a word are alleged to be the core of the process. But
(see Bloom, among other authors) this is a very special case, at best. Because (just
to name a few counters):
(2) a: The most common situation is one in which the new word is
embedded in a sentence;
b: The referents of many words have no sensory counterpart (uncle,
yesterday, remembering etc.);
c: The attention of the speaker and of the child must be jointly
focused onto something in an obvious way (a "theory of mind" is
crucially presupposed);
d: The presence of the referent and the utterance of the word are
usually not simultaneous (Where is my compass? Not here, not
here. Ah, here it is!). The mother is not making "an ongoing
commentary" (Gleitmans term) of what is happening, while its
happening. Most of the time referents are evoked and not
"presented".
1 In short, we have to make radical assumptions that rule out most of the possible answers to a
given problem before deciding which of several closely related possible answers are correct -- we
already 'know' where to look for the answer. In fact, we already know nearly everything. Where
does that knowledge come from? If it's innate, with respect to some kinds of problems people
may forever be like the drunk looking for his keys under the lamppost.
2 Humes formation of habits of mind.
3
(3) a: The left boot only (if what has been shown is a left boot);
b: A brown boot only (if a brown exemplar has been shown);
c: That object plus about 1 inch of air surrounding it;
d: That object plus about 2 inches of air surrounding it (and so on ad
infinitum);
e: The undetached sole of the boot;3
f: The undetached tip of the boot.
And so on, and so on. Why no child ever entertains those fancy
hypotheses, and "gets" the meaning right, from the very start? Our suggestion:
There are powerful innate pre-delimitations on the possible meanings of words,
and on the ways words map onto things and events in the world, and on what
Mother "intends to convey".
publish this example). He, in fact, was the staunchest cultural relativist, and
nominalist-pragmatist that there ever was. In his opinion, a human group was
free to invent any language it pleased, and see it adopted by everyone, through a
tacit convention.
Let R be the property "is a raven". Let B be the property "is black". All
ravens are black has the logical form of the logical conditional (ever since
Bradley)
4Jean Nicod was a great French logician and mathematician, who invented this paradox around
1930.
5
(9) a: Relevance
b: Category
c: Similarity
But these, alas, are not logical concepts. We can say a lot about these
notions, but not by means of logic alone.
Goodman stresses the difference between "this piece of copper conducts
electricity" (naturally leading to expect conductivity in other pieces of copper)
and "this man is a third-born" (leading to no expectation at all in other men in the
room). His account of the different "projectibility" of these two predicates relies
on a property of evidential attributes that he calls "entrenchment". Its entirely
pragmatic, context-relative and collective. It is rooted on a history of past success
in the projection of predicates (for instance those that are based on "micro-
structure", versus those that are purely relational). We will not go into this, for
reasons of time, of relevance to this class, and because, in my opinion, this
account of "entrenchment" was never developed in a satisfactory way by
Goodman (or by anyone else).
I observe many gems, similar to emeralds, and more and more emeralds.
All emeralds I have observed so far are green. Is this a confirmation of my
hypothesis? Sure! But, is it a confirmation of my hypothesis only? NO! Goodman
points out that these observations also, and by the same token, confirm an
infinity of other hypotheses, all different from, and incompatible with, mine.
Goodmans"other" hypotheses
First reaction:
Another try:
otherwise observed"). And makes reference to a point in time in the future. Is this
not an "objective" (logical) distinction?
Goodmansanswer is: NO! It all depends on your theoretical vocabulary. It
depends on what your scientific language considers to be primitive:
b: Goodmanian
Grue and bleen are primitive.
Then green and blue have to be defined in terms of grue and bleen.
In this language, green and blue are disjunctive and contingent. I
have to introduce disjunctions and an instant of time in the future
to define them.
(14) a: Human
Grue = observed before instant t and ascertained to be green,
or otherwise observed and ascertained to be blue.
Bleen = observed before instant t and ascertained to be blue,
or otherwise observed and ascertained to be green.
b: Goodmanian
Green = observed before instant t and ascertained to be grue,
or otherwise observed and ascertained to be bleen.
Blue = observed before instant t and ascertained to be bleen,
or otherwise observed and ascertained to be grue.
Logic cannot decide. It all depends on the theoretical language that you
decide is basic. Davidson refined this paradox.
A frequent misunderstanding:
The hypothesis "All emeralds are grue" does not say that these stones will
undergo a "change in color". The hypothesis does not contain an event that is an
event of color-changing. That is something extra. Not part of the hypothesis. That
would add a physiological hypothesis (the color suddenly looks different to a
human eye). And/or a micro-structural hypothesis (the molecular conformation
changes suddenly) But the original hypothesis does not say anything like that.
Green and blue are predicates that in our world are more "projectible"
than grue or bleen. There is nothing in the property of projectibility that is
"absolute" or "logical" (nor anything, in his opinion, that remotely connects to
innateness, or "human nature"). Projectibility is relative to our concerns, interests,
background knowledge, and on a history of "past success". It is an entirely
pragmatic notion Certain properties (predicates) are more "entrenched" than
others. They have a longer history of collective successes in being projected onto
new cases.
(18) a: Simplicity
b: Elegance
c: Similarity with other phenomena
d: Other laws we believe to be true
Fine and dandy, BUT these are not logical criteria.5 Yes, but what is simple
and what is not, and what counts as "beyond what is necessary" are entirely
context-relative to Goodman.
(19) a: As you know, about 49% of recorded human births have been girls.
What's your judgmental probability that the first child born in the
21st century will be a girl?
5
cf. modern movie variants of Descartess Demon (as in the movies "The Truman Show" or "The
Matrix"). There is in principle no way to distinguish between the hypothesis that things are as
they appear to be and the hypothesis that things that appear to be do so only at the behest of a
capricious external power. -- although it might affect what kinds of events you imagine might
occur in the future. The intuition that accepting things at face value is 'simpler' than positing a
capricious external power, or that Ockham's Razor has any content at all, is the very heart of
Goodman's conundrum.
6
He is one of the greatest philosophers of probability, professor at Princeton.
10
I bet that the first reaction of some of you will be: (1) This is damn
important in many ways, to many problems (in fact to all problems of learning);
(2) How come I was never told about this paradox? The explanation is simple:
This is considered "stuff for philosophers" (and it is!), but psychologists, linguists
and other "social scientists" simply ignore this paradox. It figures prominently in
the work of Chomsky and Fodor, but one would be hard put to find any explicit
mention of it anywhere else. Bloom & other acquisition people are paying
attention to it with their hypotheses about the built-in assumptions that babies --
very young babies -- make about the world.
Some proposals for assumptions the child brings to the word-learning process:
Biggest problem for all of the above: only work for nouns! But kids learn
more than nouns, and do so very early: more, bye, hit, want, up, no.
Survey of 20-month olds' vocabulary revealed that only half of their nouns
referred to basic kinds of solid objects. Other types of words included location
(beach), temporal entities (day) and events (party). Words like friend and uncle
also appear early, as do pronouns (yes, but they are special, see Laura Petittos
study on normal and congenitally deaf children. Persona pronouns appear all of
sudden with a transient brief period of referential inversion) and proper names.
Secondary issue: are these constraints supposed to be innate? Really seems
like they must be. Then, are they specific to the language-learning problem? Or
can they be shown to fall out from some other recognized system?
Bloom's proposal is essentially Gleitman's syntactic bootstrapping
hypothesis, applied to the domain of nominals.
(23-i) takes care of the problem of proper names and pronouns; because
they're NPs, they must refer to individuals.7
Some problems:
b. Italian: what about languages where it's not clear syntactically what's a proper
name and what's not? See above. Quite puzzling. Or perhaps there's other clues.
Maybe, but its far from obvious. Gender is irrelevant (see above) Clitics
apply to both "Ne ho bevuta tanta" (acqua, mass) and "Ne ho bevuti tanti
(bicchieri, count). Raffi has interesting evidence that the "reflexive" "si" may have
a measuring out role wit h some verbs. But the determiner+proper name thing
would be worth looking into. Not in any obvious way. Seems to be like English
(un po, di piu, piu apply to mass, determinative articles to count).
c. Mass nouns in English: often look like NPs, not just Ns. In order to distinguish
between an N and an NP reading of water in a string like I like water, (vs. I like
Fido) the child would also have had to have heard and remembered, Give me
some water or similar (*Give me some Fido). That was Quines point: The
difference between water and "mama" (his examples) can ONLY come
after the child has mastered the quantifiers. But the experiments by Sue Carey
and Nancy Soja show that 2 + 1/2 year-olds immediately understand whether
what you show is an object (blinket) versus a stuff (dax) and the do not master
quantifiers. I pointed this out to Quine and, disappointingly, he told me he could
accommodate these date in his theory. He did not tell me how.
7
The special pronominal forms in the genitive in English constitute evidence for this analysis (as
do many other phenomena, but the genitive is positive evidence), because that 's affix attaches to
NPs, not Ns, and hence there are no irregular N genitives, unlike, e.g., plurals. However there are
"irregular" pronominal genitives (his, mine) as well as regulars (hers, theirs, its, yours).
13
Chierchia has put forth the theory that mass is the default in Chinese, while
count is the default in English. I never understood how that may be, unless they
have different genes presiding over what is the default. Maybe some other cues
in the language suggest what is the "local" default. UNCLEAR TO ME.
9. a. But, claim some researchers, kids DON'T associate 'individuals' with count
nouns and 'portions' with mass nouns. That is, lots of experiments demonstrate
that the linguistic cue (i.e. 'N preceded by a' or 'bare N') overrides the semantic
cue (i.e. blob of amorphous stuff or well-defined solid object). Carey and Soja
observed that the determiner (a, the, some, more, another) is irrelevant to the 18-
months old, becomes relevant between 3 and 5 years of age, then becomes
irrelevant again (for other reasons) over that age. Surely "cake" (count) and
"dessert" (mass) may have the same ostensible referent. See also the Italian
examples above.
They conclude that sorting into mass vs. count categories is a purely syntactic
procedure, without any semantic repercussions or implications: the association
between 'individuals' vs 'portions' and 'count' vs. 'mass' happens later. Well,
something like that. But some basic "match" between what is presented and the
syntactic form must be active in acquisition. A rough and ready, fallible heuristic,
that does good duty 85% of the times.
10. Evidence that kids are doing some semantic categorization, differentiating
between II and III
a. They make mistakes more often with non-canonical mass nouns like furniture
and money, and less often with canonical ones like juice and milk. Well, the latter
are "STUFF", while the former are more abstract. Not surprising.
c. They assume that a novel count noun in the presence of unfamilar amorphous
substance refers not to the substance but to the bounded pile of it. (Doesn't work
in reverse they really want words referring to discrete bounded objects to
refer to the object, not the stuff it's made of) Yep, thats Carey and Soja. But a
splintered fragment of teak(wood) would suggest that we are referring to that
stuff , not to that "thing".
14