Professional Documents
Culture Documents
Semantic
Determining the meaning of words and groups of words. We have to address different types of
ambiguity
(i) same words with completely different meanings e.g might, will, bank
(ii) same words with some common ancestry e.g. stamp, surf
(iii) same words with varying meanings e.g. game (reference Wittgenstein in
Philosophical Investigations)
(iv) different interpretations e.g. replace the bolt can either mean get another bolt or
put the bolt back where it was
Pragmatic
Non-explicit inference, e.g. Can you tell me the time?
In a Verbmobil dialog to fix a meeting one speaker suggests a day to which the reply
is Isnt that All Saints Day? implying it is a holiday.
Dialog management
Handling the process of a dialog in interactive sytems.
World knowledge
Very hard to capture outside the most limited domains, but essential to understanding.
e.g. a BBC subtitle on a sporting event should have read there is a bit of acrimony between these
two players but actually output there is a bit of macaroni between these two players
noun
verb
adjective
adverb
determiner
pronoun
preposition
conjunction
exceptions
content words
function words
anything else
Content words have semantic content: they are also known as open class words since their number can
grow as new names and activities arise. Function words can be thought of as the glue which sticks the
content words together. They are also known as closed class words since new ones are very unlikely to
occur.
Many words have more than one part-of-speech tag: e.g. water, shop, fish can be nouns or verbs.
In working systems tagsets are usually larger: some of the categories can be subdivided, and there are
others we have temporarily omitted.
Phrase definitions
A noun phrase is a noun or group of words that acts like a noun.
A verb phrase is a verb or a verb with its following predicate
A prepositional phrase is a preposition followed by a noun phrase
Consider the sentence
the man crossed the river with the crocodiles
This contains the noun phrases: the man, the river, the river with the crocodiles
The verb phrase: crossed the river with the crocodiles
The prepositional phrase: with the crocodiles
See Covington, page 83 - 85 or Allen chapter 2,
Phrase structure grammars capture some of the structure of language, and define how a parser moves
from state to state in a sentence. They model some significant characteristics, such as recursion. PSG
Rules include direct recursion, such as
NP --> Det NP
or indirect, such as
NP --> Noun PP ( as in he crossed rivers with crocodiles)
PP --> Prep NP
NP
det
VP
noun
verb
NP
det
the
man
crossed
the
noun
river
References
You do not have to read all of these, but should read some of them
Allen, James, 1995 Natural Language Understanding, Benjamin/Cummings
(especially chap 2 )
Charniak, Eugene, 1993, Statistical Language Learning, MIT press
(especially chapter 1 which compares symbolic and rule based approaches.)
Covington, Michael, 1994 Natural Language Processing for Prolog Programmers, Prentice Hall (see
chap 4 for a clear, simple introduction). Good explanation even if you do not know Prolog
Russell, Stuart, and Norvig, Peter, 2003, Artificial Intelligence, a Modern Approach, Prentice Hall, chaps
22 and 23
Background reading
Pinker, Steven, 1994, The Language Instinct, - Interesting and entertaining, neo-Chomskyist
Sampson, Geoffrey, 2005, The Language Instinct Debate, pub. Continuum
Crystal, David, 1995 The Cambridge Encyclopaedia of the English Language, CUP - dip into this.
Lieberman, Philip, 2000, The Evolution of Language and our Reptilian Brain Interesting, new approach.
Pick out his main themes and skip the rest.
Chomsky, Noam, 2000, New horizons in the study of language and mind, CUP Hard to understand.
Gives you an idea of his approach.