Professional Documents
Culture Documents
Outline
Applications
Course implementation
p
Semantic analysis
(or understanding)
(or,
Language
generation
i
Tainio
Syntactic parsing
VP
Verb
scored
NP
Det
Adjective
Noun
beautiful
goal
Morphology: score-d
Semantic analysis
y
= figure out the meaning of linguistic input (construct meaning
representations)
= process language to produce common-sense knowledge about
the world (extract data and construct models of the world)
Lexical semantics
meanings of component words
word sense disambiguation (e.g. country in political or musical sense)
Compositional semantics
how words combine to form larger meanings
What is language
g g understanding
g
1.
2.
3.
4.
Understanding language means knowing how to use it. - - To master a technique. [Wittgenstein]
At matriculation examination: the student is presented with a
portion of text or speech and then asked questions about it
...
It is difficult to give one generic definition
Thinking of a concrete application of semantic analysis is
probably the best way of defining the problem
Approaches
pp
to semantic analysis
y
Predicate logic
The sentence
a restaurant that serves Chinese food near TUT
corresponds to the meaning representation
x Restaurant( x) Serves( x, ChineseFood)
Near( LocationOf( x), LocationOf( TUT)
semantic analysis = creating meaning representations from ling. input
logical propositions enable inference
scalability problem (large vocabulary or unrestricted domain)
Approaches
pp
to semantic analysis
y
Statistical approach
statistical machine translation (as an example)
Approaches
pp
to semantic analysis
y
Information retrieval
Google solves a certain part of the problem in a statistical way:
answers to trivial kind of questions can be located using a web search
engine
assumes an database (Internet) and a clever page ranking system
10
Approaches
pp
to semantic analysis
y
pragmatics,
ti
socially
i ll probable
b bl
set of moves in a certain context
11
Ludwig
g Wittgenstein
g
((1889-1951))
Language World
propositions facts
elementary propositions states of affairs
names objects
[A.C.Grayling,
Wittgenstein,
Oxford Univ
Univ. Press]
12
Ludwig
g Wittgenstein
g
((1889-1951))
Later philosophy
language resembles a game: it is based on agreed-upon rules
what
h t consitutes
it t a rule
l is
i our collective
ll ti use off it (general
(
l practice,
ti
custom)
t )
understanding language means knowing how to use it
P di t llogic
Predicate
i may still
till b
be good
d ffor meaning
i representations
t ti
13
Applications
pp
of semantic analysis
y
Information extraction
extract small amounts of pertinen information from large bodies of text
find an answer to a question for example
Text summarization
Information retrieval (cf. Google) and document classification
Machine
ac e translation
a sa o
Human-computer interaction
conversational agents: book plane tickets, query for a restaurant, ...
Expert systems
free help: please show me how to widen the margins of my document
Surveillance
14
Applications
pp
of semantic analysis
y
Jim Gray
15
Ambiguity of language
I made her duck, for example, could mean [Jurafsky&Martin]
16
17
Comparison
p
with artificial intelligence
g
((AI))
Relation to compilers
18
Resources: Web
19
Resources: dictionaries
How about
H
b t analyzing
l i meaning
i
by looking up in a dictionary?!
breaking down words into simpler
words is not straightforward
symbol grounding problem:
terminal concepts must be associated
with physical or sensory data
but: Helen Keller case
(a deaf-blind college
graduate)
d t )
20
21
A l h t frame
Apply_heat
f
includes
i l d off the
th following
f ll i elements
l
t (and
( d more))
22
Resources: ConceptNet
p
((2002@MIT
@
))
ConceptNet
nodes:
semistructured
English fragments
archs:
twenty semantic
relations
23
Scope
p of this course
24
Seminar topics
p ((choose yyours))
Predicate logic approach:
M
Meaning
i representations.
t ti
45 min.
i
First order predicate calculus. 45 min.
Syntax-driven semantic analysis. 45 min
Idioms robustness,
Idioms,
robustness and special issues
issues. 45 min
Lexical semantics:
Introduction to lexical semantics and WordNet (Princeton). 45 min.
Word sense disambiguation
disambiguation. 45 min
Latent semantic analysis. 45 min
Information retrieval
Classical methods
methods. 45 min
min.
How Google works? 45 min
(Pragmatics. 2 x 45 min)
Machine
M hi ttranslation
l ti
Introduction and overview. 45 min
Statistical machine translation. 45 min