[Figure: diagram with node and edge labels: solve, problem, has, algorithm, complexity, is a, sorting algorithm]
Rules
Relations
Concept Hierarchy
Concepts
Synonyms
Terms
A set of concept instances, i.e. its extensions: a term can be considered a concept if it has
instances.
A set of linguistic realizations.
Identify the sense t_sense that relates to the domain from the list of senses (sense disambiguation)
Simplified LESK
To solve combinatorial explosion
Runs a separate disambiguation process for each ambiguous word in the input text
Adapted LESK
Enlarged context: consider hypernyms, hyponyms, holonyms, meronyms, troponyms, attribute relations, and their associated definitions
Less accuracy
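The combinatorial explosion mentioned above can be illustrated by counting comparisons. The sketch below assumes the usual description of the two variants: the original algorithm scores every combination of senses across all ambiguous words jointly, while Simplified LESK scores each sense of each word separately against the context. The function names and the sense counts are hypothetical, chosen only for illustration.

```python
from math import prod


def joint_sense_combinations(sense_counts):
    """Sense combinations to score when all ambiguous words are resolved jointly."""
    return prod(sense_counts)


def per_word_comparisons(sense_counts):
    """Gloss-vs-context comparisons when each ambiguous word is resolved separately."""
    return sum(sense_counts)


# Hypothetical input: four ambiguous words with 3, 5, 4 and 6 candidate senses.
sense_counts = [3, 5, 4, 6]

joint = joint_sense_combinations(sense_counts)   # 3 * 5 * 4 * 6
separate = per_word_comparisons(sense_counts)    # 3 + 5 + 4 + 6

print(joint, separate)
```

The joint count grows multiplicatively with every additional ambiguous word, whereas the per-word count grows only additively.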
N2
2N3
N1+N2+2N3
C2
For example
WNs1 e.g. a hidden storage space for money or provisions or weapons
WNs2 e.g. a secret store of valuables or money
WNs3 e.g. RAM memory that is set aside as a specialized buffer storage, which is continually updated; used to optimize data transfers between system elements with different characteristics
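As a rough illustration of how gloss overlap could pick among the three senses above, here is a minimal sketch of a Simplified LESK style scorer. It is not the deck's actual implementation: the stopword list, the tokenizer, and the context sentence are assumptions made for this example, and the glosses are copied from the three senses listed above.

```python
import re

# Assumed, deliberately small stopword list for this illustration.
STOPWORDS = {"a", "an", "the", "of", "for", "or", "as", "is", "that", "to",
             "with", "which", "used", "between", "in", "and"}


def tokens(text):
    """Lowercase the text and return its set of non-stopword alphabetic tokens."""
    return {w for w in re.findall(r"[a-z]+", text.lower()) if w not in STOPWORDS}


def simplified_lesk(context, glosses):
    """Score each sense by word overlap between its gloss and the context."""
    context_words = tokens(context)
    scores = {sense: len(tokens(gloss) & context_words)
              for sense, gloss in glosses.items()}
    best = max(scores, key=scores.get)
    return best, scores


# Glosses taken from the three senses shown above.
GLOSSES = {
    "WNs1": "a hidden storage space for money or provisions or weapons",
    "WNs2": "a secret store of valuables or money",
    "WNs3": ("RAM memory that is set aside as a specialized buffer storage, "
             "which is continually updated; used to optimize data transfers "
             "between system elements with different characteristics"),
}

# Hypothetical computing-domain context sentence.
context = "the processor reads data from the cache memory instead of slower system storage"

best, scores = simplified_lesk(context, GLOSSES)
print(best, scores)
```

With this context the computing sense shares the most words with the surrounding text, so it gets the highest overlap score. A real system would likely add stemming and a fuller stopword list.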
Annotator 1: 75%
Annotator 2: 56%
Annotator 3: 78%
Bio Medical Recall
Our approach
58.70%
57.73%
20.27%
Why LESK?
Conclusion
Choosing a best WSD algorithm based on
References
K. Balachandran and S. Ranathunga, "Domain-Specific Term Extraction for Concept Identification in Ontology Construction," in IEEE/WIC/ACM International Conference on Web Intelligence, Omaha, Nebraska, USA, 2016, pp. 34-41.
P. Buitelaar, P. Cimiano, and B. Magnini, Ontology learning from text: methods, evaluation and applications, vol. 123. IOS Press, 2005.
X. Zhou, X. Zhang, and X. Hu, "MaxMatcher: Biological concept extraction using approximate dictionary lookup," in PRICAI 2006: Trends in Artificial Intelligence. Springer, 2006, pp. 1145-1149.
L. V. Subramaniam, S. Mukherjea, P. Kankar, B. Srivastava, V. S. Batra, P. V. Kamesam, et al., "Information extraction from biomedical literature: methodology, evaluation and an application," in Proceedings of the twelfth international conference on Information and knowledge management, 2003, pp. 410-417.
G. Hirst and D. St-Onge, "Lexical chains as representations of context for the detection and correction of malapropisms," WordNet: An electronic lexical database, vol. 305, pp. 305-332, 1998.
S. Banerjee and T. Pedersen, "An adapted Lesk algorithm for word sense disambiguation using WordNet," in Computational linguistics and intelligent text processing. Springer, 2002, pp. 136-145.
Z. Wu and M. Palmer, "Verbs semantics and lexical selection," in Proceedings of the 32nd annual meeting on Association for Computational Linguistics, 1994, pp. 133-138.
M. Lesk, "Automatic sense disambiguation using machine readable dictionaries: how to tell a pine cone from an ice cream cone," in Proceedings of the 5th annual international conference on Systems documentation, 1986, pp. 24-26.
C. Leacock and M. Chodorow, "Combining Local Context and WordNet Similarity for Word Sense Disambiguation," WordNet: An Electronic Lexical Database, vol. 49, pp. 265-283, MIT Press, 1998.
J. J. Jiang and D. W. Conrath, "Semantic similarity based on corpus statistics and lexical taxonomy," in Proc. Int. Conf. Research in Computational Linguistics, 1998, pp. 19-33.
Questions?
Thank You