A metric for word similarity in WordNet

T. Hong-Minh, D. J. Smith

Research output: Contribution to conferencePaper

Abstract

Understanding concepts expressed in natural language is a challenge in Natural Language Processing and Information Retrieval. It is often decompressed into comparing semantic relations between concepts, which can be done by using Hidden Markov model and Bayesian Network for part of speech tagging. Alternatively, the knowledge-based approach can also be applied but it was not well explored due to the lack of machine readable dictionaries (such as lexicons, thesauri and taxonomies). However, more dictionaries have been developed so far. Following this approach, we present a measure for semantic similarity between concepts. By exploiting advantages of distance (edge-base) approach for taxonomic tree-like concepts, we enhance the strength of information theoretic (node-based) approach. Our measure therefore gives a complete view on word similarity, which can not be achieved by solely applying node-based approach.
Original languageEnglish
Publication statusPublished - Mar 2006
EventInternational Conference on High Performance Scientific Computing - Hanoi, Vietnam
Duration: 6 Mar 200610 Mar 2006

Conference

ConferenceInternational Conference on High Performance Scientific Computing
CityHanoi, Vietnam
Period6/03/0610/03/06

Cite this