Conference paper, Year: 2010

Evaluating the impact of some linguistic information on the performances of a similarity-based and translations-oriented Word Sense Disambiguation method

Myriam Rakho
  • Role: Author
  • PersonId: 933767
Mathieu Constant

Abstract

In this article, we present an experiment in linguistic parameter tuning for the representation of the semantic space of polysemous words. We quantitatively evaluate the influence of some basic linguistic knowledge (lemmas, multi-word expressions, grammatical tags and syntactic relations) on the performance of a similarity-based Word Sense Disambiguation method. The question we try to answer with this experiment is which kinds of linguistic knowledge are most useful for the semantic disambiguation of polysemous words in a multilingual framework. The experiment covers 20 French polysemous words (16 nouns and 4 verbs), and we use the French-English part of the sentence-aligned EuroParl corpus for training and testing. Our results show a strong correlation between system accuracy and the degree of precision of the linguistic features used, particularly the syntactic dependency relations. Furthermore, the lemma-based approach clearly outperforms the word form-based approach. The best accuracy achieved by our system is 90%.
Main file
ConstantRakho-lrec2010.pdf (589.25 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-00762911, version 1 (09-12-2012)

Identifiers

  • HAL Id: hal-00762911, version 1

Cite

Myriam Rakho, Mathieu Constant. Evaluating the impact of some linguistic information on the performances of a similarity-based and translations-oriented Word Sense Disambiguation method. Seventh International Conference on Language Resources and Evaluation (LREC'10), May 2010, Malta. pp.1200-1205. ⟨hal-00762911⟩
