Implementation of Collocation Extraction in Unitex - Archive ouverte HAL Accéder directement au contenu
Rapport Année : 2007

Implementation of Collocation Extraction in Unitex

Résumé

Collocation extraction is an elaborate problem in the eld of corpus linguistics that requires both statistical and linguistic information in order to be successful. The problem is attracting more attention as the importance of collocations, (aka multi-word expressions) is recognized by the NLP community. In this report, I present the work that I reviewed, and the Colloc module that was the result of this work, by explaining reasons behind taken decisions during the implementation, and of course, what remains to be done.

Domaines

Autre [cs.OH]
Fichier principal
Vignette du fichier
hal.pdf (220.67 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-00628579 , version 1 (03-10-2011)

Identifiants

  • HAL Id : hal-00628579 , version 1

Citer

Burak Arslan. Implementation of Collocation Extraction in Unitex. 2007. ⟨hal-00628579⟩
117 Consultations
316 Téléchargements

Partager

Gmail Facebook X LinkedIn More