Implementation of Collocation Extraction in Unitex

Abstract : Collocation extraction is a complex problem in the field of corpus linguistics that requires both statistical and linguistic information in order to be successful. The problem is attracting more attention as the importance of collocations (also known as multi-word expressions) is recognized by the NLP community. In this report, I present the work that I reviewed and the Colloc module that resulted from it, explaining the reasons behind the decisions taken during the implementation and what remains to be done.
Document type :
Reports
https://hal-upec-upem.archives-ouvertes.fr/hal-00628579
Contributor : Lingu Ligm
Submitted on : Monday, October 3, 2011 - 4:22:56 PM
Last modification on : Wednesday, February 26, 2020 - 7:06:06 PM
Long-term archiving on : Tuesday, November 13, 2012 - 3:01:53 PM

File

hal.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-00628579, version 1

Citation

Burak Arslan. Implementation of Collocation Extraction in Unitex. 2007. ⟨hal-00628579⟩
