Skip to Main content Skip to Navigation
Conference papers

Evaluating the Impact of External Lexical Resources into a CRF-based Multiword Segmenter and Part-of-Speech Tagger

Abstract : This paper evaluates the impact of external lexical resources into a CRF-based joint Multiword Segmenter and Part-of-Speech Tagger. We especially show different ways of integrating lexicon-based features in the tagging model. We display an absolute gain of 0.5\% in terms of f-measure. Moreover, we show that the integration of lexicon-based features significantly compensates the use of a small training corpus.
Document type :
Conference papers
Complete list of metadata

https://hal-upec-upem.archives-ouvertes.fr/hal-00790624
Contributor : Matthieu Constant Connect in order to contact the contributor
Submitted on : Wednesday, February 20, 2013 - 4:01:25 PM
Last modification on : Tuesday, October 19, 2021 - 11:26:19 AM
Long-term archiving on: : Tuesday, May 21, 2013 - 9:27:07 AM

File

constant-tellier-lrec2012.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-00790624, version 1
`

Citation

Mathieu Constant, Isabelle Tellier. Evaluating the Impact of External Lexical Resources into a CRF-based Multiword Segmenter and Part-of-Speech Tagger. 8th International Conference on Language Resources and Evaluation (LREC'12), May 2012, Turkey. pp.646-650. ⟨hal-00790624⟩

Share

Metrics

Record views

649

Files downloads

235