Corpus oraux et chunking

Abstract : This paper describes a partial parsing process for a corpus of spontaneous spoken French. It relies on a preprocessing stage that reformats and tags the utterances that break the syntactic structure of the text. The chunking stage uses large-coverage, fine-grained lexical resources for general language, augmented with resources specific to spoken language. We show that the process reaches an F-measure of 84.1%.
Document type :
Conference papers

https://hal-upec-upem.archives-ouvertes.fr/hal-00637677
Contributor : Matthieu Constant
Submitted on : Wednesday, November 2, 2011 - 4:15:29 PM
Last modification on : Tuesday, October 19, 2021 - 11:26:18 AM
Long-term archiving on : Friday, February 3, 2012 - 2:35:50 AM

File

blanc_constant_dister_watrin_J...
Files produced by the author(s)

Identifiers

  • HAL Id : hal-00637677, version 1

Citation

Olivier Blanc, Mathieu Constant, Anne Dister, Patrick Watrin. Corpus oraux et chunking. 27èmes Journées d'Études sur la Parole (JEP'08), 2008, France. pp.4. ⟨hal-00637677⟩
