
Corpus oraux et chunking

Abstract : This paper describes a partial parsing (chunking) process for a corpus of spontaneous spoken French. It relies on a preprocessing stage that consists of reformatting and tagging utterances that break the syntactic structure of the text. The chunking stage uses large-coverage, fine-grained lexical resources for general language, augmented with resources specific to spoken language. We show that the process reaches an F-measure of 84.1%.
Document type : Conference papers

https://hal-upec-upem.archives-ouvertes.fr/hal-00637677
Contributor : Matthieu Constant
Submitted on : Wednesday, November 2, 2011 - 4:15:29 PM
Last modification on : Wednesday, February 26, 2020 - 7:06:06 PM
Long-term archiving on : Friday, February 3, 2012 - 2:35:50 AM

File

blanc_constant_dister_watrin_J...
Files produced by the author(s)

Identifiers

  • HAL Id : hal-00637677, version 1

Citation

Olivier Blanc, Mathieu Constant, Anne Dister, Patrick Watrin. Corpus oraux et chunking. 27èmes Journées d'Études sur la Parole (JEP'08), 2008, France. pp.4. ⟨hal-00637677⟩
