Skip to Main content Skip to Navigation
Conference papers

Corpus oraux et chunking

Abstract : This paper describes a process of partial parsing of a spontaneous spoken corpus in French. It is based on a preprocessing stage that consists in reformatting and tagging utterances that breaks the syntactic structure of the text. The chunking stage uses large-coverage and fine-grained lexical resources for general language that have been augmented with resources specific to spoken. We show that it reaches a score of 84.1% f-measure.
Document type :
Conference papers
Complete list of metadata
Contributor : Matthieu Constant Connect in order to contact the contributor
Submitted on : Wednesday, November 2, 2011 - 4:15:29 PM
Last modification on : Saturday, January 15, 2022 - 3:57:21 AM
Long-term archiving on: : Friday, February 3, 2012 - 2:35:50 AM


Files produced by the author(s)


  • HAL Id : hal-00637677, version 1


Olivier Blanc, Mathieu Constant, Anne Dister, Patrick Watrin. Corpus oraux et chunking. 27èmes Journées d'Études sur la Parole (JEP'08), 2008, France. pp.4. ⟨hal-00637677⟩



Record views


Files downloads