Skip to Main content Skip to Navigation
Journal articles

A stochastic model of voice generation and the corresponding solution for the inverse problem using Artificial Neural Network for case with pathology in the vocal folds

Abstract : A novel stochastic model to produce voiced sounds is proposed and, mainly, the corresponding identification of some model parameters using an Artificial Neural Network (ANN). The procedure described in this paper is about an intermediate step, which has as final objective to identify pathologies in the vocal folds through the voice of patients, that is, through a non-invasive method. The proposed model presented here uses the source-filter Fant theory and three main novelties are presented: a new mathematical model to produce voice obtained from the unification of two other deterministic one mass-spring-damper models obtained from the literature; a stochastic model that can generate and control the level of jitter resulting even in hoarse voice signals and/or with pathological characteristics but using a simpler model than those usually discussed in the literature; and the most important novelty, the identification of parameters of the proposed model, from experimental voice signals, using an ANN, particularly in a pathological case. The proposed neural network-based identification method requires a construction of a database from which an ANN can be trained to learn the nonlinear relationship between the parameters of the stochastic model and some relevant quantities of interest. The corresponding inverse stochastic problem is then solved in two cases: for one utterance corresponding to a normal voice and for another utterance corresponding to a pathological case corresponding to a nodulus in the vocal folds, helping to validate the model.
Complete list of metadata

https://hal-upec-upem.archives-ouvertes.fr/hal-03193501
Contributor : Christian Soize <>
Submitted on : Sunday, April 18, 2021 - 4:40:27 PM
Last modification on : Monday, April 19, 2021 - 2:08:32 PM
Long-term archiving on: : Monday, July 19, 2021 - 6:18:02 PM

File

BSPC_cataldo_soize_2021_prepri...
Files produced by the author(s)

Identifiers

  • HAL Id : hal-03193501, version 1

Collections

Citation

Edson Cataldo, Christian Soize. A stochastic model of voice generation and the corresponding solution for the inverse problem using Artificial Neural Network for case with pathology in the vocal folds. Biomedical Signal Processing and Control, Elsevier, 2021, 68, pp.102623. ⟨hal-03193501⟩

Share

Metrics

Record views

18

Files downloads

22