Introduction of the speaking rate in the model of speech recognition

doi:10.60692/8jfs0-qt340

Published November 7, 2002 | Version v1

Publication Open

1. Mohamed I University

We propose an improvement to the centisecond TLHMM model as applied to sound duration. Because the distribution of sound durations depends on the speaking rate, the model must be adapted in a post-processing step. We study this adaptation by proposing a model of the speaking rate based on the average syllable duration. Experiments carried out on a set drawn from BDSONS show the value of this approach. This work continues that of (Meziane et al., 1999) and (Suaudeau, 1994).

Files

Name	Size
260406.pdf.pdf md5:1a12adfd829d9875969fd96c60233d3f	15.8 kB

Additional details

Translated title (Arabic): ادخال معدل التحدث في نموذج التعرف على الكلام
Translated title (French): Introduction du taux de parole dans le modèle de reconnaissance vocale
Translated title (Spanish): Introducción de la velocidad de habla en el modelo de reconocimiento de voz

Other: https://openalex.org/W2148868227
DOI: 10.1109/pcee.2000.873603

Is Global South Knowledge: Yes
Country: Morocco

https://openalex.org/W1483979448
https://openalex.org/W2019728943
