当前位置： SCI文献检索 > LANGUAGE AND SPEECH期刊下所有文献 > Spontaneous speech events in two speech databases of human-computer and human-human dialogs in Spanish.

Spontaneous speech events in two speech databases of human-computer and human-human dialogs in Spanish.

Abstract：

:Previous works in English have revealed that disfluencies follow regular patterns and that incorporating them into the language model of a speech recognizer leads to lower perplexities and sometimes to a better performance. Although work on disfluency modeling has been applied outside the English community (e.g., in Japanese), as far as we know there is no specific work dealing with disfluencies in Spanish. In this paper, we follow a data driven approach in exploring the potential benefit of modeling disfluencies in a speech recognizer in Spanish. Two databases of human-computer and human-human dialogs are considered, which allow the absolute and relative frequencies of disfluencies in the two situations to be compared. The rate of disfluencies in human-human dialogs is found to be very close to that found for similar databases in English. Due to setup factors, the rate of disfluencies found in human-computer dialogs was remarkably higher than that reported for similar databases in English. In any case, from the point of view of speech recognition, the high frequencies of disfluencies and the distinct features of the acoustic events related to them support the need for explicit acoustic models. The regularities observed in the distribution of filled pauses and speech repairs reveal that including them in the language model of the speech recognizer may be also helpful. The extent to which the number of events depends on utterance length and on the speaker is also explored. Statistics are shown that follow previous studies for English, and a sizeable space is devoted to comparing our results with them. Finally, various possible cues for the automatic detection of speech repairs--a key issue from the point of view of speech understanding--are explored: silent pauses, filled pauses, lengthenings, cut off words and discourse markers. As previously observed for English, none of them was found to be reliable by itself. More information, especially at the acoustic-prosodic level, is no doubt needed to reliably detect speech repairs.

journal_name

Lang Speech

journal_title

Language and speech

authors

Rodríguez LJ,Inés Torres M

doi

10.1177/00238309060490030201

subject

Has Abstract

pub_date

2006-01-01 00:00:00

pages

333-66

issue

Pt 3

eissn

0023-8309

issn

1756-6053

journal_volume

pub_type

杂志文章

在线工具

Spontaneous speech events in two speech databases of human-computer and human-human dialogs in Spanish.