New models of speech timing?
There are many statistics used to characterize timing patterns in speech, at various scales, with applications in many areas. Among them:
- Intervals between phonetic events, by category and/or position and/or context;
- Overall measures of speaking rate (words per minute, syllables per minute), relative to total time or total speaking time (leaving out silences);
- Mean and standard deviation of speech segment and silence segment durations;
- …and so on…
There are many serious problems with these measures. Among the more obvious ones:
- The distributions are all far from "normal", and are often multi-modal;
- The timing patterns have important higher-order and contextual regularities;
- The timing patterns of segments/syllables/words and the timing patterns of phrases (i.e. speech/silence) and conversational turns are arguably (aspects of) the same thing at different time scales;
- Connections with patterns of many other types should also be included — phonetic and syllabic dynamics, pitch patterns, rhetorical and conversational structure, …
Read the rest of this entry »