Audiobooks as birdsong
Wonkier but more accurate title: "Generating the distribution of audiobook speech segment durations".
In "Finch linguistics" 7/13/2011, I observed that the distribution of birdsong motif repetitions indicates that the underlying process is non-Markovian in a particularly simple way: the probability of adding another motif to a zebra-finch song is not constant, but rather is an exponentially-decaying function of the number of previous motif repetitions.
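To make the contrast concrete, here is a minimal sketch of the two kinds of process, not the code behind the original analysis. It assumes the continuation probability after n motifs has the form alpha * exp(-beta * n). The names `alpha` and `beta` and their values are illustrative assumptions, not fitted parameters. Setting `beta = 0` recovers the constant-probability (Markovian) case, which yields a geometric distribution of repetition counts.

```python
import math

def continue_prob(n, alpha=0.9, beta=0.2):
    """Probability of adding another motif after n motifs have been sung.

    alpha and beta are hypothetical parameters chosen for illustration;
    beta = 0 gives a constant continuation probability.
    """
    return alpha * math.exp(-beta * n)

def length_distribution(max_len=50, alpha=0.9, beta=0.2):
    """Return [P(N = 1), ..., P(N = max_len)] for the number of motifs N."""
    probs = []
    reach = 1.0  # probability that the song reaches k motifs at all
    for k in range(1, max_len + 1):
        c = continue_prob(k, alpha, beta)
        probs.append(reach * (1.0 - c))  # stop after exactly k motifs
        reach *= c                       # otherwise continue to motif k + 1
    return probs

def tail_prob(k, alpha=0.9, beta=0.2):
    """P(N > k): the song continues after each of the first k motifs."""
    p = 1.0
    for n in range(1, k + 1):
        p *= continue_prob(n, alpha, beta)
    return p

if __name__ == "__main__":
    decaying = length_distribution(beta=0.2)
    constant = length_distribution(beta=0.0)
    for k in range(1, 11):
        print(k, round(decaying[k - 1], 4), round(constant[k - 1], 4))
```

Under these assumptions, the decaying model puts much less probability on long runs of motifs than the constant-probability model does, which is the qualitative signature at issue.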
And in "Modeling repetitive behavior" 5/15/2015 (and posts linked therein), I suggested that this is likely to be a shared property of several sorts of repetitive behavior, primate as well as avian.
A few days ago, as a result of a conversation with João Sedoc and Tianlin Liu, I decided to apply the same idea to the distribution of speech-segment durations in (a locally re-aligned version of) the LibriSpeech corpus.