Language Log

Archive for October, 2014

Um, there's timing information in Switchboard?

October 5, 2014 @ 6:59 am· Filed by Mark Liberman under Psychology of language

We start with a psycholinguistic controversy. On one side, there's Herbert Clark and Jean Fox Tree, "Using uh and um in spontaneous speaking", Cognition 2002.

The proposal examined here is that speakers use uh and um to announce that they are initiating what they expect to be a minor (uh), or major (um), delay in speaking. Speakers can use these announcements in turn to implicate, for example, that they are searching for a word, are deciding what to say next, want to keep the floor, or want to cede the floor. Evidence for the proposal comes from several large corpora of spontaneous speech. The evidence shows that speakers monitor their speech plans for upcoming delays worthy of comment. When they discover such a delay, they formulate where and how to suspend speaking, which item to produce (uh or um), whether to attach it as a clitic onto the previous word (as in “and-uh”), and whether to prolong it. The argument is that uh and um are conventional English words, and speakers plan for, formulate, and produce them just as they would any word.

And on the other side, there's Daniel C. O'Connell and Sabine Kowal, "Uh and Um Revisited: Are They Interjections for Signaling Delay?", Journal of Psycholinguistic Research 2005:

Clark and Fox Tree (2002) have presented empirical evidence, based primarily on the London–Lund corpus (LL; Svartvik & Quirk, 1980), that the fillers uh and um are conventional English words that signal a speaker’s intention to initiate a minor and a major delay, respectively. We present here empirical analyses of uh and um and of silent pauses (delays) immediately following them in six media interviews of Hillary Clinton. Our evidence indicates that uh and um cannot serve as signals of upcoming delay, let alone signal it differentially: In most cases, both uh and um were not followed by a silent pause, that is, there was no delay at all; the silent pauses that did occur after um were too short to be counted as major delays; finally, the distributions of durations of silent pauses after uh and um were almost entirely overlapping and could therefore not have served as reliable predictors for a listener. The discrepancies between Clark and Fox Tree’s findings and ours are largely a consequence of the fact that their LL analyses reflect the perceptions of professional coders, whereas our data were analyzed by means of acoustic measurements with the PRAAT software (www.praat.org). […] Clark and Fox Tree’s analyses were embedded within a theory of ideal delivery that we find inappropriate for the explication of these phenomena.

Read the rest of this entry »

Permalink Comments (16)

Multilingual Jiang Zemin

October 4, 2014 @ 12:40 pm· Filed by Victor Mair under Multilingualism

This is an old video of Jiang Zemin berating a female reporter and defending the right of the central government in Beijing to handpick the Chief Executive of Hong Kong, in this case the first, Tung Chee-hwa. The video, which is an amazing display of Jiang's verbal pyrotechnics, is getting a lot of circulation these days, for obvious reasons. Here it is as recently posted by Shanghaiist on Facebook.

Post by Shanghaiist.

Read the rest of this entry »

Permalink Comments (2)

A record-setting pangrammatic window

October 3, 2014 @ 11:18 pm· Filed by Ben Zimmer under Language and technology, Language on the internets, Language play

A few months ago, I posted here (and on Slate's Lexicon Valley blog) about PangramTweets, a bot created by Jesse Sheidlower that combs Twitter for tweets that include all 26 letters of the alphabet. I mentioned that it would be interesting to see if PangramTweets turns up any particularly short "pangrammatic windows," i.e., pangrammatic strings in naturally occurring text. At the time, the shortest known example was 42 letters long, in a passage from Piers Anthony's Cube Route:

"We are all from Xanth," Cube said quickly. "Just visiting Phaze. We just want to find the dragon."

My post inspired Malcolm Rowe, a software engineer at Google, to set about finding short pangrammatic windows in an automated fashion, first on the Project Gutenberg corpus and then on the megacorpus of web pages indexed by Google. (Let's hear it for Google's 20 percent time!) On his blog, Malcolm now reports on his findings, including the discovery of a 36-letter pangrammatic window that appeared in a review of the movie Magnolia on PopMatters:

Further, fractal geometries are replicated on a human level in the production of certain “types” of subjectivity: for example, aging kid quiz show whiz Donnie Smith (William H. Macy) and up and coming kid quiz show whiz Stanley Spector (Jeremy Blackman) are connected (or, perhaps, being cloned) in ways they couldn’t possibly imagine.

Read the rest of this entry »

Permalink Comments (14)

Plural data

October 3, 2014 @ 4:34 pm· Filed by Mark Liberman under Linguistics in the comics

Today's xkcd:

Mouseover title: "If you want to have more fun at the expense of language pedants, try developing an hypercorrection habit."

That should be "…developing another hypercorrection habit", since making data plural in that situation is exactly analogous to using whom in "Whom are you, anyways?". But then, as Ben Zimmer has pointed out to me, that would spoil the joke involved in the choice of an in "an hypercorrection".

Permalink Comments (37)

Translating the Umbrella Revolution

October 3, 2014 @ 10:57 am· Filed by Victor Mair under Topolects, Translation

Far from prohibiting translation (see the last item here), the young demonstrators in Hong Kong are offering free translation services for the media and others who may be in need of them.

The following photograph was shared on Twitter by Newsweek's Lauren Walker:

Read the rest of this entry »

Permalink Comments (6)

Half-fast

October 3, 2014 @ 3:46 am· Filed by Mark Liberman under Humor

From David Donnell:

"Not for nothin'," as the native NY'ers say, but I saw this commercial on the idiot-box tonight and was tickled by the play on words. Surprised to google and discover "half-fast" has been around for some time. But the TV ad still makes me laugh!

Permalink Comments (33)

Tasteless coffee

October 2, 2014 @ 9:22 pm· Filed by Victor Mair under Lost in translation

From "Signspotting around the world: Funny fails", a "Lonely Planet travel signs" feature of CNN Travel, I have selected an ensemble of four signs to illustrate different types of translation difficulties.

The first was spotted in a Beijing cafe:

Read the rest of this entry »

Permalink Comments (15)

Must-read for Wednesday afternoon

October 1, 2014 @ 3:23 pm· Filed by Mark Liberman under Sociolinguistics

Josef Fruehwald, "America's Ugliest Accent: Something's ugly alright", Val Systems 10/1/2014.

Update — See "The beauty of Brummie", 7/28/2004 — some quotes therein from Steve Thorne:

In May 2002, I recorded short samples of 20 different accents of English… In order to limit the influence of extraneous variables, the speakers chosen were all male, white, aged between 35 and 40, and upper-working to lower-middle class. These recordings were played to 96 native and 109 non-native English speakers who were then asked to briefly describe each accent and rate each one on a scale of 1-10 (1 = very unpleasant, 5 = neutral, 10 = very pleasant). […]

Read the rest of this entry »

Permalink Comments (35)

« Previous Page

Next Entries »

Archive for October, 2014

Um, there's timing information in Switchboard?

Multilingual Jiang Zemin

A record-setting pangrammatic window

Plural data

Translating the Umbrella Revolution

Half-fast

Tasteless coffee

Must-read for Wednesday afternoon

Follow us on Twitter

Archives [+/–]

Blogroll [+/–]

Meta