Language Log

Audiobooks as birdsong

June 10, 2018 @ 9:33 am · Filed by Mark Liberman under Phonetics and phonology

Wonkier but more accurate title: "Generating the distribution of audiobook speech segment durations".

In "Finch linguistics" 7/13/2011, I observed that the distribution of birdsong motif repetitions indicates that the underlying process is non-Markovian in a particularly simple way: the probability of adding another motif to a zebra-finch song is not constant, but rather is an exponentially decaying function of the number of previous motif repetitions.

And in "Modeling repetitive behavior" 5/15/2015 (and posts linked therein), I suggested that this is likely to be a shared property of several sorts of repetitive behavior, primate as well as avian.

A few days ago, as a result of a conversation with João Sedoc and Tianlin Liu, I decided to apply the same idea to the distribution of speech-segment durations in (a locally re-aligned version of) the LibriSpeech corpus.
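
The contrast between a constant continuation probability and a decaying one can be made concrete with a small sketch. This is not the code behind the post: the functional form p0 * exp(-decay * n) is only one reading of "exponentially decaying function", and the names continue_prob, length_distribution, and sample_length, as well as the parameter values, are assumptions chosen for illustration.

```python
import math
import random


def continue_prob(n, p0=0.95, decay=0.15):
    """Probability of adding another unit after n previous repetitions.

    Assumed form: p0 * exp(-decay * n). With decay=0 this reduces to a
    constant probability, i.e. the memoryless (geometric) case.
    """
    return p0 * math.exp(-decay * n)


def length_distribution(p0=0.95, decay=0.15, max_len=50):
    """Exact P(length = n) for n = 1..max_len under the assumed model."""
    probs = []
    survive = 1.0  # probability that the sequence has reached length n
    for n in range(1, max_len + 1):
        p = continue_prob(n, p0, decay)
        probs.append(survive * (1.0 - p))  # stop exactly at length n
        survive *= p                       # or continue to length n + 1
    return probs


def sample_length(rng, p0=0.95, decay=0.15, max_len=1000):
    """Draw one sequence length by simulating the stop/continue decisions."""
    n = 1  # every sequence contains at least one unit
    while n < max_len and rng.random() < continue_prob(n, p0, decay):
        n += 1
    return n


if __name__ == "__main__":
    rng = random.Random(0)
    lengths = [sample_length(rng) for _ in range(10000)]
    print("mean simulated length:", sum(lengths) / len(lengths))
    print("first exact probabilities:", length_distribution(max_len=5))
```

Setting decay=0 gives the geometric distribution that a constant-probability process would produce, so comparing the two sets of probabilities shows how the decaying version thins out the long tail.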


Beyond the dreams of eagles

June 10, 2018 @ 6:44 am · Filed by Mark Liberman under Linguistics in the comics

I'm about to head to Poland for Speech Prosody 2018, and then to Helsinki for a CHIST-ERA committee meeting, so today's SMBC is especially meaningful to me:


Spelling

June 9, 2018 @ 12:11 pm · Filed by Mark Liberman under Linguistics in the comics

This strip was recently reposted (colorized) in Danielle Corsetto's webcomic Girls With Slingshots:


A Philadelphian who doesn't like cheesesteaks and hoagies

June 7, 2018 @ 12:44 pm · Filed by Victor Mair under Humor, Idioms, Insults, Language and culture

[*cheesesteak; hoagie]

Recently, a new phrase has swept through the internet in China: dìyù tuōyóupíng 地域拖油瓶.

People who introduced me to this expression told me that it refers to somebody who is not good at or who is unfamiliar with things associated with the place where he / she is from. Of course, I had no problem with dìyù 地域, which means "region(al)", but I couldn't quite grasp the nuances of 拖油瓶 in this phrase.

Originally a Wu topolecticism, syllable by syllable it literally means "drag (along) oil bottle", but as a whole it signifies "children from the previous marriage of a woman who is about to remarry" (Wiktionary); "(derog.) (of a woman) to bring one's children into a second marriage / children by a previous marriage" (MDBG).


Kim Cattrall's alveolar plosives

June 7, 2018 @ 8:58 am · Filed by Mark Liberman under Linguistics in the news, Phonetics and phonology

Caity Weaver, "Kim Cattrall Can Talk to Me About Anything", NYT 6/6/2018:

Because I’m one of the youngest people alive (29), I was not old enough to be interested in a program with “sex” in the title when “Sex and the City” premiered on HBO in 1998, 20 years ago today.

Consequently, beyond the broadest outlines of the plot — there are four friends, having sex, and the city — the only detail I know firmly about the show is: Sa-MANh-thAH TAL-hkss hLike thIS.

If you have ever seen even one second of the actress Kim Cattrall in character as Samantha Jones, the vamp of “Sex and the City,” you know what I mean. From Ms. Cattrall’s larynx, the words of Samantha slunk and shimmied across the Manhattan of the early aughts, her voice sliding around ribald puns as if extra lubricated. […]

What you might not know is that Kim Cattrall’s real voice is as unlike the voice of Samantha Jones as a late October morning is unlike a Fourth of July high noon. I know this. I know this in my bones. I know this so well the knowing will be imprinted in the DNA of my descendants for a hundred generations — because I am unable to stop listening to the same four podcast episodes featuring Ms. Cattrall, over and over.

They’re very relaxing. […]


"Loaded to bear"?

June 6, 2018 @ 12:13 pm · Filed by Mark Liberman under Idioms, Usage

Vicki Needham and Niv Elis, "Trump to face lion’s den at G-7 summit", The Hill 6/6/2018:

President Trump will walk into a lion’s den of angry allied leaders at this week’s Group of Seven summit, where he is expected to face a firestorm of criticism over his decision to hit them with steep tariffs on steel and aluminum. […]

Bill Reinsch, a trade expert with the Center for Strategic and International Studies, said Trump is likely to get an earful from the U.S. allies. […]

Reinsch said he expects the summit to be one of the most tense in recent history and said the other six countries are “loaded to bear.”


Fub

June 5, 2018 @ 7:34 pm · Filed by Victor Mair under Dialects, Language and computers, Pronunciation

The University of Pennsylvania is instituting a Two-Step Verification for PennKey WebLogins. Up till now, our PennKey for login consisted of a Username and Password. After much effort and practice, I finally mastered that. Now, however, for the sake of greater security, after using our PennKey to log in, we will in addition be asked to go through a second step that requires us to enter a randomly generated number that will be sent to us via cell phone.

That really freaked me out, since I don't have a cell phone.


Stoop to no lengths

June 5, 2018 @ 11:06 am · Filed by Mark Liberman under Idioms, Usage

Alex Isenstadt, "Trump warns supporters about 'really angry' Democrats", Politico 6/4/2018:

President Donald Trump on Monday afternoon marked 500 days in office by grimly warning supporters that Democrats are motivated to turn out for the midterm elections — and that they’re “really, really angry.”

During a national conference call with grassroots supporters to commemorate the 500-day milestone, Trump implored his backers not to become complacent ahead of the November elections because Democrats were determined to roll back his first-term accomplishments.

“It’s very important that they come out now for the midterms. Historically, they tend not to. They get a little complacent, I guess. Something happens and they tend not to. But it’s going to be very important because they are angry, the other side is really, really angry. And they stoop to no lengths. It’s an incredible thing we’re witnessing,” the president said on the 15-minute call, which was organized by the White House Office of Political Affairs.


The importance of proper parsing and punctuation

June 4, 2018 @ 8:34 pm · Filed by Victor Mair under Ambiguity, Parsing, Punctuation

Currently circulating on Facebook and on Chinese social media are seemingly impenetrable sentences with the same character repeated numerous times. When you first look at them, your eyes glaze over and you can't make any sense of them. But if you slow down and think about such sentences, you usually can figure them out without too much effort. In fact, I could read some of the following right off upon first encounter. Others required more effort before I was able to crack them.

Although it looks formidable, of the six sample sentences treated in this post, this one was easiest for me. I could understand it at one go. [N.B.: In my treatment of these sentences, I first give the Pinyin with spaces between each syllable, then repeat the Pinyin with requisite parsing and punctuation.]

1.

míng míng míng míng míng bái bái bái xǐ huān tā dàn tā jiù shì bù shuō

明明明明明白白白喜欢他但他就是不说

Míngmíng míngmíng míngbái Báibái xǐhuān tā, dàn tā jiùshì bù shuō.

"Mingming clearly knew that Baibai liked him, but he just wouldn't say it."


No dictation

June 4, 2018 @ 8:05 pm · Filed by Victor Mair under Language and education, Language teaching and learning, Spelling, Writing, Writing systems

The boy in the photos below is Alexander Aurelius Wang. He is one of our youngest fans in Shenzhen. He doesn't like writing characters from dictation (tīngxiě 听写 / 聽寫):


Corpora and the Second Amendment: Preliminaries and caveats

June 4, 2018 @ 3:23 pm · Filed by Neal Goldfarb under Language and the law

[An introduction and guide to my series of posts "Corpora and the Second Amendment" is available here. The corpus data that is discussed can be downloaded here. That link will take you to a shared folder in Dropbox. Important: Use the "Download" button at the top right of the screen.]

Before I get down to the business of discussing the corpus data and its implications for the Supreme Court's analysis in Heller, I want to say a few things about what this series of posts will and won't be about, I want to offer some caveats, and I want to outline the sequence that the posts will follow.

What the posts will and won't be about

These posts are going to focus on the meaning of the phrase keep and bear arms and on the Court's analysis of that phrase. I won't be talking about the other parts of the Second Amendment (a well-regulated militia, the security of a free state, the right of the people, and infringed).

The discussion will concentrate on linguistic issues rather than legal issues. I won't be talking about whether the Court's holding in Heller is correct. I will, however, talk about what my linguistic analysis means for Heller's conclusion that the Second Amendment's text is unambiguous and therefore that the prefatory clause plays no role in the amendment's interpretation.
