Archive for Phonetics and phonology

Was it "getting" or "gay"?

Patrick Redford, "There's Nothing To Say About What Andrew Wiggins Said That's Not Conditional", Deadspin 1/9/2018:

Andrew Wiggins went off for 40 points on the Thunder last night in a lively game that featured 32-year-old interim coach Ryan Saunders getting his first win and Thunder guard Dennis Schröder getting ejected for shoving. Wiggins was asked about Schröder’s ejection after the game, and he either said, “He was getting—he was acting crazy,” or, “He was gay. He was acting crazy.” Those are obviously two very different quotes, and as much as I think he’s mumbling “getting,” the tape is ultimately inconclusive.

"Andrew Wiggins: Would never disrespect LGBTQIA community", ESPN 1/9/2018:

Hours after he called Oklahoma City Thunder guard Dennis Schroder "gay," Minnesota Timberwolves forward Andrew Wiggins sought to clarify his remark, saying early Wednesday morning that he wouldn't use "any term to disrespect" the LGBT community.

Read the rest of this entry »

Comments (21)

Of reindeer and Old Sinitic reconstructions

This is a piece that I've been meaning to write for a long time, but never found the opportunity.  Now, inspired by the season and about to embark on extended holiday travel, I'm determined not to put it off for yet another year.

The genesis of my ruminations on this topic are buried in decades-old tentative efforts to identify the fabulous creature known in Chinese myth as the qilin (Hanyu Pinyin), also spelled as ch'i2-lin2 (Wade-Giles Romanization) and kirin in Japanese, which the whole world knows as the name of a famous beer (fanciful, stylized depictions of the kirin are to be found on bottles and cans of the beer).

The qilin is usually referred to in English as a kind of unicorn, but I knew that couldn't be right, since no account of the qilin from antiquity describes it as having only one horn.  The Chinese xièzhì 獬豸 ("goat of justice") does have a single, long, pointed horn, but that is another matter, for which see "Lamb of Goodness, Goat of Justice" (pp. 86-93) in Victor H. Mair, "Religious Formations and Intercultural Contacts in Early China," in Volkhard Krech and Marion Steinicke, ed., Dynamics in the History of Religions between Asia and Europe:  Encounters, Notions, and Comparative Perspectives (Dynamics in the History of Religion, 1 [Ruhr-Universität Bochum]) (Leiden:  Brill, 2011), pp. 85-110 (available on Google Books).  Since customs pertaining to the goat of justice, as with the reindeer, existed in cultures spread across northern Eurasia, I suspect that an extra-Sinitic loanword may also be lurking behind xièzhì 獬豸.

Read the rest of this entry »

Comments (58)

Of jackal and hide and Old Sinitic reconstructions

[The first page of this post is a guest contribution by Chris Button.]

I've been thinking a little about the word represented by chái 豺* which I would normally reconstruct as *dzrəɣ (Zhengzhang *zrɯ) ignoring any type a/b distinctions. However, it occurred to me that a reconstruction of *dzrəl (for which Zhengzhang would presumably have *zrɯl) would give the same Middle Chinese reflex (I'm not citing Baxter/Sagart since they don't support lateral codas presumably for reasons of symmetry). I'm not sure if outside of its phonetic speller cái 才 there is any reason to go with -ɣ rather than -l in coda position for 豺. However, if we go with a lateral coda as *dzrəl, it looks suspiciously similar to Old Iranian šagāl from Sanskrit śṛgāla (perhaps even more so if we fricativize the Old Iranian /g/ to /ɣ/ intervocalically as in modern Persian).

[*VHM:  This is always a challenging word for translators.  "jackal" and "dhole" are two possibilities.]

Read the rest of this entry »

Comments (22)

Really weird sinographs, part 4

A video introducing 70 obscure Chinese characters (shēngpì zì 生僻字):

Read the rest of this entry »

Comments (6)

A new Sinograph

On being ugly and poor, with an added note on consumerism.

Every so often, for one reason or another, somebody creates a completely new Chinese character.  Here's the latest:

Read the rest of this entry »

Comments (23)

A better way to calculate pitch range

Today's topic is a simple solution to a complicated problem. The complicated problem is how to estimate "pitch range" in recordings of human speakers. As for the simple solution — wait and see.

You might think that the many differences between the perceptual variable of pitch and the physical variable of fundamental frequency ("f0") arise because perception is complicated and physics is simple. But if so, you'd be mostly wrong. The biggest problem is that physical f0 is a complex and often fundamentally incoherent concept. And even in the areas where f0 is well defined, f0 estimation (usually called "pitch tracking") is prone to errors.

Read the rest of this entry »

Comments (9)

Of knots, pimples, and Sinitic reconstructions

A couple of months ago, we talked about gēda 疙瘩, which is one of those very cool, two syllable Sinitic words, neither of whose syllables means anything by itself (i.e., not only is it a disyllabic lexeme, it is also a disyllabic morpheme).  Furthermore, gēda 疙瘩 is highly polysemous, with the following meanings:  "pimple; knot; swelling on the skin; lump; nodule; blotch; a knot in one's body or heart (–> hangup; problem; preoccupation)".

See "Too hard to translate soup" (9/2/18).

Read the rest of this entry »

Comments (14)

The holy day of [??]

From Elijah Granet:

In Trump’s recent remarks on the Squirrel Hill synagogue shooting, he (at ~1:12 in the linked video) refers to the shooting taking place “on the holy day of Sabbath”, pronouncing “Sabbath” in a bizarre way, with the vowels completely off. My best guess—and a few people on Twitter seem to agree— is that the teleprompter actually said “Shabbat”, but Trump was unsure how to pronounce it, and ended up saying “Sabbath”. Still, it seems a bit strange since Shabbat is not a particularly hard word to read phonetically, and given that Trump’s daughter and son-in-law are shomer shabbos, one would think he had heard the word before. It’s equally possible that Trump didn’t know the word “Sabbath” and was trying to give it a “Jewish” pronunciation, and thus over-corrected.

The audio for the phrase "on the holy day of Sabbath":

And just the pronunciation of "Sabbath":

Read the rest of this entry »

Comments (25)

Pinyin for phonetic annotation

One more reason for me to love Wikipedia.

I just noticed in this article on Chinese honorifics that some example sentences are phonetically annotated with Pinyin.  Not only that, it observes properly spaced word division, which must be technically difficult to achieve.  Furthermore, the Pinyin annotations are appropriately small, yet clear.

I don't know how widespread this usage has become in Wikipedia or elsewhere, but I can tell you that learning about it this morning brought me great joy.

Read the rest of this entry »

Comments (19)

Word, syllable, morpheme, phoneme

What is the basic unit of discursive, communicative language — word, syllable, morpheme, or phoneme?

This topic came up in the comments to the following posts:

"The concept of word in Sinitic" (10/3/18)

"Words in Vietnamese" (10/2/18)

"Diacriticless Vietnamese on a sign in San Francisco" (9/30/18)

"Words in Mandarin: twin kle twin kle lit tle star" (8/14/12)

Read the rest of this entry »

Comments (37)

Too hard to translate soup

From a menu in a restaurant in Oxford, Ohio:

Read the rest of this entry »

Comments (38)

LRNLP 2018

On Monday, I'm pursuing the quixotic enterprise of talking to an NLP workshop about phonetics.

LRNLP ("Language Resources for NLP") 2018 is a workshop associated with COLING 2018 in Santa Fe NM.  My abstract:

Semi-automatic analysis of digital speech collections is transforming the science of phonetics, and offers interesting opportunities to researchers in other fields. Convenient search and analysis of large published bodies of recordings, transcripts, metadata, and annotations – as much as three or four orders of magnitude larger than a few decades ago – has created a trend towards “corpus phonetics,” whose benefits include greatly increased researcher productivity, better coverage of variation in speech patterns, and essential support for reproducibility.

The results of this work include insight into theoretical questions at all levels of linguistic analysis, as well as applications in fields as diverse as psychology, sociology, medicine, and poetics, as well as within phonetics itself. Crucially, analytic inputs include annotation or categorization of speech recordings along many dimensions, from words and phrase structures to discourse structures, speaker attitudes, speaker demographics, and speech styles. Among the many near-term opportunities in this area we can single out the possibility of improving parsing algorithms by incorporating features from speech as well as text.

Due to semester-initial commitments at Penn, I won't be able to stay for COLING, but I'm looking forward to an interesting day of presentations at the workshop.

 

Comments (2)

All the thyme in the world

A small recent experience with herb-related synchronicity reminded me of some very interesting phonetics research.

This all began a couple of weeks ago, when Larry Hyman and I were discussing what seasonings to use on meat to be grilled for dinner. I suggested thyme, and Larry reminded me of Suzanne Gahl's 2008 paper "Time and Thyme Are not Homophones: The Effect of Lemma Frequency on Word Durations in Spontaneous Speech":

Frequent words tend to shorten. But do homophone pairs, such as time and thyme, shorten equally if one member of the pair is frequent? This study reports an analysis of roughly 90,000 tokens of homophones in the Switchboard corpus of American English telephone conversations, in which it was found that high-frequency words like time are significantly shorter than their low-frequency homophones like thyme. The effect of lemma frequency persisted when local speaking rate, predictability from neighboring words, position relative to pauses, syntactic category, and orthographic regularity were brought under statistical control. These findings have theoretical implications for the locus of frequency information in linguistic competence and in models of language production, and for the role of articulatory routinization in shortening.

Read the rest of this entry »

Comments (20)