Cambodian voice traffic

A Rest of World article from November that I missed when it first came out, but am posting on now because it speaks to the comments on several recent Language Log posts (e.g., here and here):

"Fifty percent of Facebook Messenger’s total voice traffic comes from Cambodia. Here’s why:

Keyboards weren't designed for Khmer. So Cambodians have just decided to ignore them", By Vittoria Elliott and Bopha Phorn (12 November 2021)

The first four paragraphs of this longish article

In 2018, the team at Facebook had a puzzle on their hands. Cambodian users accounted for nearly 50% of all global traffic for Messenger’s voice function, but no one at the company knew why, according to documents released by whistleblower Frances Haugen.

One employee suggested running a survey, according to internal documents viewed by Rest of World. Did it have to do with low literacy levels? they wondered. In 2020, a Facebook study attempted to ask users in countries with high audio use, but was only able to find a single Cambodian respondent, the same documents showed. The mystery, it seemed, stayed unsolved.

The answer, surprisingly, has less to do with Facebook, and more to do with the complexity of the Khmer language, and the way users adapt for a technology that was never designed with them in mind.

Read the rest of this entry »

Comments (6)


Sanctioned behaviors/ideas/methods?

In the comments on Tuesday's "Come and go" post, Andrew Gelman wrote

Here's an example: the statistician Steve Stigler quoted as saying, “I don’t think in science we generally sanction the unequivocal acceptance of significance tests.” Unfortunately, I have no idea what he means here, given the two completely opposite meanings of the word “sanction.”

and Philip Anderson responded

In British English at least, it’s possible to sanction people, or organisations/states, with the sense of imposing sanctions on them (although it sounds strained to me), but if behaviour, or an idea, is sanctioned, it can only mean permitted. So I see no ambiguity in your example.

So this morning's Breakfast Experiment™ is a preliminary peek into this issue.

tl;dr: I share Andrew's intuition rather than Philip's — but the data seems to offer Philip (at least statistical) support.

Read the rest of this entry »

Comments (17)


Betelgeuse in Greek, Latin, Arabic, English, and Chinese

AntC led me down a deep, dark rabbit hole by asking:  "Hi Professor Mair, is the Contributing Writer confused, or is it the interwebs?"

He was prompted to ask that question by having read the following statement in this article, "Orion’s love affair, Shen Xiu’s long-distance friendship on Taiwan’s winter sky", Taiwan News, by P.K. Chen, Contributing Writer (2/8/22):

The Greek constellation Orion is called “Shen Xiu” (參宿, “The Three Stars”) in China; “Shen” or “three” refers to the three stars on Orion’s belt, while “Xiu” or “place for rest” refers to where the moon remains fixed and “rests.”

Trying to figure out the relationships among the names of the constellation and its constituent stars in Greco-Latin and Sinitic nomenclature ate up an entire evening.  To start with, there are many possibilities for how to pronounce 參宿, the Chinese equivalent to Orion (constellation name): sānsù (Google Traslate), cānsù (zdic), shēnxiù (Wiktionary).  So we've got a lot of variation involving both characters of the term.  But that's just the beginning of our attempts to grapple with the language and lore concerning Sinoxenic words for Orion. 

Read the rest of this entry »

Comments (43)


Fay-Cutler malapropism of the week

Also the funniest:

Read the rest of this entry »

Comments (20)


Pen scanner

New product:

With the Scanmarker Air no more Retyping- Simply Scanning!

Scan any text in a document or book and it's instantly available on your PC/Mac in any program including Word, Google Docs, Evernote and more. You can also use it on your smartphone/tablet with our app.

  • Super Easy to use
  • Scanmarker Air is 30 times faster than manual retyping
  • Scans up to 3,000 characters a minute and will save hours of tedious work
  • Can read aloud any scanned text
  • Instant translation to over 70 languages- including reading the translation aloud!

Read the rest of this entry »

Comments (25)


Moth eyebrows: lectio difficilior et tertium comparationis

Dieter Maue, a specialist on Old Uyghur, Tocharian, Sanskrit, and Brahmi script, wrote to ask:

The simile 'like the moon of the third day' (tertium comparationis: delicate, graceful; curved (eyebrows)) is currently occupying my mind. Attested in Tocharian A and in Uigur, it sounds, but it doesn't seem to be, Indian.

Tentatively I have translated Uig. üč yaŋıdakı ay täŋri ‘third day’s moon god’ into Chinese word for word; but sān rì yuè 三日月("moon of the third day") is not found in the dictionaries. In the Chinese Tripitaka, there is just one suitable instance. Elsewhere, the moon of the third day seems to be called éméi yuè 蛾眉月 ("moth eyebrow moon" — only poetically?). According to Giles (ChinEnglDict s.no. 7714 ): “ éméi 蛾眉 moth eyebrows, – alluding to the delicate curved eye-markings of the silkworm moth … moth-eyebrows is used figuratively for a lovely girl.   Also wrongly explained as referring to the small curved antennæ of the silkworm moth. ­ Éméi yuè 蛾眉月‚ the crescent moon’. “  The antennae of Bombyx mori are clearly visible, while I cannot find anything which corresponds to  the “eye-markings”. Do you have an idea how to solve the problem?

Read the rest of this entry »

Comments (11)


Forms and meanings of "come and go"

"NBC created ‘boomerang effect’ by refusing to run ad calling out China, Olympics: Concha", Fox News 2/6/2022:

You just played
a- a clip from that ad, right?
And all over social media,
people are now watching this ad when maybe,
if it aired on NBC,
it would have came
and gone

The end of this clip is obviously a substitution for "it would have come and gone" — and Mr. Concha apparently noticed the problem as he spoke, resulting in the 330 msec. silence after "came":

But this is Language Log, not Minor Talking Head Speech Errors Log. So what's the point?

Read the rest of this entry »

Comments (21)


Peter Stephen Du Ponceau and Vietnamese dictionaries

From Connected (2/4/22), a publication of the Peabody Essex Museum:

"Phillips Library digitizes dictionaries from Vietnam and unlocks stories of museum founders and their travels", by Kathlene Baldanza

The blog post is accompanied by beautiful images of pages from the dictionaries.  Here are the first three paragraphs:

Two recently digitized manuscript dictionaries in the Phillips Library collection are once again sparking conversation. In 1819, John White, a lieutenant in the US Navy, received dictionaries from an Italian Catholic priest named Joseph Morrone in Saigon and deposited them with the East India Marine Society in Salem. The members of the East India Marine Society were the founders of what is today the Peabody Essex Museum. Published in the US in 1838, the dictionaries fueled a trans-Atlantic debate about the nature of Asian languages. Catholic missionaries, their Vietnamese interlocutors, and Salem mariners made the initial connections that allowed for the scholarly conversation that played out in the pages of journals including The North American Review, The Foreign Quarterly Review, and The Canton Register.

Read the rest of this entry »

Comments (20)


Taishan and Chinatown

From Bob Ramsey:


Pell Street in New York’s Chinatown, 1899

Read the rest of this entry »

Comments (30)


The meaning of filled pauses?

Comments (12)


Ask Language Log: When substituting synonyms fails

From S.S-L.:

I have a 2.5-year-old daughter, so I've been paying close attention to traps for the unwary in English-language learning. One thing I'm catching are instances where it feels like x is a synonym for y, and hence that you can just regex-replace every instance of x with y, and yet that fails. A couple instances:

    1.  She has a teddy bear named Tobias. She said both "Let's clean up Tobias" and "Let's clean up him." The former is perfectly good English; the latter is definitely not. A naïve understanding of how pronouns work might lead you to believe that you could just find-and-replace proper nouns with pronouns, but it's not true.
    2.  One might naïvely think that "to allow" and "to let" are synonyms, and hence that you could sub the one for the other, but it's "I allow him to do [something]" versus "I let him do something." The former requires "to be" as a helping verb; the latter does not.

When I hear examples like this, I try to imagine programming a computer to generate valid sentences, or at least imagine putting together the minimal set of rules for a new English learner. Is there some nice compact way of representing all these sorts of exceptions to naïve synonymy? Or is it really just a long list that native English speakers have long since internalized?

Read the rest of this entry »

Comments (49)


Nose, iris, pupil

Last week, a master's student went to the board to write the Chinese character for "nose" (bí 鼻), but forgot how to do so.  There is no simplified version.  The form of this character differs slightly between China and Japan:  in China it is 鼻 and in Japan it is 鼻.  Can you spot the difference?

Believe it or not, the top part of the character depicts a nose.  Here's the small seal script form, about two millennia ago (the bottom part is the phonophore, which was added long after the top part was invented):

鼻-seal.svg

Glyph origin

Phono-semantic compound (形聲, OC *blids): semantic (nose) + phonetic (OC *pids).

(OC *ɦljids) originally meant “nose” but came to be used to mean “self”, so the sense of “nose” has been replaced by (OC *blids). Some scholars interpret (OC *blids) as a combination of a nose ( (OC *ɦljids)) and two lungs ( (OC *pids)).

Read the rest of this entry »

Comments (30)


Sun Yat-sen, Chiang Kai-shek, Mao Zedong

Under the rubric, "An Odd Question", Doug Adams (the Tocharianist) asked:

Why do we always refer to Sun Yat-sen (1866-1925) and Chiang Kai-shek (1887-1975) in Cantonese (?) phonological form rather than Mandarin?

Simple reply

Before about 1975, Cantonese was by far the most widespread and prevalent Sinitic language around the world outside of China, and Sun's Cantonese art name, Yat-sen, was so deeply ingrained and familiar in English for decades — both in speech and in writing — that it would have been very difficult to change it to Mandarin Yìxiān 逸仙 ("Liberated Transcendent").  Anyway, he had many other different names for different purposes, and some of them were as popular as Yat-sen, e.g., Chung-shan / Zhongshan, which actually derives from a Japanese pseudonym / nom de guerre (Nakayama Kikori [see below]) given to him by a Japanese friend.  Chung-shan / Zhongshan 中山 was / is so widespread in China that his hometown was renamed after it, making Zhongshan one of the few cities in China to be named after a person.  Zhongshan is also used as the name of the style of jacket that Sun Yat-sen liked to wear:  Zhongshan suit (simplified Chinese: 中山装; traditional Chinese: 中山裝; pinyin: Zhōngshān zhuāng), but in the PRC it came to be known as the Mao suit.  (I'm the proud owner of a Zhongshan suit, which I had tailor made in Taipei in 1971.)  There are dozens of other things and places called Zhongshan in China, a few of them referring to states from much earlier times that are completely unrelated to Sun Yat-sen / Zhongshan, for which see here.

Read the rest of this entry »

Comments (13)