Perils of topic modeling

Today's xkcd illustrates why topic modeling can be tricky, for people as well as for machines:

The mouseover title: "As the 'exotic animals in homemade aprons hosting baking shows' YouTube craze reached its peak in March 2020, Andrew Cuomo announced he was replacing the Statue of Liberty with a bronze pangolin in a chef's hat."

Read the rest of this entry »

Comments (11)


"This laptop is loaded to bear"

Ewan Spence, "Apple Leak Reveals Radical New MacBook Pro", Forbes 5/4/2020:

Apple may finally be getting round to updating the 13-inch MacBook Pro with Intel’s tenth generation processors. The good news is that the MacOS powered laptop going to get a bucketload of extra power.[…]

This laptop is loaded to bear in terms of memory and storage as well. The current 13-inch MacBook Pro can be upgraded to 16 GB of RAM and 2 TB of storage, so we’re looking at a doubling of the core specs.

Read the rest of this entry »

Comments (18)


The sound and sense of Tocharian

Readers of Language Log will certainly be aware of Tocharian, but when I began my international research project on the Tarim Basin mummies in 1991, very few people — only a tiny handful of esoteric researchers — had ever heard of the Tocharians and their language since they went extinct more than a millennium ago, until fragmentary manuscripts were discovered in the early part of the 20th century and were deciphered by Sieg und Siegling (I always love the sound of their surnames linked together by "und"), two German Indologists / philologists — Emil Sieg (1866-1951) and Wilhelm Siegling (1880-1946), in the first decade of the last century.

Read the rest of this entry »

Comments (10)


French (near) homonyms – "calembours pourris"

[h/t Stephan Hurtubise]

Read the rest of this entry »

Comments (13)


Metathesis in action

At the end of the May 1 episode of the NPR show "Milk Street", host Christopher Kimball interviews Dr. Aaron Carroll about a recent California court decision that could force coffee to come with a label warning that it contains a chemical known to cause cancer.

The chemical in question is acrylamide, and it's apparently created (in small quantities) whenever carbohydrates are heated above about 250 degrees farenheit — so bread, crackers, cake, cookies, pizza, pretzels, fried potatoes, corn chips, and lots of other things besides coffee that most people eat regularly. Dr. Carroll argues that the quantities of acrylamide involved are far too small to pose any measurable danger, and that warnings like this one have the bad effect of persuading people to ignore all such messages.

But this is Language Log, not Cancer Warning Over-Reach Log, so what's the linguistic point? It's the way that Dr. Carroll pronounces the name of the chemical in question.

Read the rest of this entry »

Comments (51)


"The old man at the pass loses his horse"

For many years, Melinda Takeuchi, professor of Japanese art history at Stanford, regularly competed with horse and carriage in combined driving events.  Here's an example of what the sport looks like.

Not long ago, her carriage driving days came to an abrupt end due to an accident, which she describes thus:

I had a horrendous carriage wreck a couple of years ago — 5 dashing deer spooked my horse and she bolted. carriage flipped. i was life-flighted to stanford emergency where they discovered 8 broken ribs and a malignant cyst in the pancreas. by one of those crazy serendipitous miracles, the cancer was discovered in time to blitz it. so i survived against all odds, but my daredevil days are over. thank the goddess for horses in these days of shelter in place.

Read the rest of this entry »

Comments (6)


The importance of archeology for historical linguistics

The last two comments, here and here, to this post ("Once more on Sinitic *mraɣ and Celtic and Germanic *marko for 'horse'" (4/28/20), like hundreds of others that have been posted on Language Log over the years, show how linguists need to at least think about the significance of archeological findings for their deliberations.  It would be folly to completely ignore evidence from archeology when attempting to clarify the development of language.  Indeed, archeological materials that are securely dated and identified with regard to culture type provide a benchmark for historical linguistic research.

Read the rest of this entry »

Comments (1)


Rire la Rémumligne!

Comments (7)


Fractured Japanese-to-English translation on amazon.com

From Paul Shore:

I don't know whether the item below, an Amazon translation of an Amazon customer review, is Language-Log-worthy; but I thought that at the very least you might be amused by its sublime anti-logic.  The January 1, 2017 review, written by "横川いずみ", is of Freedom Betrayed, Herbert Hoover's massive, radical critique of U.S. foreign relations from the thirties to the fifties, which wasn't published until 2011, roughly a half-century after Hoover completed it.  In the heading, 横川いずみ rates the book five stars out of five and "[v]ery good".  The Japanese original of the review text is as follows:

Read the rest of this entry »

Comments (10)


Another kind of political lip-syncing

I've previously featured comedy turns from Kylie Scott ("Drunk in the club after Covid") and Sarah Cooper ("How to medical"), lip-syncing recorded passages from Donald Trump's press events. Here's another approach, from @JaneyGodley, substituting her own voice for that of the First Minister of Scotland, Nicola Sturgeon:


Read the rest of this entry »

Comments (7)


Wolf Warrior Diplomacy

A little over two years ago, I made a rather detailed post on Lycogala epidendrum, commonly known as wolf's milk or groening's slime, and its metaphorical applications in China:

"Wolf's milk, a slime mold attractive to young Chinese?" (4/7/18)

During the interim, the popularity of this lowly amoeba has only grown, until it has become the model for an aggressive style of diplomacy on the world stage called in Chinese "zhàn láng wàijiāo 戰狼外交" ("wolf warrior diplomacy").  Synergistically, it has joined forces with another microoranism, this one called severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), also known as coronavirus disease 2019 (COVID-19) and a host of other names that I will refrain from mentioning here for fear of pushing the wrong buttons (this is highly fraught topic, one that must be treated delicately, lest one stirs up a hornets' nest of conflicting onomastic opinions).  Together, COVID-19 and wolf warrior diplomacy have brought the world to the brink of pandemic strife.

Read the rest of this entry »

Comments (6)


Learning empiricism

Comments (20)


Scope ambiguity of the week

A recent NYT headline seems like the premise for a particularly dark dystopian movie: Emily Oster, "Only Children Are Not Doomed", NYT 4/27/2020. A sort of cross between 12 Monkeys and Lord of the Flies? No:

The coronavirus pandemic has created a lot of confusion, but it also may bring into focus a question many parents (or expectant parents) ask: What is the right number of kids for my family? Quarantine or not, having siblings shapes one’s experiences and development. On balance, is this for good or for ill? […]

Overall, when it comes to what economists call success, having siblings simply does not seem to matter.

But what about the awkward only child? The data has largely rejected that idea for decades. One 1987 review article, which summaries 140 studies, found some evidence of more “academic motivation” among only children, but no differences on personality traits like extroversion. In other words, although you might expect a built-in playmate makes a kid more social, the data doesn’t bear that out.

Read the rest of this entry »

Comments (22)