A (troop / troupe of) dragon(s) tromping / flying
This is the theme of the forthcoming CCTV Spring Festival Gala to ring in the new year of 2024:

(source)
Read the rest of this entry »
This is the theme of the forthcoming CCTV Spring Festival Gala to ring in the new year of 2024:

(source)
Read the rest of this entry »
By chance, while I was looking for something else about a supposed Russian mistranslation, I came upon this famous example:
“We Will Bury You” — How A Mistranslation Almost Started WW3
And the story of the man behind those fateful words
A Renaissance Writer
Exploring History
Medium (Jul 14, 2020)
Although this happened nearly seven decades ago, I still remember the electrifying impact Khrushchev's words had on the world. Furthermore, from time to time during the interim between then and now, I heard echoes of this sensational, ominous warning on the part of the Soviet leader, but sometimes also allegations that it was the result of a mistranslation.
Since I write for Language Log and am hopefully in a position — with the help of Language Log readers — to set the record straight (or at least straighter than it was before), I thought that I had better read the Medium article carefully and seek additional confirmatory and contradictory evidence.
Read the rest of this entry »
The bio from a recent talk announcement described the speaker as a "Production Engineer …, a job which, for the most part, means he is a professional toil tackler."
That's a striking phrase, and one that was new to me. I soon discovered that it's new to Google as well, though the search turned up the source of its constituent words in Chapter 6, "Eliminating Toil", from a Workbook associated with Google's Site Reliability Engineering (=SRE) page.
Read the rest of this entry »
Choices for the 2023 Word Of The Year are starting to come out —
Read the rest of this entry »
Nasr et al., "Scalable Extraction of Training Data from (Production) Language Models", arXiv.org 11/28/2023:
This paper studies extractable memorization: training data that an adversary can efficiently extract by querying a machine learning model without prior knowledge of the training dataset. We show an adversary can extract gigabytes of training data from open-source language models like Pythia or GPT-Neo, semi-open models like LLaMA or Falcon, and closed models like ChatGPT. Existing techniques from the literature suffice to attack unaligned models; in order to attack the aligned ChatGPT, we develop a new divergence attack that causes the model to diverge from its chatbot-style generations and emit training data at a rate 150x higher than when behaving properly. Our methods show practical attacks can recover far more data than previously thought, and reveal that current alignment techniques do not eliminate memorization.
Read the rest of this entry »
Permalink Comments off
Recent buzz over "Q*" started with stories about 10 days ago. A recent Wired article explains:
Last week, after briefly deposed CEO Sam Altman was reinstalled at OpenAI, two reports claimed that a top-secret project at the company had rattled some researchers there with its potential to solve intractable problems in a powerful new way.
“Given vast computing resources, the new model was able to solve certain mathematical problems,” Reuters reported, citing a single unnamed source. “Though only performing math on the level of grade-school students, acing such tests made researchers very optimistic about Q*’s future success.” The Information said that Q* was seen as a breakthrough that would lead to “far more powerful artificial intelligence models,” adding that “the pace of development alarmed some researchers focused on AI safety,” citing a single unnamed source.
Read the rest of this entry »
Japanese words of the year are always exciting and surprising, but this year's takes the cake.
are あれ
pronunciation
distal demonstrative, something far off removed from both speaker and listener: that, yon
Read the rest of this entry »
[Here's the conclusion to the hoped for trifecta on things Indian — see the preface here. It comes in the form of a guest post by Arun Prasad]
Read the rest of this entry »
The Chinese Football Association used dǎngguó 党国 ("party state" — nettlesome term to be explained fully below) in a powerpoint on its plans for '24. Awkward political illiteracy!
Here's a screenshot.
Read the rest of this entry »
Long have we pondered the overwhelming dominance by individuals of Indian heritage over the spelling bees. Do they have some sort of mysterious power or secret for memorizing hundreds of thousands of obscure words?
Now we have an answer from one of the masters himself, Dev Shah, a ninth-grader living in Largo, Florida, who won the Scripps National Spelling Bee in June of this year.
Opinion
I won the National Spelling Bee.
This is what it takes to master spelling.
By Dev Shah, WSJ
——————-
The annual Scripps National Spelling Bee is an incredible event. Each year, some 11 million students from across the country take part in the spelling bee circuit, all vying for the championship title. After competing in rigorous local bees, about 200 spellers make it to the national stage, and a handful of them qualify for the grand finals. Of course, only one can be crowned the National Spelling Bee champion. This year that student was me.
Read the rest of this entry »
[This is the first of two consecutive posts on things Indian. After reading them, if someone is prompted to send me material for a third, I'll be happy to make it a trifecta.]
Our entry point to the linguistically compelling topic of today's post is this Nikkei Asia (11/29/23) article by Barkha Shah in its "Tea Leaves" section:
Why it's worth learning ancient Sanskrit in the modern world:
India’s classical language is making a comeback via Telegram and YouTube
The author begins with a brief introduction to the language:
The language had its heyday in ancient India. The Vedas, a collection of poems and hymns, were written in Sanskrit between 1500 and 1200 B.C., along with other literary texts now known as the Upanishads, Granths and Vedangas. But while Sanskrit became the foundation for many (though not all) modern Indian languages, including Hindi, it faded away as a living tongue.
Read the rest of this entry »
In addition to a proto-regular-expression for English monosyllables, Benjamin Lee Whorf's 12/1940 Technology Review article has a weird diagram showing how a linguist (?) would organize French language instruction along the lines of mid-20th-century factory work:

Read the rest of this entry »