The role of long-distance communication in human history

If one has a knee-jerk reaction to attribute all distant cultural resemblances to chance coincidence (independent invention), that would be to make a mockery of human mobility and adaptability.  It would be as if people never deigned or had the opportunity to borrow something from another group.

I can give hundreds of long distance cultural correspondences that could not possibly have been due to chance coincidence — so complicated, intricate, and exact are they, especially when accompanied by textual, artistic, and other types of evidence, much of it hard / material.  Moreover, we often have the bodies and the goods and the words — at transitional stages and times — to go along with the transmission.  For some examples, see the "Selected readings" below.  Many more could be adduced.

Read the rest of this entry »

Comments (14)


GLM-130B: An Open Bilingual Pre-Trained Model

Description of a General Language Model (GLM; also GLaM) project based at Tsinghua University in Beijing, but with users and collaborators around the world.

Homepage (August 4, 2022)

This prospectus is difficult for outsiders to understand because of the large number of unexplained acronyms, abbreviations, initialisms, etc. and other such participants' terminology.

GLM-130B is an open bilingual (English & Chinese) bidirectional dense model with 130 billion parameters, pre-trained using the General Language Model (GLM) algorithm1. It is designed to support inference tasks with the 130B parameters on a single A100 (40G * 8) or V100 (32G * 8) server. As of July 3rd, 2022, GLM-130B has been trained on over 400 billion text tokens (200B each for Chinese and English) and exhibits the following unique features:

    • Bilingual: supports both English and Chinese.
    • Performance (EN): better than GPT-3 175B (+5.0%), OPT-175B (+6.5%), and BLOOM-176B (+13.0%) on LAMBADA and slightly better than GPT-3 175B (+0.9%) on MMLU.
    • Performance (CN): significantly better than ERNIE TITAN 3.0 260B on 7 zero-shot CLUE datasets (+24.26%) and 5 zero-shot FewCLUE datasets (+12.75%).
    • Fast Inference: supports fast inference on both SAT and FasterTransformer (up to 2.5X faster) with a single A100 server.
    • Reproducibility: all results (>30 tasks) can be easily reproduced with open-sourced code and model checkpoints.
    • Cross-Platform: supports training and inference on NVIDIA, Hygon DCU, Ascend 910, and Sunway.

The model checkpoints of GLM-130B and code for inference are publicly available at our GitHub repo. The code for pre-training and fine-tuning as well as the research paper are coming soon.

Read the rest of this entry »

Comments off


Battle for Taiwanese

Comments (15)


Whose New Year is it anyway?

The struggle for cultural priority, supremacy, and naming between China and Korea is perennial:  fishing nets, printing with metal movable type, kimchi….  Now it's over the lunar new year that is currently being celebrated.

"NewJeans' Danielle apologizes for calling the 'Lunar New Year' 'Chinese New Year'"

Yaki-Jones, allkpop (1/21/23)

 

"Chinese netizens terrorize the Instagrams of Korean celebrities who gave lunar new year greetings, including IVE's Wonyoung and CL"

Yaki-Jones, allkpop (1/22/23)

Might be better to avoid the orthological controversy altogether and just refer to it as the Lunar New Year.

Read the rest of this entry »

Comments (13)


The difficulty of expressing "nothing"

This is a clever attempt to write a spring couplet (chūnlián 春聯), not in the usual Sinoglyphs / Chinese characters, but in pictographs:


(source)

Read the rest of this entry »

Comments (16)


The end of dialect fieldwork

The title and following paragraph of this post are from a genuine, serious, highly experienced, and well-published dialectologist who wishes to remain anonymous (he is quoting from a communication by the Practicum Education Department of USC's Suzanne-Dworak-Peck School of Social Work at the University of Southern California):

As we enter 2023, we would like to share a change we are making at the Suzanne-Dworak-Peck School of Social Work to ensure our use of inclusive language and practice.  Specifically, we have decided to remove the term “field” from our curriculum and practice and replace it with “practicum.” This change supports anti-racist social work practice by replacing language that could be considered anti-Black or anti-immigrant in favor of inclusive language. Language can be powerful, and phrases such as “going into the field” or “field work” may have connotations for descendants of slavery and immigrant workers that are not benign.

Read the rest of this entry »

Comments (73)


Happy, joyous, auspicious, glorious, fecund… lunar Year of the Rabbit

Received these greetings from one of my dear PRC M.A. students this morning:

yuàn xīnnián, shèng jiùnián, huānyú qiě shèngyì, wànshì jǐn kěqī

yuàn píng'ān xǐlè, zhūshì shùnsuì. xīnnián kuàilè, tùnián jíxiáng, héjiā huānlè, píng'ān xìngfú. zhù Méi jiàoshòu xīnnián kuàilè 🎊

願新年,勝舊年,歡愉且勝意,萬事盡可期
願平安喜樂,諸事順遂。新年快樂,兔年吉祥,闔家歡樂,平安幸福。祝梅教授新年快樂🎊

Google Translate does a pretty good job of it with this:

May the new year be better than the old year, happy and successful, and everything can be expected

May you be safe and happy, and everything goes well. Happy New Year, auspicious Year of the Rabbit, happy family, peace and happiness. Happy New Year to Professor Mei 🎊

Read the rest of this entry »

Comments (3)


Why we all need subtitles now

[10:42 video — entertaining and informative]

Read the rest of this entry »

Comments (30)


On the difficulty of saying what a word is

Sophie MacDonald asks:

I have been an on-and-off reader of Language Log for several years, and have always enjoyed your contributions, though I’m not a linguist. I do work on formal language theory sometimes, but very much within mathematics and computer science, not linguistics.

Recently, a music theorist colleague asked me for help with a question. She is engaging with the body of literature that applies linguistic ideas and methods to the study of music, and she is in particular working with the idea that it is hard to give a definition of a chord or a melodic phrase that actually makes sense within musical practice. She was asking for linguistic sources indicating the difficulty of saying what a word is, which might be useful for the point she is making.

Read the rest of this entry »

Comments (16)


Liuzhou Snail Rice Noodles


Liuzhou Snail Rice Noodles from China. (Facebook, Li Chong-lim photo)

The photograph is from this article:

China’s ‘propaganda noodle soup’ ordered off the market in Taiwan

Noodle packaging has ‘You are Chinese, and I am too’ emblazoned across it

By Huang Tzu-ti, Taiwan News (1/17/23)

Read the rest of this entry »

Comments (5)


Xu Wenkan (1943-2023)


(source)

Xú Wénkān 徐文堪, who was born in October of 1943 in the metropolis of Shanghai, died in the same city on January 4, 2023 of complications arising from the novel coronavirus.  He was one of the leading lexicographers compiling and editing the Hànyǔ Dà Cídiǎn 漢語大詞典 (Unabridged Dictionary of Sinitic), a comprehensive work of 23,000 head characters, 370,000 words, and 1,500,000 citations in 12 large volumes plus index, in the editing of which Wenkan played a key role from its beginning in 1977 to its completion in 1994.

Read the rest of this entry »

Comments (1)


Special womem

Comments (11)


Polynesian sweet potatoes and jungle chickens: verbal vectors

To this post made more than half a month ago, "The invention, development, and decipherment of writing" (12/30/22), after a couple of important comments on other subjects (Phrygian inscriptions; Egyptian, Mesopotamian, Chinese stimulus diffusion of writing), R. Fenwick made the following vital remarks, and just in the nick of time before comments closed:

@David Marjanović:
Heyerdahl obviously demonstrated that such a voyage wasn't physically impossible. There's just no reason to assume it actually happened.

Except that it actually did happen. It's true that Heyerdahl's specific settlement model has subsequently not held water – we know now that Polynesia was settled rather from the western Pacific and ultimately Taiwan – but at least one successful return voyage in the other direction was almost certainly achieved. And we do have at least one very powerful reason to conclude that it was: sweet potatoes. They've been widely cultivated by eastern Polynesian peoples since well before European incursion into the Pacific (the earliest ¹⁴C dates we have on sweet potato remains are from c. 1210—1400 AD, on the Cook Islands), but sweet potatoes are native to South and Central America, and early Polynesian seafarers most likely took on sweet potato cultivation as a result of direct trade with the Inca. There's even a singular but stark linguistic footprint of the interaction, as Proto-Eastern Polynesian *kumara "sweet potato" (cp. Māori, Rarotongan kūmara, Rapanui, Tuamotuan kumara, Marquesan kūma‘a, Hawaiian ‘uwala) is virtually identical to terms for sweet potato in the Quechuan languages (e.g. Cuzco khumara, Ayacucho kumar, Northern Pastaza kumal, Colonial Chincha cumar).

Other evidence for Polynesians visiting South America is unfortunately very thin, but I've read reports from a couple of archaeological excavations – in Chile, I think – where small quantities of avian bone have been recovered that are consistent with domestic chickens, a south-eastern Asian domesticate that formed a key part of Polynesian diets. I'll have to look into the literature a little further and see if the situation on the ground has improved regarding pre-Columbian chickens. There's also a recent study that also claimed a South American genetic component in portions of the Polynesian human population, but I haven't read the article yet.

Read the rest of this entry »

Comments (57)