Archive for Topolects

Test for dialect relatedness: especially for Northeast topolect groupies

Several of my PRC M.A. students have told me that the following tool for the computation of dialect closeness has become quite popular in China:

fāngyán yīnxì xiāngsì dù cèshì 方言音系相似度測試 ("Dialect phonological similarity test"),V3.2.358

(source)    

Read the rest of this entry »

Comments (28)

Northeastern topolect expressions, part 2

Following up on Diana Shuheng Zhang's notes on forty Northeasternisms (11/12/25), Yizhi Geng gives us another helping.  While Diana's collection is based mainly on Dalian city, Yizhi's comes from Changchun.

"mǎ húlu 马葫芦": "manhole" (lit., "horse gourd / calabash / cucurbit"), where "mǎ húlu gài 马葫芦盖" refers to "manhole-cover". According to older generations, this word came from Japanese, "manhōru マンホール", which was created during Japanese occupation. It seems to be interesting how this word came from English, to Japanese, and finally to Northeastern topolect dōngběi huà 东北话 we used in Changchun. 

"dà huí / xiǎo huí 大回 / 小回": "turn left / turn right" (lit., "big retreat / small retreat". It is said to also come from Japanese, but I cannot relate it to any Japanese expression I know. 

Read the rest of this entry »

Comments (2)

Northeastern topolect expressions

All places in China have topolect terms, some more than others, and some are more influential outside of their own region than others.  One regional variety whose speakers create numerous memorable expressions they are proud of is Dōngběihuà 東北話 ("Northeastern topolect").  I was inspired to make this post after reading a collection of twenty Northeasternisms.

I showed the collection to Diana Shuheng Zhang, who is an authentic Northeasterner.  Diana not only translated and explained the entire collection, she added twenty more, for a total of forty, commenting, "Can't stop laughing. Hope everybody enjoys our native expressions. :)" 

Please note that I (VHM) have added all the pinyin romanizations and a few literal translations).  Because some of the characters are unusual and I'm not a Northeastern speaker, I cannot guarantee the accuracy, especially down to the tones (and their sandhi), of all the transcriptions I have supplied.  Pay attention to Diana's valuable phonological notes.

Read the rest of this entry »

Comments (11)

Abstand und ausbau

Back in early April of this year, Kirinputra brought up this distinction at the end of a comment thread on Cantonese, but it came at the conclusion  of the thread, so — though it deserved discussion — there was no opportunity to hold one at that time.  Consequently, I reopen the deliberations now by quoting Kirinputra's final comment:

"Sinitic is like Romance" is not a working, truth-bearing analogy, esp. not for a layman audience. (Maybe some subset of Sinitic is like Romance, though.) "Sinitic" arguably harbors much more abstand diversity than Romance, for one thing. More importantly, Romance is by now evidence-based. Humans have a detailed understanding of the mechanics & timing of the divergence from a common ancestor. Sinitic is belief-based. The approach to the detailed reality is largely speculative & often circular.

Read the rest of this entry »

Comments (65)

L-complex

From Peter Daniels:

Do the 7 or 8 (or whatever) “dialects” of Sinitic constitute what Hockett called an “L-complex,” like Romance, such that you could traverse the entire domain and never encounter neighboring villages that didn’t understand each other, with cultural centers where the language described in the regional grammar book and dictionary is spoken, or are they distinct languages as far back as one can look?

Read the rest of this entry »

Comments (14)

Fundamental Sinitic linguistic issues solved through analysis of Chinese rap

Julesy just keeps getting better and better:

Read the rest of this entry »

Comments (8)

Boat people

"The endangered Tanka language in Hong Kong: phonological variations and lexical convergence with Cantonese", Cong Wang, Daxingwang Peng, Yanmei Dai & Chong Qi, Humanities and Social Sciences Communications volume 12, Article number: 1133 (July 19, 2025)

The first thing we need to take care of is to discuss their name:

According to official Liu Zongyuan (773–819) of the Tang dynasty, there were Boat Dweller people settled in the boats of today's Guangdong Province and Guangxi Zhuang Autonomous Region.

The term "Tanka" (蜑家) may originate from tan (Cantonese: "egg") and ka (Cantonese: "family" or "people"), although another possible etymology is tank ("junk" or "large boat") rather than tan. "Tanka" is now considered derogatory and no longer in common usage. The Boat Dwellers are now referred to in China as "people on/above water" (Chinese: 水上人; pinyin: shuǐshàng rén; Cantonese Yale: Séuiseuhngyàn), or "people of the southern sea" (Chinese: 南海人; Cantonese Yale: Nàamhóiyàn). No standardised English translation of this term exists. "Boat People" is a commonly used translation, although it may be confused with the similar term for Vietnamese refugees in Hong Kong. "Boat Dwellers" was proposed by Dr. Lee Ho Yin of The University of Hong Kong in 1999, and it has been adopted by the Hong Kong Museum of History for its exhibition.

Both the Boat Dwellers and the Cantonese speak Cantonese. However, Boat Dwellers living in Fujian speak Min Chinese.

(Wikipedia)

Read the rest of this entry »

Comments (14)

Topolect in the big city

The title of this song attracted my attention:  "Fāngyán de ànshāng 方言的黯伤" ("The sadness of topolect"). 

I listened to it here, but couldn't catch everything that the singer was saying.  I asked Zhaofei Chen what she heard, and here's what she gleaned from listening to the recording:

Read the rest of this entry »

Comments (5)

Taiwanese Twosome: tea and Sino-Korean

Even if you can't understand spoken Taiwanese, you can learn a lot from these two videos because of the excellent visuals, plus it is nice just to hear the clearly spoken Taigi and compare terms in Taigi with their parallels in Sino-Korean.

The first is a video from Taiwan's public TV (公視台語台) on the interesting distribution of the names of tea in the world:

Read the rest of this entry »

Comments (11)

Dungan radio broadcasts from 2018-2021

We've talked about Dungan a lot on Language Log.  That's the northwest Sinitic topolect written in Cyrillic that has been transplanted to Central Asia.  See "Selected readings" below.

For those of you who are interested and would like to hear what it sounds like in real life — spoken and sung by male and female voices — we are fortunate to have a series of ten radio broadcast recordings (here).

Note the natural, easy, undistorted insertion of non-Sinitic borrowings, e.g., "Salam alaikum" (Arabic as-salāmu ʿalaykum  السَّلَامُ عَلَيْكُمْ ["Peace be upon you"]).  That would not be possible in sinographic transcription of northwest Sinitic speech.  This and other aspects and implications of alphabetic Dungan have been extensively discussed on LL.

Read the rest of this entry »

Comments (9)

Battle for Taiwanese, part 2

IA sent me this article (in Chinese) about a new translation of George Orwell's 1984.  It begins:

Yīngguó zuòjiā Qiáozhì Ōuwēiěr de míngzhù `1984' chūbǎn yuē 75 nián, jìnrì yíng lái shǒubù Táiwén bǎn. Yìzhě Zhōu Yíngchéng shuō, zhè shì tuīdòng `Táiyǔ zhèngchánghuà'de chángshì, ràng Táiyǔ mǔyǔzhě bùbì tòuguò Zhōngwén yìběn, yě néng jiēchù shìjiè jīngdiǎn wénxué

英國作家喬治‧歐威爾的名著「1984」出版約75年,近日迎來首部台文版。譯者周盈成說,這是推動「台語正常化」的嘗試,讓台語母語者不必透過中文譯本,也能接觸世界經典文學。

1984, a famous novel by British writer George Orwell, was published about 75 years ago and recently had its first Taiwanese version. Translator Zhou Yingcheng said that this is an attempt to promote the "normalization of Taiwanese" so that native Taiwanese speakers can access world classic literature without having to rely on Chinese translations.

IA points out that, as in the following quotation from the translator, "Zhōngwén 中文" (lit. "Chinese writing"), refers not only to written language but spoken as well:

Tā shuō:`Dāngshí zài guó wài jiǎng zhōngwén, chángcháng bèi dàng zuò zhōngguó rén, yúshì wǒ kāishǐ sīkǎo zìjǐ gēn táiwān de liánjié shì shénme, dé chū de jiélùn shì tái yǔ. Dàn wǒ tái yǔ bùgòu hǎo, yǒu shí wǒmen xiǎng jiǎng qiāoqiāohuà,(jiǎng zhōngwén) pà biérén tīng dǒng, jiù huì qiēhuàn chéng tái yǔ, dàn yòu méi bànfǎ wánzhěngde shuō

他說:「當時在國外講中文,常常被當作中國人,於是我開始思考自己跟台灣的連結是什麼,得出的結論是台語。但我台語不夠好,有時我們想講悄悄話,(講中文)怕別人聽懂,就會切換成台語,但又沒辦法完整地說」。

He said: "When I was speaking Chinese abroad, I was often mistaken for Chinese, so I began to think about what my connection with Taiwan was, and I concluded it was Taiwanese. But my Taiwanese is not good enough. Sometimes when we want to whisper, we are afraid that others will understand (what we are saying in Chinese), so we switch to Taiwanese, but we can't speak it completely."

Read the rest of this entry »

Comments (21)

Cantonese as old and pure: a critique

[This is a guest post by Robert S. Bauer in response to the video and paper featured in this recent Language Log post:  "Cantonese is both very cool and very old" (4/1/25)]

After I read the paper the first word that came to mind was “Cringeworthy” in regard to the author’s phrase “purer descent”; and the second word was “Superficial” in regard to the author’s knowledge of Cantonese and Chinese linguistics. For instance, the author who has narrowly focused on just those items that support his claims doesn’t seem to know that the Ancient Chinese tone category of Rusheng/Entering Tone which has disappeared from Mandarin was not a particular tone contour; the distinctive feature of Rusheng was that the monomorphosyllables belonging to it had as their finals or endings the three stop consonants -p, -t , -k, all of which have been retained in Cantonese, as well as in various other Chinese topolects of South China.

Read the rest of this entry »

Comments (13)

Cantonese is both very cool and very old

Comments (11)