Archive for Topolects

L-complex

From Peter Daniels:

Do the 7 or 8 (or whatever) “dialects” of Sinitic constitute what Hockett called an “L-complex,” like Romance, such that you could traverse the entire domain and never encounter neighboring villages that didn’t understand each other, with cultural centers where the language described in the regional grammar book and dictionary is spoken, or are they distinct languages as far back as one can look?

Read the rest of this entry »

Comments (14)

Fundamental Sinitic linguistic issues solved through analysis of Chinese rap

Julesy just keeps getting better and better:

Read the rest of this entry »

Comments (8)

Boat people

"The endangered Tanka language in Hong Kong: phonological variations and lexical convergence with Cantonese", Cong Wang, Daxingwang Peng, Yanmei Dai & Chong Qi, Humanities and Social Sciences Communications volume 12, Article number: 1133 (July 19, 2025)

The first thing we need to take care of is to discuss their name:

According to official Liu Zongyuan (773–819) of the Tang dynasty, there were Boat Dweller people settled in the boats of today's Guangdong Province and Guangxi Zhuang Autonomous Region.

The term "Tanka" (蜑家) may originate from tan (Cantonese: "egg") and ka (Cantonese: "family" or "people"), although another possible etymology is tank ("junk" or "large boat") rather than tan. "Tanka" is now considered derogatory and no longer in common usage. The Boat Dwellers are now referred to in China as "people on/above water" (Chinese: 水上人; pinyin: shuǐshàng rén; Cantonese Yale: Séuiseuhngyàn), or "people of the southern sea" (Chinese: 南海人; Cantonese Yale: Nàamhóiyàn). No standardised English translation of this term exists. "Boat People" is a commonly used translation, although it may be confused with the similar term for Vietnamese refugees in Hong Kong. "Boat Dwellers" was proposed by Dr. Lee Ho Yin of The University of Hong Kong in 1999, and it has been adopted by the Hong Kong Museum of History for its exhibition.

Both the Boat Dwellers and the Cantonese speak Cantonese. However, Boat Dwellers living in Fujian speak Min Chinese.

(Wikipedia)

Read the rest of this entry »

Comments (14)

Topolect in the big city

The title of this song attracted my attention:  "Fāngyán de ànshāng 方言的黯伤" ("The sadness of topolect"). 

I listened to it here, but couldn't catch everything that the singer was saying.  I asked Zhaofei Chen what she heard, and here's what she gleaned from listening to the recording:

Read the rest of this entry »

Comments (5)

Taiwanese Twosome: tea and Sino-Korean

Even if you can't understand spoken Taiwanese, you can learn a lot from these two videos because of the excellent visuals, plus it is nice just to hear the clearly spoken Taigi and compare terms in Taigi with their parallels in Sino-Korean.

The first is a video from Taiwan's public TV (公視台語台) on the interesting distribution of the names of tea in the world:

Read the rest of this entry »

Comments (11)

Dungan radio broadcasts from 2018-2021

We've talked about Dungan a lot on Language Log.  That's the northwest Sinitic topolect written in Cyrillic that has been transplanted to Central Asia.  See "Selected readings" below.

For those of you who are interested and would like to hear what it sounds like in real life — spoken and sung by male and female voices — we are fortunate to have a series of ten radio broadcast recordings (here).

Note the natural, easy, undistorted insertion of non-Sinitic borrowings, e.g., "Salam alaikum" (Arabic as-salāmu ʿalaykum  السَّلَامُ عَلَيْكُمْ ("Peace be upon you").  That would not be possible in sinographic transcription of northwest Sinitic speech.  This and other aspects and implications of alphabetic Dungan have been extensively discussed on LL.

Read the rest of this entry »

Comments (9)

Battle for Taiwanese, part 2

IA sent me this article (in Chinese) about a new translation of George Orwell's 1984.  It begins:

Yīngguó zuòjiā Qiáozhì Ōuwēiěr de míngzhù `1984' chūbǎn yuē 75 nián, jìnrì yíng lái shǒubù Táiwén bǎn. Yìzhě Zhōu Yíngchéng shuō, zhè shì tuīdòng `Táiyǔ zhèngchánghuà'de chángshì, ràng Táiyǔ mǔyǔzhě bùbì tòuguò Zhōngwén yìběn, yě néng jiēchù shìjiè jīngdiǎn wénxué

英國作家喬治‧歐威爾的名著「1984」出版約75年,近日迎來首部台文版。譯者周盈成說,這是推動「台語正常化」的嘗試,讓台語母語者不必透過中文譯本,也能接觸世界經典文學。

1984, a famous novel by British writer George Orwell, was published about 75 years ago and recently had its first Taiwanese version. Translator Zhou Yingcheng said that this is an attempt to promote the "normalization of Taiwanese" so that native Taiwanese speakers can access world classic literature without having to rely on Chinese translations.

IA points out that, as in the following quotation from the translator, "Zhōngwén 中文" (lit. "Chinese writing"), refers not only to written language but spoken as well:

Tā shuō:`Dāngshí zài guó wài jiǎng zhōngwén, chángcháng bèi dàng zuò zhōngguó rén, yúshì wǒ kāishǐ sīkǎo zìjǐ gēn táiwān de liánjié shì shénme, dé chū de jiélùn shì tái yǔ. Dàn wǒ tái yǔ bùgòu hǎo, yǒu shí wǒmen xiǎng jiǎng qiāoqiāohuà,(jiǎng zhōngwén) pà biérén tīng dǒng, jiù huì qiēhuàn chéng tái yǔ, dàn yòu méi bànfǎ wánzhěngde shuō

他說:「當時在國外講中文,常常被當作中國人,於是我開始思考自己跟台灣的連結是什麼,得出的結論是台語。但我台語不夠好,有時我們想講悄悄話,(講中文)怕別人聽懂,就會切換成台語,但又沒辦法完整地說」。

He said: "When I was speaking Chinese abroad, I was often mistaken for Chinese, so I began to think about what my connection with Taiwan was, and I concluded it was Taiwanese. But my Taiwanese is not good enough. Sometimes when we want to whisper, we are afraid that others will understand (what we are saying in Chinese), so we switch to Taiwanese, but we can't speak it completely."

Read the rest of this entry »

Comments (21)

Cantonese as old and pure: a critique

[This is a guest post by Robert S. Bauer in response to the video and paper featured in this recent Language Log post:  "Cantonese is both very cool and very old" (4/1/25)]

After I read the paper the first word that came to mind was “Cringeworthy” in regard to the author’s phrase “purer descent”; and the second word was “Superficial” in regard to the author’s knowledge of Cantonese and Chinese linguistics. For instance, the author who has narrowly focused on just those items that support his claims doesn’t seem to know that the Ancient Chinese tone category of Rusheng/Entering Tone which has disappeared from Mandarin was not a particular tone contour; the distinctive feature of Rusheng was that the monomorphosyllables belonging to it had as their finals or endings the three stop consonants -p, -t , -k, all of which have been retained in Cantonese, as well as in various other Chinese topolects of South China.

Read the rest of this entry »

Comments (13)

Cantonese is both very cool and very old

Comments (11)

American diplomat in Hong Kong reciting a Tang poem in Cantonese

Comments (9)

Suzhou rap sounds like it has a French accent

From Chas Belov:  

Google Translate says that this song is in Suzhou topolect (it actually says "dialect" but thanks to you I know better). But I had to recognize a few words before I could convince myself it wasn't in French (which I also don't know). Later in the song it sounds more Chinese, but the rapper never really loses that French sound. Am I imagining things?

【苏州方言RAP】红中 Zyh 《三十三》PROD BY XVIBE

LISTEN HERE

Read the rest of this entry »

Comments (28)

Hiberno-English: it's a soft day

Spending some time in Ireland, I hear people saying "It's a soft day" or "It's a soft day, thank God!".  Not knowing what that expression implies, I do a search and find that "A soft day is what the Irish call a very very damp fog or a mizzle, which is a cross between a mist and a drizzle." (source)  Mizzle is also the color of a shade of paint. (source)

"Soft day" is a phrase derived from Irish lá bog (lit.) ("overcast day; light drizzle/mist").

That reaction to a moist, overcast day tells you something about the Irish mindset and helps you understand Irish sentiment and humor.

Read the rest of this entry »

Comments (13)

The Power of Naming

[This is a guest post by Conal Boyce]

Overview: Here we look at some technical terms and how they’ve fared since their release to, or adoption by, the public: information theory; (TW) the colored quarks of Nambu and Han; cosmic‑ray decay according to Millikan; the Sinitic languages (Mair) vs. ‘the Chinese language’ (misnomer); Wu’s cosmic chirality as the violation of a nonNoetherian principle.

① information theory is the mother of all factoids. Why would one call it that? Because there is no such thing, only the following phantom utterance that is ubiquitous: “Shannon’s information theory.” In 1948, Shannon wrote a paper on the mathematics of data‑communication technology, and named it accordingly. Put off by its name, science journalists introduced it to the world as “information theory.” The name stuck, suggesting in the minds of innocents something so deep and epochal that it might even shed light on Mozart. Shannon 1948 is the big example of how of data and information have been confounded for 3/4 of a century, but it is accompanied by innumerable smaller cases, as when Susskind argues that “in physics we treat them as pretty much the same thing” (paraphrase; details in Appendix A). Here is a rough‑and‑ready demonstration of how different they actually are: “Go.” ←That’s just data, but place it in a context, and a layer of information now “rides on it” (or floats above it, on a different plane) such that this is conveyed: “Go to the store now before it closes”; or this: “Fly now to Hiroshima and drop the bomb.” True, in shop‑talk and hallway conversations, a database developer or data‑comm engineer might toss the terms data and information around as if one believed them to be interchangeable. Then, overheard by someone in the world at large, such casual usage is easily misconstrued, leading astrophysicists to fret in public over the “information” that might be “lost” in a black hole. (As for an actual Theory of Information, we must wait for a superintelligent computer to produce it since that task is far beyond human ability. And once coughed up, it will be so lengthy as to require several lifetimes to read it, and in any case, largely incomprehensible to us.)

Read the rest of this entry »

Comments (33)