Archive for Writing systems

4-digit numbers versus 5-digit numbers

Phil H wrote these comments to "Uncommon words of anguish" (7/18/21):

The anguish is very real. My wife had a character in her name that most computers will not reproduce ([石羡]), despite it being relatively common in names in our part of the world, and has been refused bank accounts, credit cards, and a mortgage because of it. In the end she changed her name rather than continue to deal with the hassle. The character is in the standard, but it was too late for us.

…there have always been ways to get the character onto a computer, but any given piece of bank software might not recognise it, and any given bank functionary might be unfamiliar with them. We then had trouble when some organisations used the pinyin XIAN in place of the character, but that then made their documentation inconsistent with her national ID card (which had the right character on it) and so yet further bodies would not accept them… It was the standard "mild computer snafu + large inflexible bureaucracy = major headache" equation.

An anonymous correspondent, a computer scientist, sent in the following remarks:

Phil H is talking about a character which is in a "supplementary plane" in Unicode (and similarly in GB-18030).  Unfortunately, an awful lot of software was only ever tested on Basic Multilingual Plane characters.
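
To make the distinction concrete, here is a minimal sketch in Python. Code points in the Basic Multilingual Plane fit in four hexadecimal digits (up to U+FFFF), while supplementary-plane code points need five or more. The character U+20000 (the first character of CJK Extension B) is used purely as a stand-in for any supplementary-plane character; it is not the character in Phil H's account, and the helper names `is_supplementary`, `utf16_code_units`, and `describe` are hypothetical, chosen for illustration.

```python
# Illustrative sketch only: U+20000 stands in for any supplementary-plane
# character; it is not the character discussed in the post.

BMP_MAX = 0xFFFF  # highest code point in the Basic Multilingual Plane


def is_supplementary(ch: str) -> bool:
    """Return True if the single character lies outside the BMP."""
    return ord(ch) > BMP_MAX


def utf16_code_units(text: str) -> int:
    """Count the 16-bit code units needed to store text in UTF-16."""
    return len(text.encode("utf-16-le")) // 2


def describe(ch: str) -> dict:
    """Summarize how one character is numbered and encoded."""
    return {
        "code_point": f"U+{ord(ch):04X}",
        "supplementary": is_supplementary(ch),
        "utf16_units": utf16_code_units(ch),
        "utf8_bytes": len(ch.encode("utf-8")),
        "gb18030_bytes": len(ch.encode("gb18030")),
    }


bmp_char = "山"            # an everyday BMP character
supp_char = "\U00020000"   # first character of CJK Extension B

for ch in (bmp_char, supp_char):
    print(describe(ch))
```

Software that assumes one UTF-16 code unit per character, or a two-byte limit per character in a GB-family encoding, can handle the first character and fail on the second. That is one plausible shape for the failures described above, though the actual bank software may have broken for other reasons.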

Absence of metaphysics

[The following is a guest post by Zihan Guo in response to this article by Bruno Maçães, a Portuguese politician and political philosopher, who asserts, among other things, that China lacks metaphysics because of the nature of its language (i.e., script):  "The Black Box:  A Theory of China", World Game on Substack (12/25/20) — excerpts below.]

I can hear Zhuangzi chuckling.

For me, the charm of a Chinese word is its ability to conjure up images beyond its denotational meaning. The word shānshuǐ 山水 ("landscape") does come from shān 山 ("mountain") and shuǐ 水 ("water"), but it connotes much more than that. What immediately comes to my mind is Chinese landscape painting, where an infinitesimal figure (a being) plods along the trails leading up to an insurmountable cliff. To my amateurish eyes, those painters are not just depicting empirical reality. There is meaning behind the surface representation.

Asked "what is red?", Zhuangzi might ask in return "what is not red, what is redder, what is less red, why does it matter?" The Chinese answer, a collage of red objects, can prompt the same kind of philosophical rumination as the scientific Western definition. If "red" can be represented by many different things, its concept becomes unstable and ambiguous. Cherries are red, but roses might be redder, so which is the "true" red? One sees that redness as phenomenal keeps changing and that one's own perception of reality can be deceptive. I believe Zhuangzi probes into similar questions as well as the eventual question of whether there is something unchanging beyond all phenomena.

I think the author is more interested in modern Chinese politics than the idea that Chinese script is purely physical. His ending note about Xi feels rather sarcastic.

Emoji Heart Sutra

From the Library of Congress International Collections FB page (Saturday 7/17/21):

Uncommon words of anguish

From a manual for a thermal printer:

Dǎyìn kòngzhì bǎn nèizhì GB18030 Zhōngwén zìkù, chèdǐ miǎnchú shēngpì zì de kǔnǎo

打印控制板内置 GB18030 中文字库,彻底免除生僻字的苦恼

Printer control panel built-in GB18030 Chinese character, thoroughly remove the uncommon words of anguish

(courtesy of Amy de Buitléir)

A more accurate English translation would be:

Printer control panel with built-in GB18030 Chinese character font, thoroughly removing the anguish brought about by uncommon / obscure characters

"GB" stands for "guóbiāo 国标" ("national standard"), and is used for many technical terms in the PRC (another instance of encroaching digraphia, for which see here and here [with extensive bibliography]).

Nonbinary third person pronoun in written Mandarin

Faux Manchu: Ornamental Manchu II

[This is a guest post by Jichang Lulu]

In “Ornamental Manchu: the lengths to which a forger will go” (LL, April 24), Professor Mair discussed a handscroll with faux-Manchu inscriptions. Although the writing clearly imitated Manchu, the imitation was so liberal and the forger so unfamiliar with the Manchu script that hardly any word was intelligible even to eminent Manjurists consulted for the post.

As a non-Manjurist, I found the text only more puzzling, but was able to identify its model by comparing a conjectural reading of a non-recurring word in it to a published text of a Manchu translation of the Heart Sutra (Fuchs, Die mandjurischen Druckausgaben des Hsin-ching (Hṛdayasūtra) (non legi), transcribed in Hurvitz, “Two polyglot recensions of the Heart Scripture”, J Indian Philos 3:1/2 (1975)). That guess I shared in a comment embedded in the post, elaborated under it with the likely source text. That presumably settled the question, but, with the source given in transliteration only, didn’t make it any easier to appreciate the hilarious cavalierness of the copy without an ability to mentally untransliterate it back into the Manchu script.

Professor Kicengge has now compared the text to a Manchu-script rendition of the sutra and composed an image that juxtaposes the copy to its model. The juxtaposition verifies the identification of the source text: not only does the text (very roughly) match, so does its division into columns.


The handscroll’s faux Manchu and its model, juxtaposed. Supplied by Kicengge.

African (il)literacy

The following article is so revelatory, at least for me, that I wish I could copy it entirely.  Since that's not what we do at Language Log, I will just quote the opening portion (probably less than a quarter of the total essay), while pointing to a few additional highlights, and encourage others who are interested to read the whole piece (4,700 words):

"Africa writes back:  European ideas of African illiteracy are persistent, prejudiced and, as the story of Libyc script shows, entirely wrong", Aeon (6/17/21), by D. Vance Smith, edited by Sam Dresser

Four different writing systems have been used in Algeria. Three are well known – Phoenician, Latin and Arabic – while one is both indigenous to Africa and survives only as a writing system. The language it represents is called Old Libyan or Numidian, simply because it was spoken in Numidia and Libya. Since it’s possible that it’s an ancestor of modern Berber languages – although even that’s not clear – the script is usually called Libyco-Berber. Found throughout North Africa, and as far west as the Canary Islands, the script might have been used for at least as long as 1,000 years. Yet only short passages of it survive, all of them painted or engraved on rock. Everything else written in Libyco-Berber has disappeared.

Libyco-Berber has been recognised as an African script since the 17th century. But even after 400 years, it hasn’t been fully deciphered. There are no long texts surviving that would help, and the legacy of the written language has been one of acts of destruction, both massive and petty. That fate, of course, is not unique. It’s something that’s characteristic of modern European civilisation: it both destroys and treasures what it encounters in the rest of the world. Like Scipio Africanus weeping while he gazed at the Carthage he’d just obliterated, the destruction of the other is turned into life lessons for the destroyer, or artefacts in colonial cabinets of curiosities. The most important piece of Libyco-Berber writing was pillaged and sold to the British Museum for five pounds. It’s not currently on display.

Dungan, a Sinitic language of Central Asia written in the Cyrillic Alphabet

The linguistic importance of Dungan is greatly disproportionate to the number of its speakers, approximately 150,000, who live in seven different countries that are widely spread across Eurasia:  Kyrgyzstan, Kazakhstan, Russia, Tajikistan, Mongolia, Uzbekistan, and Ukraine.  The main reason why Dungan has been the focus of so much interest during the half-century since I began studying this fascinating language is that it gives the lie to the fallacy that Sinitic languages can only be written with the Sinographic script (i.e., Chinese characters).  The only Sinitic language that needs to be written with morphosyllabic characters is Literary Sinitic / Classical Chinese, a language that, in terms of its sayability, has been dead for millennia.  The recent academic study of Dungan has played a key role in enabling language specialists and the lay public finally to come to this realization.

Because the Dungan people are so highly scattered across vast distances and live among dominant populations with completely different languages that they need to speak for daily survival, their own language — and consequently also its alphabetic script — is threatened with extinction.  Furthermore, in recent decades the Dungans have been buffeted by ethnopolitical winds that make it even harder to maintain their unique identity.  That is why I have long felt a sense of urgency about the need to document and research Dungan language and script in all of their dimensions (morphology, phonology, lexicography, grammar, syntax, script, literature, sociolinguistics…).

Character confusion: three-child policy

Sinitic spelling: winter melon and bean curd

Germanic runes on a pre-Cyrillic Slavic bone stir a debate

Article in Sunday's NYT:

"A Scratched Hint of Ancient Ties Stirs National Furies in Europe"

"Czech archaeologists say marks found on a cattle bone are sixth-century Germanic runes, in a Slavic settlement. The find has provoked an academic and nationalist brawl." Andrew Higgins (5/16/21)

The opening paragraphs lay out very clearly the reasons why the find is of such exceptional significance:

LANY, Czech Republic — In a region long fought over by rival ethnic and linguistic groups, archaeologists in the Czech Republic have discovered something unusual in these turbulent parts: evidence that peoples locked in hostility for much of the modern era got along in centuries past.

A few yards from a Czech Army pillbox built as a defense against Nazi Germany, the archaeologists discovered a cattle bone that they say bears inscriptions dating from the sixth century that suggest that different peoples speaking different languages mingled and exchanged ideas at that time.

The bone fragment, identified by DNA analysis and carbon dating as coming from the rib of a cow that lived around 1,400 years ago, was found in a Slavic settlement in 2017, said Jiri Machacek, the head of the archaeology department at Masaryk University in the Czech city of Brno. But in what is considered a major finding, a team of scholars led by Dr. Machacek recently concluded that the bone bears sixth-century runes, a system of writing developed by early Germans.

Difficult tongues

Johnson, in the Economist (5/7/21), has an enjoyable article:  "Some languages are harder to learn than others — but not for the obvious reasons".

Here's the first part of the article:

When considering which foreign languages to study, some people shy away from those that use a different alphabet. Those random-looking squiggles seem to symbolise the impenetrability of the language, the difficulty of the task ahead.

So it can be surprising to hear devotees of Russian say the alphabet is the easiest part of the job. The Cyrillic script, like the Roman one, has its origins in the Greek alphabet. As a result, some letters look the same and are used near identically. Others look the same but have different pronunciations, like the p in Cyrillic, which stands for an r-sound. For Russian, that cuts the task down to only about 20 entirely new characters. These can comfortably be learned in a week, and soon mastered to the point that they present little trouble. An alphabet, in other words, is just an alphabet. A few tricks aside (such as the occasional omission of vowels), other versions do what the Roman one does: represent sounds.

Tel Lachish and the origin of the alphabet

I've often heard of important discoveries at Tel Lachish, and I have a special interest in the origins of the alphabet, which I consider one of the most important inventions in the history of humankind.  So when I saw the title of this article, I perked up instantaneously.

"Archaeologists Think They’ve Found Missing Link in Origin of the Alphabet

A three and a half millennia old milk jar fragment unearthed at Tel Lachish in Israel has caused quite a bit of excitement."

By Candida Moss, The Daily Beast, Updated Apr. 25, 2021 8:18AM ET / Published Apr. 25, 2021 8:17AM ET
