Language Log

Badge of honor: Language Log is blocked in China

December 26, 2019 @ 8:41 am· Filed by Victor Mair under Language and business, Language and computers, Language and politics, Language and science

Two days ago, I received this message from a colleague in China:

Not sure if this should be a badge of honor or a disappointment, but a few days ago Language Log got blocked in China. (Source — GreatFire.org: Language Log is 100% censored)

This caps off a miserable year where we also lost Wikipedia (all languages), The Guardian, Al Jazeera, Hackernews, Imgur….

[VHM: Of course, Google, Facebook, Twitter, YouTube, and many other invaluable websites were already off-limits to Chinese citizens for years The internet in China is severely decimated by the CCP government.]

Read the rest of this entry »

Permalink Comments (6)

Chinese acronyms

December 22, 2019 @ 8:30 am· Filed by Victor Mair under Acronyms, Bilingualism, Borrowing, Diglossia and digraphia, Language and computers, Metaphors, Romanization

Apollo Wu sent in this list of what he calls "Chinese acronyms" (Romanizations, translations, links, and comments are by VHM):

GJBZ 国家标准 Guójiā biāozhǔn ("National Standard") — this is commonly reduced still further to "GB"

YDYL 一带一路 Yīdài yīlù ("One Belt, One Road" or "Belt and Road")

RMB 人民币 Rénmínbì ("RMB", the Chinese yuan)

Read the rest of this entry »

Permalink Comments off

Classical Chinese computing

December 19, 2019 @ 1:44 pm· Filed by Victor Mair under Language and computers, Nerdview

Several colleagues called this article to my attention:

"Programming Language for the ancient Chinese"

Here's the introduction:

文言, or wenyan, is an esoteric programming language that closely follows the grammar and tone of classical Chinese literature. Moreover, the alphabet of wenyan contains only traditional Chinese characters and 「」 quotes, so it is guaranteed to be readable by ancient Chinese people. You too can try it out on the online editor, download a compiler, or view the source code.

The home page then goes through "Syntax", "Compilation", and "Get (Source Code; Online Editor; Reference".

Read the rest of this entry »

Permalink Comments (18)

Pinyin to Hanzi Two Way Conversions

December 17, 2019 @ 7:22 am· Filed by Victor Mair under Language and computers, Writing systems

Apollo Wu, who was a long-term translator at United Nations headquarters, sent me the following note:

Dear Victor,

I wish to acquire a language tool for two way conversions between Pinyin and Hanzi texts. Do you know if any do exist? I sometimes write Pinyin texts and want to convert them to characters for some Chinese readers who are not familiar with Pinyin.

Best!

Apollo

Read the rest of this entry »

Permalink Comments (30)

Emojis vs. emoticons

July 8, 2019 @ 3:44 pm· Filed by Victor Mair under Emojis and emoticons, Language and computers, Writing systems

Here's an emoji: 😻

Here's an emoticon: :‐)

As we will see below, the superficial resemblance of the two words is completely coincidental — even though they both have to do with the visual depiction of emotions and ideas in texts.

This post began as a comment to "Emoticons as writing" (7/7/19), but it soon became too long and too complex to fit in a comment, so it now receives separate treatment of its own.

Read the rest of this entry »

Permalink Comments (25)

Emoticons as writing

July 7, 2019 @ 5:12 pm· Filed by Victor Mair under Diglossia and digraphia, Emojis and emoticons, Language and computers, Writing, Writing systems

This morning I received this card from a friend:

Read the rest of this entry »

Permalink Comments (22)

Mandarin hospital robocalls

June 18, 2019 @ 12:28 pm· Filed by Victor Mair under Communication, Language and computers

Article in The Washington Post (6/18/19):

"Robocalls are overwhelming hospitals and patients, threatening a new kind of health crisis"

" … Many of the messages seemed to be the same: Speaking in Mandarin, an unknown voice threatened deportation unless the person who picked up the phone provided their personal information…."

Read the rest of this entry »

Permalink Comments (7)

The CCP's Learning / Learning Xi (Thought) app

May 25, 2019 @ 7:44 am· Filed by Victor Mair under Language and computers, Language and politics

A couple of nights ago, I had dinner with one of my students from China and his parents, both of whom are members of the Chinese Communist Party (CCP). The father is a doctor and has to work 10 hours a day, during which he sees a hundred patients every day. Most of them are suffering from diabetes. At the end of his long day, the father is required (i.e., not optional) to log into the Party's Xuéxí / Xué Xi 学习 ("Learning / Learn Xi [Thought]") app — full name “Xuéxí / Xué Xi qiángguó 学习强国” ("Learning / Learn Xi [Thought]" to strengthen the nation"), which was launched in the early part of 2015.

Read the rest of this entry »

Permalink Comments (29)

Odevity or parity

May 15, 2019 @ 3:18 pm· Filed by Victor Mair under Language and computers, Language and mathematics, Numbers

[This is a guest post by Jeffrey Shallit]

A Chinese student here at Waterloo used the term "odevity" for what English-speaking computer scientists typically call "parity" — the property of an integer being odd or even.

I had never heard this term before, so I used Google Scholar to look at where it is being used. It is used almost exclusively by Chinese engineers, mathematicians, and computer scientists. The first usage I was able to find with Google Book Search was in 1972, obtained with this search.

Read the rest of this entry »

Permalink Comments (23)

Digitizing specialized language dictionaries

April 30, 2019 @ 11:07 am· Filed by Victor Mair under Dictionaries, Language and computers, Language and food, Lexicon and lexicography

[The following is a guest post by David Dettmann. The "Schwarz Uyghur dictionary" to which he refers in the third paragraph is this: Henry G. Schwarz, An Uyghur-English dictionary (Bellingham, Washington: Center for East Asian Studies, Western Washington University, 1992).]

It is a bit of a nerdy obsession of mine to customize my computers to comfortably use languages that I've studied.

About 10 years ago, I got relatively proficient with using optical character recognition (OCR) software and scanner hardware. Any time I found an essential dictionary for the languages I studied, I converted them to unicode OCR scans in pdf format (i.e., converting images of pages to text). I later used that data to create dictionary content files that would work together with the Mac OS dictionary application. I did this process with several dictionaries that I found essential while I studied Kazakh, Uzbek, and Uyghur.

This process was particularly useful for me to use the Schwarz Uyghur dictionary. I could not get used to the alphabetical order that he favored (which was different from typical Latin order AND Uyghur Arabic script order). As a result, any lookup would just take forever. That said, the formatting of each page was quite pleasant, and there were some nice illustrations of plants of traditional Uyghur medicine as well as handy keys at the bottom of each page to explain abbreviations.

Read the rest of this entry »

Permalink Comments (5)

Phonetic annotations as a welcome aid for learning how to read and write Sinographs

April 26, 2019 @ 7:51 am· Filed by Victor Mair under Language and computers, Language and education, Language teaching and learning, Phonetics and phonology, Reading, Writing, Writing systems

In several recent posts, we've been discussing the most efficient, least painful way to acquire facility with hanzi / kanji / hanja 漢字 ("Sinographs; Chinese characters"). Lord knows there are endless numbers of them and they are so intricately constructed that it is an arduous task to master the two thousand or so that are necessary for basic literacy.

It would be so much easier to learn the Sinographs if language pedagogues would provide phonetic annotations for each character. Better yet, the phonetic annotations should be divided into words with spaces between them according to the official orthographic rules.

Read the rest of this entry »

Permalink Comments (26)

Korean inputting on cellphones

April 8, 2019 @ 3:53 pm· Filed by Victor Mair under Language and computers, Writing, Writing systems

For the first time in my life, I closely observed someone inputting Korean on a cell phone. (I was sitting behind the person doing it on the train ride to the city this afternoon.) Of course, I don't know exactly how it works, but what I observed was very interesting.

First of all, the young woman's phone had a special feature I've never seen in any other type of inputting. Namely, she could use a little, built-in, popup, electronic magnifying glass to hover over a particular syllable block that she had composed to inspect it carefully to see that she had formed it correctly. She did this fairly often.

Next, she seemed to spend a lot of time typing and retyping individual syllable blocks to make sure she got them right.

Read the rest of this entry »

Permalink Comments (22)