Language Log

Twitter length restrictions in English, Chinese, Japanese, and Korean

October 7, 2017 @ 6:45 am· Filed by Victor Mair under Language and computers, Writing systems

Josh Horwitz has a provocative article in Quartz (9/27/17): "SAY MORE WITH LESS: In Japanese, Chinese, and Korean, 140 characters for Twitter is plenty, thank you"

The thinking here is muddled and the analysis is misplaced. There's a huge difference between "characters" in English and in Chinese. We also have to keep in mind the difference between "word" and "character", both in English and in Chinese. A more appropriate measure for comparing the two types of script would be their relative "density", the amount of memory / code space required to store and transmit comparable information in the two scripts.

Read the rest of this entry »

Permalink Comments (38)

Learning languages is so much easier now

August 18, 2017 @ 9:38 pm· Filed by Victor Mair under Dictionaries, Language and computers, Language teaching and learning, Pedagogy, Writing systems

If you use the right tools, that is, as explained in this Twitter thread from Taylor ("Language") Jones.

A brief thread on how kids have it (in this case, language learning) easier these days.

When I first studied Chinese in college… 1/

— Language Jones (@languagejones) August 17, 2017

Rule number 1: Use all the electronic tools at your disposal.

Rule number 2: Do not use paper dictionaries.

Jones' Tweetstorm started when he was trying to figure out the meaning of shāngchǎng 商场 in Chinese. He remembered from his early learning that it was something like "mall; store; market; bazaar". That led him to gòuwù zhòngxīn 购物中心 ("shopping center"). With his electronic resources, he could hear these terms pronounced, could find them used in example sentences, and could locate actual places on the map designated with these terms.

Read the rest of this entry »

Permalink Comments (14)

Neglected email

August 8, 2017 @ 3:57 am· Filed by Geoffrey K. Pullum under Changing times, Language and computers, Language and technology

Some charmingly reflective and sincere writing in the latest xkcd comic as Cueball types a reply to a long-neglected email correspondent:

Dear Kevin,
I'm sorry it's taken me two years to reply to your email. I've built up so much stress and anxiety around my email inbox; it's an unhealthy dynamic which is more psychological than technical. I've tried one magical solution after another, and as each one has failed, deep down I've grown more certain that the problem isn't email – it's me.

Regardless, these are my issues, not yours; you're my friend, and I owe you the basic courtesy of a response. I apologize for my neglect, and I hope you haven't been too hurt by my failure to reply.

Anyway, I appreciate your invitation to join your professional network on LinkedIn, but I'm afraid I must decline…

The mouseover alt text says: "I would be honored, but I know I don't belong in your network. The person you invited was someone who had not yet inflicted this two-year ordeal upon you. I'm no longer that person."

Read the rest of this entry »

Permalink Comments off

Banned by Beijing

August 3, 2017 @ 9:29 pm· Filed by Victor Mair under Language and computers, Language and politics, Puns

Just saw this great post by the editors of supchina:

"Here are all the words Chinese state media has banned: A full translation of the style guide update from Xinhua, and why it matters." (8/1/17)

We can be grateful to the editors for their reliable translations, complete with Chinese characters and Hanyu Pinyin romanizations, with word spacing and tonal diacritics.

The list is divided into sections on "Politics and society" (including politically incorrect and vulgar terms), "Law", "Religion and society", "Hong Kong, Macau, Taiwan, territory, and sovereignty", and "International relations". Specialists in all of these areas will have a field day examining these sensitive terms and analyzing their political, social, and cultural implications. I encourage everyone who has an interest in contemporary China to avail themselves of this extraordinary opportunity to get inside the most fundamental level of the censorial apparatus of the Communist Chinese state.

Read the rest of this entry »

Permalink Comments (10)

Attribution of the WannaCry ransomware to Chinese speakers

May 26, 2017 @ 9:59 pm· Filed by Victor Mair under Dialects, Errors, Found in translation, Language and computers, Phonetics and phonology, Translation

The notorious WannaCry malware infestation began on Friday, May 12, 2017 and spread rapidly throughout the world, infecting hundreds of thousands of computers and causing major damage. Speculation concerning the identity of the perpetrators focused on North Korea, but the supposed connection was never convincingly demonstrated, and there were no other serious suspects.

Yesterday, Jon Condra, John Costello, and Sherman Chu published a stunning report which suggests that the authors of WannaCry — or someone they hired — spoke fluent Chinese:

"Linguistic Analysis of WannaCry Ransomware Messages Suggests Chinese-Speaking Authors" (Flashpoint [5/25/17])

Read the rest of this entry »

Permalink Comments (17)

Similes for quality of computer code

May 11, 2017 @ 12:19 pm· Filed by Geoffrey K. Pullum under Humor, Insults, Language and computers, Language play, Rhetoric

I must admit to having enjoyed the series of savage similes about quality of computer program code presented in three xkcd comic strips. They show a female character, known to aficionados as Ponytail, reluctantly agreeing to take a critical look at some code that the male character Cueball has written. Almost at first sight, she begins to describe it using utterly brutal similes. In the first strip (at http://xkcd.com/1513) she announces that reading it is "like being in a house built by a child using nothing but a hatchet and a picture of a house." But Ponytail is not done: there is more bile and contempt where that came from.

Read the rest of this entry »

Permalink Comments off

Veggies for cats and dogs

May 1, 2017 @ 8:48 pm· Filed by Victor Mair under Awesomeness, Information technology, Language and computers, Language and food, Lost in translation

This video was passed on by Tim Leonard, who remarks, "real-time video translation at its best":

Read the rest of this entry »

Permalink Comments (8)

The miracle of reading and writing Chinese characters

March 26, 2017 @ 9:24 am· Filed by Victor Mair under Information technology, Language and computers, Language teaching and learning, Writing, Writing systems

We have the testimony of a colleague whose ability to write Chinese characters has been adversely affected by her not being able to visualize them in her mind's eye. See:

"Aphantasia — absence of the mind's eye" (3/24/17)

This prompts me to ponder: just how do people who are literate in Chinese characters recall them?

Read the rest of this entry »

Permalink Comments (26)

Password nerdview

February 9, 2017 @ 4:43 pm· Filed by Geoffrey K. Pullum under Language and computers, Lost in translation, Nerdview

Steve Politzer-Ahles was trying to change his password on the Hong Kong Polytechnic University system, and found himself confronted with this warning:

You may not use the following attribute values for your password:
puAccNetID
puStaffNo
puUserGivenName
puUserSurname

Attribute values? This is classic nerdview.

Read the rest of this entry »

Permalink Comments off

Why electronic machine translation services sometimes seem to fail

January 29, 2017 @ 8:38 am· Filed by Victor Mair under Borrowing, Diglossia and digraphia, Language and computers, Style and register, Topolects, Translation

The inability of Google Translate, Microsoft Translator, Baidu Fanyi, and other translation services to correctly render jī nián dàjí 鸡年大吉 ("may the / your year of the chicken be greatly auspicious!") in various languages points up a vital distinction that I have long wanted to make, and now is as good a time as ever. Namely, just as you could not expect these translation services to handle Cantonese, Shanghainese, Taiwanese, etc. (unless specifically and separately programmed to do so), we should not expect them to deal with Literary Sinitic / Classical Chinese (LS / CC).

Read the rest of this entry »

Permalink Comments (10)

Finding non-Roman letters and characters in an MS Word document

December 10, 2016 @ 3:07 pm· Filed by Victor Mair under Language and computers, Typography

Somebody asked Mark Swofford to help her devise a speedy, easy way to locate all the Chinese characters in a book-length manuscript that she was working on. Mark set to work on the problem, and this is what he came up with:

"How to find Chinese characters in an MS Word document" (12/10/16)

Read the rest of this entry »

Permalink Comments (9)

Offal is not awful

December 9, 2016 @ 10:51 pm· Filed by Victor Mair under Language and computers, Language and culture, Language and food

My son sent me this wonderful, learned post called "The best bits" from the "Old European culture" blog (12/7/2015). It begins:

Offal, also called variety meats or organ meats, refers to the internal organs and entrails of a butchered animal. The word does not refer to a particular list of edible organs, which varies by culture and region, but includes most internal organs excluding muscle and bone.

The word shares its etymology with several Germanic words: Frisian ôffal, German Abfall (offall in some Western German dialects), afval in Dutch and Afrikaans, avfall in Norwegian and Swedish, and affald in Danish. These Germanic words all mean "garbage", or —literally— "off-fall", referring to that which has fallen off during butchering. However, these words are not often used to refer to food with the exception of Afrikaans in the agglutination afvalvleis (lit. "off-fall-meat") which does indeed mean offal. For instance, the German word for offal is Innereien meaning innards. According to the Oxford English Dictionary, the word entered Middle English from Middle Dutch in the form afval, derived from af (off) and vallen (fall).

Read the rest of this entry »

Permalink Comments (9)

Mystery modal window error message

December 5, 2016 @ 10:57 am· Filed by Geoffrey K. Pullum under Announcements, Errors, Language and computers, Language and technology, WTF

Almost every day, when looking through the headlines on Google News, I see one or two stories where what's meant to be a snippet from the first paragraph of the story contains not a single word from the story but instead says this:

This is a modal window. This modal can be closed by pressing the Escape key or activating the close button. Close Modal Dialog. This is a modal window.

Read the rest of this entry »

Permalink Comments (31)

Archive for Language and computers

Twitter length restrictions in English, Chinese, Japanese, and Korean

Learning languages is so much easier now

Neglected email

Banned by Beijing

Attribution of the WannaCry ransomware to Chinese speakers

Similes for quality of computer code

Veggies for cats and dogs

The miracle of reading and writing Chinese characters

Password nerdview

Why electronic machine translation services sometimes seem to fail

Finding non-Roman letters and characters in an MS Word document

Offal is not awful

Mystery modal window error message

Follow us on Twitter

Archives [+/–]

Blogroll [+/–]

Meta