Archive for Language on the internets

Conmen

Gotta be careful when you pick your URL, otherwise something like this might happen to you.

The Chinese Confucius and Mencius Association of Taiwan has the following URL for their website:

CONMEN.ORG.TW

Read the rest of this entry »

Comments (23)

Sound rules

Stephen Halsey, who is spending the year in Taiwan doing research, observed an interesting linguistic phenomenon that shows the predominance of sound over symbol, even in the writing of Chinese, where the symbols are complex and semantically "heavy" in comparison to phonetic scripts like the Roman alphabet or bopomofo / zhuyin fuhao (Mandarin phonetic symbols), where the symbols are simple and semantically "light".

Read the rest of this entry »

Comments (18)

Cat and mouse on the Chinese internet

Yesterday in the Washington Post, there was an enticing article by Anna Fifield:  "These are the secret code words that let you criticize the Chinese government" (7/29/15).

Fifield states that she is drawing on "Decoding the Chinese Internet: A Glossary of Political Slang," by authors Perry Link and Xiao Qiang.  Comment by Perry Link:  "This is good work, and I am happy to have my name associated, but it is not my work.  Ms Fifield somehow made a mistake."

Read the rest of this entry »

Comments (1)

Your country: pronominal resistance

When Westerners begin to study Chinese, Japanese, or Korean, a small obstacle that confronts them is the fact that the words for "my / our country" in these languages usually have to be translated as "China", "Japan", and "Korea" respectively in English.  As a colleague who knows all three languages put it, "I'm always struck by the oddness and even slight ungrammaticality of the English usage 'in my country' that you hear from C J K speakers."

We looked at this phenomenon in some depth a couple of years ago:

"My country" (1/23/13)

Now an extremely interesting new twist with regard to this concept of "my / our country" has arisen in China that merits another look.

Read the rest of this entry »

Comments (29)

Fake account spotting on Facebook

One language-related story in the British press over the weekend was that Gavin McGowan was threatened by Facebook with having his account shut down… because they said his name was fake.

About ten years ago Gavin learned some Scottish Gaelic and started using the Gaelic spelling of his name: Gabhan Mac A Ghobhainn. Facebook is apparently running software designed to spot bogus accounts on the basis of the letter-strings used to name them. Gabhan's name evidently failed the test.

Read the rest of this entry »

Comments (36)

Since the beginning of history

I have mentioned chinaSMACK before on Language Log, but have never featured it so directly as in this post.  The reason is that this time there's an interesting language aspect to one of their articles that is hard to pass up.

chinaSMACK specializes in translating trenchant, amazing stories from the vast amount of traffic that flows through China's microblogs and on the internet more generally.  Sometimes they are so bizarre and surreal that my initial reaction upon reading them — after being shocked senseless or laughing myself silly — is to dismiss them as Onionesque.  But that is usually impossible because they are so well documented.  In the present case, there is an initial news report and five stunning photographs.  Because the photographs are so gross and graphic, just downright disgusting, I won't show them directly on Language Log (especially not during the holidays), but readers can go to the link and see them with their own eyes.

"Excrement Tanker Explodes, Covering Everyone in Human Waste"  (12/28/14)

Read the rest of this entry »

Comments (7)

A record-setting pangrammatic window

A few months ago, I posted here (and on Slate's Lexicon Valley blog) about PangramTweets, a bot created by Jesse Sheidlower that combs Twitter for tweets that include all 26 letters of the alphabet. I mentioned that it would be interesting to see if PangramTweets turns up any particularly short "pangrammatic windows," i.e., pangrammatic strings in naturally occurring text. At the time, the shortest known example was 42 letters long, in a passage from Piers Anthony's Cube Route:

"We are all from Xanth," Cube said quickly. "Just visiting Phaze. We just want to find the dragon."

My post inspired Malcolm Rowe, a software engineer at Google, to set about finding short pangrammatic windows in an automated fashion, first on the Project Gutenberg corpus and then on the megacorpus of web pages indexed by Google. (Let's hear it for Google's 20 percent time!) On his blog, Malcolm now reports on his findings, including the discovery of a 36-letter pangrammatic window that appeared in a review of the movie Magnolia on PopMatters:

Further, fractal geometries are replicated on a human level in the production of certain “types” of subjectivity: for example, aging kid quiz show whiz Donnie Smith (William H. Macy) and up and coming kid quiz show whiz Stanley Spector (Jeremy Blackman) are connected (or, perhaps, being cloned) in ways they couldn’t possibly imagine.

Read the rest of this entry »

Comments (14)

Emotional contagion

As usual, xkcd nails it:

Mouseover title: "I mean, it's not like we could just demand to see the code that's governing our lives. What right do we have to poke around in Facebook's private affairs like that?"

Read the rest of this entry »

Comments (12)

Is the Urdu script on the verge of dying?

Hindi-Urdu, also referred to as Hindustani, is the classic case of a digraphia, so much so that there has been a long-standing controversy over whether they are one language or two.  Their colloquial spoken forms are nearly identical, but when written down, the one in the Devanāgarī script, the other in the Nastaʿlīq script, they have a very different look and "feel".

Read the rest of this entry »

Comments (56)

Banned in Beijing

Everyone knows that the Chinese government goes to extraordinary lengths to police the internet (see: "Blocked on Weibo").

And most sentient beings are aware of the awesome fame of the Grass-Mud Horse, the notorious Franco-Croatian Squid, and and the mysterious River Crab.  You can find all of them in "Grass-Mud Horse Lexicon Classics".

Sometimes, the censors begin to look pretty ridiculous, as when they outlawed the word "jasmine" in 2011, particularly since it refers not just to the Jasmine Revolution, but also to a favorite flower, tea, and folk song.

mòlì 茉莉 ("jasmine")

mòlì chá 茉莉茶 ("jasmine tea") OR mòlìhuā chá 茉莉花茶 ("jasmine tea") OR xiāngpiàn 香片 ("scented [usually with jasmine] tea")

mòlìhuā 茉莉花 ("jasmine flower", name of a popular folk song; presidents Jiang Zemin and Hu Jintao were both excessively fond of this song, and there are videos of them singing it, so it becomes especially awkward to try to forbid citizens to use the word mòlì 茉莉 ("jasmine")

Read the rest of this entry »

Comments (28)

Bonfire beneficiaries

Subeditor Humphrey Evans points out to me that the grammar of phishing spam emails is getting worse and worse, rather than better. He recently saw one that contained this text:

The sum of (6.5M Euros only will be transfer into your account after the processing of all relevant legal documents with your name as the bonfire beneficiary, the transfer will be made by Draft or telegraphic Transfer (T/T), conformable in 3 working days as soon as you apply to the bank director.

That "bonfire beneficiary" bit is an eyebrow-raiser, isn't it? It seems to be an error for the Latin phrase bona fide "good faith".

Read the rest of this entry »

Comments off

PangramTweets

The Twitter API, beyond its great utility for corpus linguistics (see "On the front lines of Twitter linguistics," "The he's and she's of Twitter"), has made possible a lot of fun automated text-mining projects. One fertile area is algorithmic found poetry: there have been Twitter bots designed to find accidental haikus, and even more impressively, a bot named @Pentametron that finds rhyming tweets in iambic pentameter and fashions sonnets out of them.

And then there is found wordplay, which is its own kind of found poetry. I'm a big fan of @Anagramatron, which discovers paired tweets that form serendipitous anagrams of each other. (Example: "Last time I do anything" ⇔ "That's it. I'm dying alone.") Now, courtesy of Jesse Sheidlower, comes @PangramTweets, in which each tweet contains every letter of the alphabet at least once.

Read the rest of this entry »

Comments (8)

Cantonese poetry recitation

A recent issue (1/7/14) of the South China Morning Post (SCMP) carried an article by a staff reporter entitled "Hong Kong student's poem recital goes viral in the mainland ". The article features this amazing video of a Hong Kong high school student reciting a couple of Classical Chinese poems:


Read the rest of this entry »

Comments (19)