Long ago, in a narratology far away…

Louisa Shepard, "‘May the force be with you’ and other fan fiction favorites", Penn Today 12/18/2019:

Starting with Star Wars, Penn researchers create a unique digital humanities tool to analyze the most popular phrases and character connections in fan fiction. […]

The Penn team started with the script of “Star Wars: The Force Awakens” and created algorithms to analyze the words in the script against those in millions of fan fiction stories. The unique program identifies the most popular phrases, characters, scenes, and connections that are repurposed by these writers and then displays them in a simple graph format.

The results are now available on their “fan engagement meter” at https://fanengagement.org.

Serendipitously, today's xkcd:

Read the rest of this entry »

Comments (1)


Classical Chinese computing

Several colleagues called this article to my attention:

"Programming Language for the ancient Chinese"

Here's the introduction:

文言, or wenyan, is an esoteric programming language that closely follows the grammar and tone of classical Chinese literature. Moreover, the alphabet of wenyan contains only traditional Chinese characters and 「」 quotes, so it is guaranteed to be readable by ancient Chinese people. You too can try it out on the online editor, download a compiler, or view the source code.

The home page then goes through "Syntax", "Compilation", and "Get (Source Code; Online Editor; Reference".

Read the rest of this entry »

Comments (18)


Mrs. Transformer-XL Tittlemouse

This is another note on the amazing ability of modern AI learning techniques to imitate some aspects of natural-language patterning almost perfectly, while managing to miss common sense almost entirely. This probably tells us something about modern AI and also about language, though we probably won't understand what it's telling us until many years in the future.

Today's example comes from Zihang Da et al., "Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context", arXiv 6/2/2019.

Read the rest of this entry »

Comments (5)


Menu overtranslation

Bruce Rusk sent in this photograph taken at his local (Vancouver) Hong Kong-style congee joint: the English translation of the third item on the menu reads the Sinographs 龍崗 (lit., "dragon hillock / mound / [lookout] post / sentinel / sentry") as a Japanese toponym or family name, when they should be read in Cantonese (as the name of a neighborhood in Shenzhen, he believes).

Read the rest of this entry »

Comments (4)


Canoe schemata nama gary anaconda

Following up on recent posts suggesting that speech-to-text is not yet a solved problem ("Shelties On Alki Story Forest", "The right boot of the warner of the baron", "AI is brittle"), here's a YouTube link to a lecture given in July of 2018 by Michael Picheny, "Speech Recognition: What's Left?" The whole thing is worth following, but I particularly draw your attention to the section starting around 50:06, where he reviews the state of human and machine performance with respect to "noise, speaking style, accent, domain robustness, and language learning capabilities", with the goal to "make the case that we have a long way to go in [automatic] speech recognition".

Read the rest of this entry »

Comments (4)


Pinyin to Hanzi Two Way Conversions

Apollo Wu, who was a long-term translator at United Nations headquarters, sent me the following note:

Dear Victor,

I wish to acquire a language tool for two way conversions between Pinyin and Hanzi texts. Do you know if any do exist?  I sometimes write Pinyin texts and want to convert them to characters for some Chinese readers who are not familiar with Pinyin.

Best!

Apollo

Read the rest of this entry »

Comments (30)


Enteral fever

Fuchsia Dunlop has a real talent for finding these things (cf. "Explosion Cheese Durian Pie" [9/23/19]):

Read the rest of this entry »

Comments (1)


Crosstalk about topolects

In the last few days, we've been discussing the notion of "national language" and its relationship to other languages and topolects spoken in China.  Here's a famous 6:47 comic skit filmed in 1980 featuring the late Mǎ Jì 马季 and his straight man, Zhào Yán 赵炎, called "Guǎngdōng huà 广东话" ("Cantonese") (I will describe its contents below):

Read the rest of this entry »

Comments (6)


Multilingualism in Philadelphia's Chinatown

Sign spotted by Diana Shuheng Zhang on December 7, 2019:

Read the rest of this entry »

Comments (14)


Communicative disfluencies interpolations

In the past few days, I've encountered some nice examples of the communicative interpretation of what I've suggested we ought to call "interpolations" rather than "disfluencies".

Read the rest of this entry »

Comments (16)


"National Language" in the Xinjiang Uyghur Autonomous Region

Many people have been asking me about the use of the term Guóyǔ 国语 ("National Language") for "Mandarin" in Xinjiang today.  Here's an inquiry from Peter Moody:

I have encountered what seems to be an anomaly in contemporary Chinese usage, and have been assured that you are among those most capable of addressing it.

I was reading an analysis by a Darren Byler, a "Xinjiang Scholar," of a 2017 classified directive from Zhu Hailun, Gauleiter of Xinjiang, on how properly to run the concentration camps in that territory (https://supchina.com/2019/12/04/a-xinjiang-scholars-close-reading-of-the-china-cables/). (I have not looked either at the full English translation of these directives, or the Chinese text, although both are available. I figured the analysis would give the gist of them.)

Read the rest of this entry »

Comments (17)


Amazing new Japanese words

These come from the following nippon.com article:

"Pay It Forward: The Top New Japanese Words for 2019" (12/13/19)

I'll list the words first, then explain which one is my favorite.

A prefatory note:  nearly half of the words on these lists are based wholly or partly on borrowings from English, though they are assimilated into Japanese in such a manner that they are unrecognizable to monolingual English speakers.

Read the rest of this entry »

Comments (8)


Seeding Mars

Comments (3)