Archive for Parsing

Spacing in Korean

The role of a Scotsman, John Ross (1842-1915), in creating it.  Although he was a Christian missionary who spent over half his life in China, he was apparently a gigachad.

The following video is densely packed with solid information and moves rapidly, so you have to pay close attention to follow it.

Read the rest of this entry »

Comments (8)

The full name of Bangkok

@kattoksthai

Replying to @Mamba Did you know that Bangkok has the longest city name in the world? I dare you to say it too! #bangkok #thailand #thai

♬ original sound – Kat Talks Thai

Read the rest of this entry »

Comments (27)

"Written Cantonese must have word segmentation"

That's the title of an essay that appeared in my e-mail today from an outfit called Cantonese Script Reform 粵字改革.  Here's what they say:

Written Cantonese must have spaces, like Korean. The calligraphic issue must give way. For the space itself is a grammatical marker that marks the beginning and the end of a word. This tool of demarcation will allow poet and playwright to invent new words by putting words together within the confinements delineated by the spaces between words. Written Cantonese needs all the tools imaginable for it to revitalise and resurrect its lost vocabulary. A Hebrew-esque recycling off ancient words for purposes anew is the way to go. But we can’t do that if we can’t tell if this is a new word because we can’t tell if these characters familiar so and so sequenced are merely a fanciful poetic playful arrangement or other mark of the invention of a new word, where a familiar noun is turned into a verb or verb is turned into an adjective or an adjective is now henceforth interpreted as a noun in this particular context.

Read the rest of this entry »

Comments (33)

Word division and computer lockouts

Random storefront in Taiwan:

Read the rest of this entry »

Comments (12)

Human Washing Machine

Headline on NDTV, Nov. 29, 2025: "Japan Unveils Human Washing Machine, Now You Can Get Washed Like Laundry."


Read the rest of this entry »

Comments (15)

Parsing of a fated kin tattoo

 

It's been a while since I've written about Chinese tattoos, although years ago they used to be a staple subcategory of our Chinglish-themed posts.

This intriguing one is too good to pass up:

Read the rest of this entry »

Comments (34)

What good are kanji?

Why Do Japanese Still Use Kanji? Complicated Writing System…

Read the rest of this entry »

Comments (8)

Words, morphemes, collocations, characters

We've met Julesy before:  "The conundrum of singing with tones" (5/30/25).  She has a Ph.D. in linguistics and knows how to communicate her scientific knowledge of Mandarin to intelligent laypersons.  Here she is again, this time telling us some very important things about the differences between words and characters:

Read the rest of this entry »

Comments (22)

Hangul for Cantonese

Comments (3)

Multiple possible parsings of strings of sinographs

[line spacing was difficult with this one]

Chinese signs collected by Zeyao Wu:

本店/有/嬰兒被/賣 or 本店/有/嬰兒/被賣
běn diàn yǒu yīng'ér bèi                                  mài
this shop has baby   passive signifier; blanket for sale
"this shop has baby blankets for sale" or "this shop has had babies for sale"

Read the rest of this entry »

Comments (2)

Garden paths galore

In two successive comments on different posts (here and here), Jarek Weckwerth asserts that this garden path post is "a timely follow-up" to the exuberant discussion on the parsing of a Classical Chinese / Literary Sinitic (CC/LS) book title that took place in this post and the plethora of readers' remarks that followed it.  This is an interesting proposition, and it makes me wonder if CC/LS is prone to this sort of ambiguity because of the inexplicitness of its grammar.

During the more than half a century that I have been studying and teaching CC/LS, it has always seemed to me that checking out different possible "garden paths" is a sine qua non for responsible reading of such texts.

Read the rest of this entry »

Comments (2)

Ravens on the garden path

I just ran across a particularly impressive garden path sentence in Bernd Heinrich's book RAVENS IN WINTER (p. 268); it took me several tries to get this sentence to parse grammatically:

"Even the wolverine is said to do nothing to drive ravens off that land beside it and steal its food."

(Of course parsing is no problem if the sentence is spoken.  But in written form, for me at least, "and steal its food" just didn't seem to fit at first.  My mis-parse was reading "off" as the head of a prepositional phrase.)

Comments (19)

Chinese characters and the messiness of Chinese culture

Is it really so?

Uncannily and independently, Apollo Wu* sent me the following note before I made this post:

Hànzì bǐ bù shàng zìmǔ wénzì de guānjiàn lǐngyù zàiyú páixù jiǎnsuǒ hé réngōng zhìnéng děng fāngmiàn. Fùzá fánsuǒ nán xué nán yòng shì dāngqián miàn duì de kùnnán. Hànzì wú xù gěi Zhōngguó wénhuà dǎshàng língluàn de làoyìn!

汉字 比不上 字母文字 的 关键 领域 在于 排序 检索 和 人工智能 等 方面。复杂 繁琐 难学难用 是 当前 面对的 困难。汉字 无序 给 中国 文化 打上 凌乱 的 烙印!

Google Translate:

The key areas where Chinese characters are not as good as alphabetic characters are sorting, retrieval and artificial intelligence. Complicated, cumbersome, difficult to learn and difficult to use are the difficulties we are currently facing. The disorder of Chinese characters marks Chinese culture as messy!

Read the rest of this entry »

Comments (10)