Archive for Classification

More Metadata Muddles on Google Books

Mark's discovery of a mistitled Google Books entry—a book on experimental theater filed as a 2009 book on management—is entertaining but not that unusual. Like the other metadata mixups at Google books (involving authorship, genre classification and publication date, among other things) that I enumerated in a 2009 post "Google Books: A Metadata Train Wreck," there are probably thousands of cases in which the metadata for one book is associated with an entirely different work. Or at least that's what induction suggests; Paul Duguid and I have happened on quite a number of these, some as inadvertantly comical as Mark's example. Clicking on the entry for a book called Tudor Historical Thought turns up the text of a book on tattoo culture, the entry for an 1832 work on the question of whether the clergy of the Church of England can receive tithes turns up a work by Trotzky, the entry for Last Year at Marienbad turns up the text of Sam Pickering's Letters to a Teacher, and so on (see more examples below the fold). What's particularly interesting about Mark's example, though, is that the work is similarly misidentified on Amazon and Abe Books, which indicates that for many modern titles, at least, the error is likely due not to "some (perhaps algorithmic) drudge on the Google assembly line," as Mark suggests, but to one of the third-party offshore cataloguers on which Google and others rely for their metadata.

Read the rest of this entry »

Comments (9)

Is Cantonese a language, or a personification of the devil?

Whether Cantonese is a language or a dialect is a subject that we have touched upon many times on Language Log, e.g., "Spoken Hong Kong Cantonese and written Cantonese" (see especially the remarks in the second half of the original post) and "English is a Dialect of Germanic; or, The Traitors to Our Common Heritage ."

But now it has become a hot-button issue in China, especially in Hong Kong, where the government's Education Bureau recently made a monumental gaffe by declaring that Cantonese was not an official language of the Special Administrative Region:  "Education Bureau rapped over Cantonese 'not an official language' gaffe:  Claim Cantonese 'not an official language' leaves public lost for words."

Here's an article in Chinese on the uproar that followed the announcement of the Education Bureau that Cantonese is not an official language of Hong Kong.

Read the rest of this entry »

Comments (89)

Poetry as "Word Temple" — NOT

Andrew Shields encountered the idea — on Facebook and vigorously promoted on this blog — that the Chinese character for poetry, shī 诗, consists of two parts meaning "word" and "temple".  Furthermore, it is claimed that this is a particularly apt way to represent the notion of poetry, one that is conspicuously missing in Western culture.

Such a facile interpretation commits several fallacies, the chief of which is to misunderstand the history and nature of the character in question.  After a careful examination of the evidence, it seems far more likely that shī 诗 has to do with the ritual performance of the odes by eunuchs, i.e., by reciters or singers who were castrati, than that it means "word" + "temple".

Read the rest of this entry »

Comments (46)

Signifying the Local

I just found out about this new book on local languages in China.  Judging from the abstract and table of contents, it looks very interesting and promising: Signifying the Local: Media Productions Rendered in Local Languages in Mainland China in the New Millennium. The publisher's blurb:

In Signifying the Local, Jin Liu examines contemporary cultural productions rendered in local languages and dialects (fangyan) in the fields of television, cinema, music, and literature in Mainland China. This ground-breaking interdisciplinary research provides an account of the ways in which local-language media have become a platform for the articulation of multivocal, complex, and marginal identities in post-socialist China. Viewed from the uniquely revealing perspective of local languages, the mediascape of China is no longer reducible to a unified, homogeneous, and coherent national culture, and thus renders any monolithic account of the Chinese language, Chineseness, and China impossible.

Read the rest of this entry »

Comments (6)

Zazaki: a West Iranian language

In the midst of our ongoing debates about whether Cantonese, Shanghainese, Taiwanese, and so forth are Sinitic languages or dialects, I continually find evidence that the custom of referring to them only as "dialects" is exceptional when compared with linguistic usage elsewhere (e.g., India, Europe, Africa).

Today I came across an Iranian language that I'd never heard of before, Zazaki, although — without knowing it — I probably met some of its speakers in Sweden, where there are many  Zazak refugees.  Also called Zaza, Kirmanjki, Kirdki, Dimli, and Dimili, Zazaki is found primarily in eastern Anatolia.  It belongs to the northwestern branch of the Iranian group of the Indo-European family.

Read the rest of this entry »

Comments (43)