Archive for Transcription

Mongolian text-to-speech, online transliterator of Cyrillic to classical script

From IA:

By way of introduction to what you see below under the asterisks, regarding the (not-always) technical reasons for the paucity of webpages in Mongolian script, see some of the comments here, especially the one at the top (Greg Pringle).

I might mention that the president of Mongolia's webpage in Mongolian script — which he links to — only displays correctly for me in Chrome, not in Firefox and not on my iPhone (Safari).

Read the rest of this entry »


Nebraska: "Flat Water"

When you hear the name "Nebraska", the first thing you think of is probably "corn" and "cornhuskers"; at least that was what always passed through my mind.

No longer.  Now having come roughly halfway across this long (430 miles) state and finding myself in Central City, I have gained a keen (I would even say "palpable") sense that it means "flat river".  That's because, from one end to the other, I'm following Route 30 / Lincoln Highway, and it was easy for the surveyors who laid out the Lincoln Highway (our nation's first transcontinental road) to follow the Platte River.  As you may have guessed, and as I did long ago, "platte" is French for "flat", and that decidedly is what this river is all about:  flat, flat, flat.

Read the rest of this entry »


Unknown language #10, part 2

[This is a guest post by Martin Schwartz.]

"Unknown language #10" (12/1/17) left all stumped, including a broad range of superb scholars of many languages.  I have no Rosetta Stone for it, but have something that may be called a Russetta or Rusetta (as in ruse) Bone.

First, the mystery text, which was the focus of Language Log "Unknown language #10".  I reproduce it here as it was transmitted there:

Ukhant karapet qulkt kirlerek
Iqat ighun chapuq sireleq,
Poghtu Paghytei Piereleq
Azlayn qoghular eliut karapet.

Now, for comparison with the above, I give a set of verses found in Aleksandr Kuprin's Russian novel Jama ('The Pit'), 1909-1915:

U Karapeta est' bufet
Na bufete est' konfet,
Na konfete est' portret
Ètot samyj Karapet.

'Karapet has a buffet
On the buffet is a bonbon (vel sim.)
On the bonbon is a portrait,
It's the very same Karapet.'

Read the rest of this entry »


Spelling Manchu with Chinese characters

Read the rest of this entry »


Mixed script writing in Taiwan, part 2

[This is a guest post by Kirinputra]

To take a step or two back towards the Sad Cripples theme, I had the TV on the other day and the show host — echoing the guest, a dietitian — said this:

“Lán Tâi-oân lâng kóng ‘tāu-leng’; gōa-kok lâng kóng ‘tó͘⁺-nài.’”

(Can't remember if he used Tone 3 or Tone 5 on the last syllable.) This could also be written like this:

“Lán Tâi-oân lâng kóng ‘tāu-leng’; gōa-kok lâng kóng ‘豆奶’ (DÒUNǍI).”

= 咱台灣人講「豆乳」。外國人講『豆奶』(DÒUNǍI)。

Again, he wasn't making a point; he was just summarizing an offhand remark the guest had just made to the same effect. While he seems to be referring to Mandarin speakers as foreigners — and they are, in a meaningful sense — there is no way he meant that. Rather, he & the guest were ultimately both referring to the English word soy milk, but calqued into Mandarin as 豆奶 — also a word in Mandarin, but not generally used in Formosa.

Read the rest of this entry »


Tocharo-Sinica

Language Log has been fortunate to have had several guest posts and numerous comments by Douglas Adams, doyen of Tocharian studies in America (see "Selected readings" for a sampling).  Now, stimulated by the recent post on Chinese chariotry, he has written the following ruminations in response.

I read with interest the material on early Chinese chariotry.  It was far outside my competence to judge.  As you knew, I was most interested in the comment that was looking to the possibility of Tocharian > Chinese lexical borrowings.  As you also know, it has long been my suspicion that there was more west > east influence on Chinese language and culture than is generally realized.  And the "westerners" involved were most likely to have been Tocharians of one sort or another ("Tocharian D"?).  It's probably not only PIE pigs and honey that, via Tocharian, show up in Chinese.

Read the rest of this entry »


Perso-Arabic script for Mandarin, Pe̍h-ōe-jī for Taiwanese: sad cripples?

We have been intrigued by Iskandar Ding since encountering him on X/Twitter a while back, several posts from his account having made it onto Language Log (see "Selected readings").

With a handle like his, where Iskandar is the Persian form of the name of the Macedonian king Alexander the Great (356–323 BC) and Ding has an unmistakable Sinitic / Hannic ring to it, we suspected from the start that he is a Hui (Chinese Muslim).  So far, though, we have not been able to track down the sinographs for his full Hannic name, which is a bit unusual, given that even Mongols, Uyghurs, Tibetans, and other non-Sinitic people are compelled to take Sinographic names.

Iskandar Ding is currently writing a Ph.D. dissertation on Yaghnobi linguistics and culture at SOAS in London (see this page for basic information about him).

Yaghnobi is an Eastern Iranian language spoken in the upper valley of the Yaghnob River in the Zarafshan area of Tajikistan by the Yaghnobi people. It is considered to be a direct descendant of Sogdian and has sometimes been called Neo-Sogdian in academic literature.

(source)

Here is a 53 second video of ID announcing a talk on Perso-Arabic-script Hannic.

Here is the 43:44 talk ("Xiao’erjing – Writing Chinese with Perso-Arabic Letters" – Iskandar Ding | PG 2022) as it actually happened.  IA comment:  "What he said throughout the talk was pleasing now and then — saying 'Eastern Turkistan' in Uighur for example…".

Here is a 1 hour 22 minute interview with ID. If you click the link, it will open at the 18:11 mark, where he speaks of Perso-Arabic-script Hannic. That part ends at 22:40.

The above is based largely on information provided by IA, and the following quotes IA directly:

Read the rest of this entry »


Spelling with Chinese character(istic)s, pt. 5

Serious questions about "religion" in Sinitic

Below the fold is for advanced specialists in Chinese philology, theology, and lexicography.  Even for them, it is recommended that readers prepare themselves by reviewing "Spelling with Chinese character(istic)s, pt. 4" (7/4/16).

[This is a guest post by IA]

Read the rest of this entry »


Sound over symbol (and meaning)

Zach Hershey called to my attention a phenomenon concerning the relationship between speech and writing (and meaning): in many cases, speakers of Sinitic languages have sounds in mind rather than characters.  I had long suspected that this might well be true, and had even collected plentiful evidence in support of it, but I was never absolutely certain.  Now, with information provided by Zach, we have proof that Sinitic speakers in some cases are indeed thinking of sounds separately from (apart from) hanzi.

Read the rest of this entry »


Pinyin resurgent

Hopefully.

Some exciting news.

A member of the PRC's National Committee of the Chinese People's Political Consultative Conference (the yearly meeting of which is taking place in Beijing right now) is urging schools to increase the time spent teaching Pinyin (currently 4-6 weeks) to a semester or even longer to help ensure more students have a solid foundation in this skill. Intriguingly, there's also a mention of using more "texts."

Here's an account of what's happening:

"Schools should spend more time teaching Pinyin: PRC politician", Pinyin News (3/7/24)

Xu Xudong (徐旭東/徐旭东), a member of the Chinese People’s Political Consultative Conference and a professor at Central China Normal University in Wuhan, is advocating that public schools in China allocate substantially more time to the teaching of Hanyu Pinyin.

Read the rest of this entry »


Spontaneous SCOTUS

Years ago, Jerry Goldman (then at Northwestern) created the oyez.org website as

 a multimedia archive devoted to making the Supreme Court of the United States accessible to everyone. It is the most complete and authoritative source for all of the Court’s audio since the installation of a recording system in October 1955. Oyez offers transcript-synchronized and searchable audio, plain-English case summaries, illustrated decision information, and full-text Supreme Court opinions

He rescued decades of tapes and transcripts from the National Archives, digitized and improved them, and arranged the website's interactive presentations of the available recordings. Jiahong Yuan and I played a role by devising and validating a program to identify which justice was speaking when (see "Speaker Identification on the Scotus Corpus", 2008).

More recently, Jerry has inspired an effort to recreate oral arguments from famous cases that took place before the recording system was installed, starting with Brown v. Board of Education. Rejecting the idea of producing "deep fakes" using the existing transcripts and extant recordings of the justices involved, he and his colleagues decided to create what we might call "shallow fakes", where actors will perform (selections from) the transcripts, and a voice morphing system will then be used to make their recordings sound like the target speakers. The recreated clips will be embedded in explanatory material.

All the scripts have been written, and in a few months, you'll be able to hear the results — which I expect will be terrific.

Read the rest of this entry »


Jumbled pinyin

I spotted this not-too-old post on Stephen Jones: a blog, "Interpreting pinyin" (10/9/17).

Read the rest of this entry »


Handsome court — translation / transcription hybrid

Schematic map of bus stops in the vicinity of Lingnan University, Tuen Mun (below Castle Peak), Hong Kong.  Note the tenth stop outbound, which is "Handsome Court" (to be explained below):

Read the rest of this entry »
