Archive for Decipherment

Reading Old Turkic runiform inscriptions with the aid of 3D simulation

"Augmenting parametric data synthesis with 3D simulation for OCR on Old Turkic runiform inscriptions: A case study of the Kül Tegin inscription", Mehmet Oğuz Derin and Erdem Uçar, Journal of Old Turkic Studies (7/21/24)

Abstract

Optical character recognition for historical scripts like Old Turkic runiform script poses significant challenges due to the need for abundant annotated data and varying writing styles, materials, and degradations. The paper proposes a novel data synthesis pipeline that augments parametric generation with 3D rendering to build realistic and diverse training data for Old Turkic runiform script grapheme classification. Our approach synthesizes distance field variations of graphemes, applies parametric randomization, and renders them in simulated 3D scenes with varying textures, lighting, and environments. We train a Vision Transformer model on the synthesized data and evaluate its performance on the Kül Tegin inscription photographs. Experimental results demonstrate the effectiveness of our approach, with the model achieving high accuracy without seeing any real-world data during training. We finally discuss avenues for future research. Our work provides a promising direction to overcome data scarcity in Old Turkic runiform script.
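For readers curious what "distance field variations" plus "parametric randomization" might look like in practice, here is a minimal sketch. It is not the authors' pipeline: the stroke coordinates, jitter amount, and stroke-width range are all hypothetical values chosen for illustration, the glyph is a stand-in rather than a real runiform grapheme, and the 3D rendering and Vision Transformer stages are not shown.

```python
import math
import random

# Hypothetical glyph: a few strokes (line segments) in the unit square.
# This shape is an illustrative stand-in, not an actual Old Turkic grapheme.
GLYPH_STROKES = [((0.5, 0.1), (0.5, 0.9)), ((0.5, 0.5), (0.8, 0.2))]


def segment_distance(px, py, a, b):
    """Euclidean distance from point (px, py) to the segment a-b."""
    (ax, ay), (bx, by) = a, b
    dx, dy = bx - ax, by - ay
    length_sq = dx * dx + dy * dy
    if length_sq == 0.0:
        return math.hypot(px - ax, py - ay)
    # Project the point onto the segment and clamp to its endpoints.
    t = max(0.0, min(1.0, ((px - ax) * dx + (py - ay) * dy) / length_sq))
    return math.hypot(px - (ax + t * dx), py - (ay + t * dy))


def distance_field(strokes, size):
    """Unsigned distance from each pixel centre to the nearest stroke."""
    field = []
    for row in range(size):
        y = (row + 0.5) / size
        field.append([
            min(segment_distance((col + 0.5) / size, y, a, b) for a, b in strokes)
            for col in range(size)
        ])
    return field


def synthesize_variant(strokes, size=32, rng=None):
    """Jitter stroke endpoints, then threshold the field at a random stroke width."""
    rng = rng or random.Random()
    jitter = 0.03  # assumed maximum endpoint displacement
    jittered = [
        tuple((x + rng.uniform(-jitter, jitter), y + rng.uniform(-jitter, jitter))
              for x, y in stroke)
        for stroke in strokes
    ]
    half_width = rng.uniform(0.03, 0.06)  # assumed range of stroke half-widths
    field = distance_field(jittered, size)
    # Pixels closer to a stroke than half_width count as carved ("ink").
    return [[1 if d <= half_width else 0 for d in row] for row in field]


if __name__ == "__main__":
    variant = synthesize_variant(GLYPH_STROKES, size=24, rng=random.Random(0))
    print("\n".join("".join("#" if v else "." for v in row) for row in variant))
```

Each call with a different random seed yields a slightly different rendering of the same shape. In a fuller pipeline, variants like these would presumably be handed on as textures or reliefs for the rendering stage.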


Unknown language #19

Inscribed sandstone known as the "Singapore Stone", Singapore, 10th–14th century:


Collection of the National Museum of Singapore

(Source; also includes an animated photo that can be rotated 360° in any direction and enlarged or reduced to any size)


Unknown language #10, part 2

[This is a guest post by Martin Schwartz.]

"Unknown language #10" (12/1/17) left all stumped, including a broad range of superb scholars of many languages.  I have no Rosetta Stone for it, but have something that may be called a Russetta or Rusetta (as in ruse) Bone.

First, the mystery text, which was the focus of Language Log Unknown Language #10.  I reproduce it here as it was transmitted there:

Ukhant karapet qulkt kirlerek
Iqat ighun chapuq sireleq,
Poghtu Paghytei Piereleq
Azlayn qoghular eliut karapet.

Now, alongside the above, I give a verse found in Aleksandr Kuprin's Russian novel Jama ('The Pit'), 1909–1915:

U Karapeta est' bufet
Na bufete est' konfet,
Na konfete est' portret
Ètot samyj Karapet.

'Karapet has a buffet
On the buffet is a bonbon (vel sim.)
On the bonbon is a portrait,
It's the very same Karapet.'


Unknown language #18

[This is a guest post by John Mock.]

Query about inscription on crystal from Afghanistan.

Face 1 (actual and reverse):


Unknown language #17

Shared by Sup Gau in the Facebook group "Language Nerds":


Once again the Voynich manuscript

This is one of the most novel theories on the Voynich manuscript (Beinecke MS408; early 15th c.) that I've ever encountered, and there are many.

The Voynich Manuscript, Dr Johannes Hartlieb and the Encipherment of Women’s Secrets, by Keagan Brewer and Michelle L Lewis, Social History of Medicine, hkad099 (22 March 2024)

Keywords:  Voynich manuscript, Dr Johannes Hartlieb, women’s secrets, sex, gynaecology

A floral illustration on page 32


Unknown language #16

From Beverly Kahn:

Here's a puzzle that I hope you (or fellow linguists) might solve. My neighbor showed me a wood carving of what is likely an American Indian. It is dated 1907. On the back one finds markings that are like a language. Can you determine what the language is and perhaps what it says?


A Video Game Decoding Ancient Languages

Xinyi Ye, who sent this to me, thought the idea of multiple languages and the Tower of Babel in a game would be quite cliché, but this one is actually good.  You will be surprised at what you see and hear.

This is the official trailer:
 


AI (and human ingenuity) to the rescue

If you've ever had any doubt about the positive potential of AI for fundamental linguistic research of various types, here's a powerful example that will set your mind at rest.

"First passages of rolled-up Herculaneum scroll revealed:  Researchers used artificial intelligence to decipher the text of 2,000-year-old charred papyrus scripts, unveiling musings on music and capers."  By Jo Marchant, Nature (2/5/24).

doi: https://doi.org/10.1038/d41586-024-00346-8

With four striking illustrations, including a video and an animation, plus a separate related visual showing how the feat was accomplished.


Decryption of a difficult script

Photograph accompanying a New York Times article, with the following caption: "Merle Goldman explaining the Chinese characters for the word China":


(source)


Central Asian Kharosthi script on an ancient knife hilt found in Austria

Astonishing demonstration of East-West interaction during Roman times (with an equally mind-boggling demonstration of the occasional, yet horrendous [defying common sense], ineptitude of AI translation):

"Geheimnis um Messergriff aus dem römerzeitlichen Wels gelüftet"

Ein vor über 100 Jahren entdeckter Elfenbeingriff mit rätselhafter Inschrift aus dem antiken Ovilava gehörte wohl einst einem Besucher aus dem fernen Asien

"The mystery of the Roman period Wels knife handle revealed"

An ivory handle with a mysterious inscription from ancient Ovilava discovered more than 100 years ago probably once belonged to a visitor from distant Asia

Thomas Bergmayr, Der Standard (7/28/23)

Before presenting the remarkable findings reported in this important article, just a short prefatory note about the AI translation of the title.  Three of the main online multilingual neural machine translation services (Google Translate, Baidu Fanyi, and DeepL) mistranslated "Wels" (the eighth-largest city in Austria [ancient Ovilava]) as "catfish" (only Bing Translator got it right).  Given the object that we're dealing with, that is a genuinely bizarre rendering of the word, especially since the material of the handle is identified as ivory and the artifact as coming from Ovilava in the subtitle.  (It is all the more perplexing that three of the four services are consistent in making the same strange mistake [well, not so strange after all, since "Wels" really does mean catfish in German].)  Fortunately, the machine translators do a better job in the body of the article, where there is more context.

For the purposes of the rough translation of the German article, I have relied mainly on Google Translate, with occasional assistance from the other translation services, and some good old human input from my own brain.  Please bear in mind that the translations proffered below do not pretend to be polished, flawless English renderings of parts of the German article, but only to give a functionally useful idea of its content.

N.B.:  Two photographs of the knife handle are provided near the bottom of this post.


Kushan inscriptions from Western and Southern Central Asia (WCA, SCA)

The article I am calling to your attention in this post is of extraordinary importance for its potential to link together many of the themes we have repeatedly investigated over nearly the last two decades on Language Log (see the bibliography below for a sampling of relevant posts).

To make it easier for non-specialist readers, here are a few brief identifications of essential languages and peoples (all late Classical and early Medieval):

Bactrian (Αριαο, Aryao, [arjaː]) is an extinct Eastern Iranian language formerly spoken in the Central Asian region of Bactria (in present-day Afghanistan) and used as the official language of the Kushan and the Hephthalite empires.

The Kushan Empire (Ancient Greek: Βασιλεία Κοσσανῶν; Bactrian: Κοϸανο, Košano; Sanskrit: कुषाण वंश; Brahmi: Ku-ṣā-ṇa; BHS: Guṣāṇa-vaṃśa; Parthian: Kušan-xšaθr; Chinese: 貴霜; pinyin: Guìshuāng) was a syncretic empire, formed by the Yuezhi, in the Bactrian territories in the early 1st century. It spread to encompass much of what is now Uzbekistan, Afghanistan, Pakistan and Northern India, at least as far as Saketa and Sarnath near Varanasi (Benares), where inscriptions have been found dating to the era of the Kushan Emperor Kanishka the Great.
