Archive for Language and computers
February 10, 2021 @ 11:31 am· Filed by Victor Mair under Artificial intelligence, Language and computers, Lost in translation
This has been making the rounds:
1. Go to Google Translate.
2. Set the input language to Spanish.
3. Paste in "soy milk"
4. Set the output language to English or X language.
5. Hilarity ensues.
The obligatory screen shot:
Read the rest of this entry »
Permalink
January 17, 2021 @ 8:10 pm· Filed by Victor Mair under Language and computers, Lost in translation, Translation
This is all over the Chinese internet:
(source)
Read the rest of this entry »
Permalink
December 10, 2020 @ 8:23 pm· Filed by Victor Mair under Language and computers, Lost in translation, Translation
From Jeff DeMarco:
I’m sure you’ve seen the Facebook translation artifact where it repeats “and I’m going to go to the middle of the day.” This post does that and something similar with “of the 912th.” I keep advising Facebook that these are unintelligible, but they seem to be a low priority.
Read the rest of this entry »
Permalink
October 21, 2020 @ 11:00 pm· Filed by Victor Mair under Language and computers, Lost in translation
Jonathan Silk wonders how this mistranslation from Latin to Dutch in Google Translate occurred the same way in English:
Read the rest of this entry »
Permalink
October 18, 2020 @ 6:57 am· Filed by Victor Mair under Alphabets, Dictionaries, Information technology, Language and computers, Lexicon and lexicography
We just had a good discussion about a Sinitic language written with an alphabet:
"The look, feel, and sound of Dungan language" (10/15/20)
Under "Selected readings" below, there are listed additional earlier posts about writing Sinitic languages with Romanization.
One of the major advantages of the alphabet over a morphosyllabic / logographic ideopicto-phonetic writing system like the Sinographic script is that it is very easy to order and find / retrieve the entire lexicon with the former, whereas carrying out these tasks with the latter is toilsome at best and torturesome at worst. See:
Victor H. Mair, "The Need for an Alphabetically Arranged General Usage Dictionary of Mandarin Chinese: A Review Article of Some Recent Dictionaries and Current Lexicographical Projects", Sino-Platonic Papers, 1 (February, 1986), 1-31 pp.
Read the rest of this entry »
Permalink
September 17, 2020 @ 5:18 pm· Filed by Victor Mair under Errors, Language and computers, Miswriting, Speech technology
That's part of a message from one of my students. I knew right away what he meant, but — as always — I'm curious about what causes such off-the-wall typos. It can't be because of a spellchecker gone awry. So I asked the student, "What type of input system do you use? I'm trying to think about how that was produced."
He replied, "I use the bog-standard* American English input that Apple has. I think I missed the 'h' and it grabbed it from there? Maybe an additional incorrect letter?
[*This was the first time I encountered this expression, and I didn't know what it meant.]
I followed up:
just regular keyboard?
not on iPhone?
no shortcuts? swypes?
speech recognition input?
Read the rest of this entry »
Permalink
August 22, 2020 @ 9:40 pm· Filed by Victor Mair under Language and computers, Speech technology, Typography
E-mail message from my son, Thomas Krishna:
I'm using the voice recognizer to write you this message. When you do take your truck in for service at Toyota place, ask them if an exterior cleaning is included. Having visited you over the years I know that where you park a lot of tree debris falls onto your vehicles! This is no big deal, except for one thing, you don't want stuff to fall on top of your vents right in front of where the windshield is. I had this problem with my truck under the crepe myrtles at Lacey's house. For a while I tried using cardboard cutouts to cover them up but they did not last very well in the Sun and rain. I know that at your place things dropping off the trees is almost a continuous problem whereas for me it was only in the fall. So just thinking maybe you should try to find something that can cover those vents for when your truck is parked there.
Read the rest of this entry »
Permalink
April 20, 2020 @ 7:43 am· Filed by Victor Mair under Borrowing, Historical linguistics, Language and computers, Phonetics and phonology, Reconstructions
A fuller and more specific version of the title of this post would be "Chinese transcriptions of Indic terms in the translations of An Shigao (Chinese: 安世高; pinyin: Ān Shìgāo; Wade–Giles: An Shih-kao, Korean: An Sego, Japanese: An Seikō, Vietnamese: An Thế Cao) (fl. 148-180 CE) and Lokakṣema (लोकक्षेम, Chinese: 支婁迦讖; pinyin: Zhī Lóujiāchèn) (fl. 147-189)".
With the collaboration of Jan Nattier, Nathan Hill was able to digitize some data from Han Buddhist transcriptions back in 2017 and has now published them as a dataset on
Zenodo:
Hill, Nathan, Nattier, Jan, Granger, Kelsey, & Kollmeier, Florian. (2020). Chinese transcriptions of Indic terms in the translations of Ān Shìgāo 安世高 and Lokakṣema 支婁迦讖 [Data set]. Zenodo. http://doi.org/10.5281/zenodo.3757095
Read the rest of this entry »
Permalink
April 8, 2020 @ 1:52 pm· Filed by Victor Mair under Errors, Information technology, Language and computers, Miswriting, Phonetics and phonology, Typography
A little over a week ago, I described how I mistyped "stalk" for "stock". That led to a vigorous discussion of precisely how people pronounce "stalk". (As a matter of fact, in my own idiolect I do pronounce "stock" and "stalk" identically.) See:
"Take stalk of: thoughts on philology and Sinology" (3/29/20)
I just now typed "One I first saw…" when I meant "When I first saw…".
Read the rest of this entry »
Permalink
April 4, 2020 @ 7:29 am· Filed by Victor Mair under Artificial intelligence, Language and computers, Lost in translation
[This is a guest post by Cyrus Shaoul]
I am a long time LL reader and I came across an interesting machine translation error today.
When my Japanese friend sent me this sentence:
62年前のこの日に慶應義塾大学病院で命を授かりました。
I was flummoxed by the verb 授かる [VHM: sazukaru {"be gifted / endowed with (an award / title); to be blessed (e.g., with a child); be granted / taught; to be given something of great value / a treasure, by deities or someone of higher social class"}] at the end of the sentence, so I asked Google Translate for help and lo and behold, it said:
"On this day, 62 years ago, I died at Keio University Hospital."
Read the rest of this entry »
Permalink
February 22, 2020 @ 3:40 pm· Filed by Victor Mair under Language and computers, Lost in translation, Translation
From Jeff DeMarco:
I hit the translation button for this Facebook post and this is what I got!
Read the rest of this entry »
Permalink
February 9, 2020 @ 8:24 am· Filed by Victor Mair under Bilingualism, Borrowing, Humor, Language and computers, Words words words
From Nathan Hopson:
Can't believe I had never heard this marvelous Japanglish until now:
トップレス‐ミーティング(toppuresu mītingu = "topless meeting")or トップレス会議 (kaigi = meeting)
Read the rest of this entry »
Permalink
February 4, 2020 @ 5:47 pm· Filed by Victor Mair under Abbreviation, Borrowing, Language and computers, Neologisms, Writing systems
Message from Stoyan Gegovski:
I am editing parts of the "Xi'an Investment Guide" (every major city in China issues one of these every year) and I came upon an interesting use of the abbreviation "IP" which might interest you:
"Xīn shídài xīn Xī'ān xīn IP 新时代 新西安 新IP"
It is placed on the third page of the handbook, right after a short introduction of the city and a map of the ancient Silk Road.
I have never encountered such a use of "IP" and I find it quite interesting. The Graduate students tasked with the translation rendered it as "New Era, New Xi’an, New IP", which obviously does not truly represent its meaning. Apparently, even the Chinese are not too sure what it means, as they were also unable to define it.
Read the rest of this entry »
Permalink