Archive for Information technology

What it's like inside the Great Firewall

By now, we've had dozens of posts about the Great Firewall, VPNs, internet censorship, and so forth, but they're all from the vantage point of the outside trying to look in.  Of course, that gives us a skewed picture of what the situation is really like with regard to the internet inside and outside of the PRC.  This is not a healthy situation, for nearly one fifth of the world's population (17.72%) live inside the borders of China.  To be ignorant of how they are living is dangerous, for we may make erroneous assumptions about what one fifth of humanity is doing and thinking.

Fortunately, at last I have found an American expat who has been living and working in the PRC for more than a decade at a remote location and is well connected with many Chinese colleagues.  He is an active scholar and very well informed about the internet, AI, databases, and so forth, both inside and outside of the PRC.  I should note that he does not live among expats.  In fact, he is the only Westerner where he is located, quite far from major metropolitan areas, so he truly understands what Chinese of all walks of life do on a day-to-day basis.

Read the rest of this entry »


Fissures in the Great Firewall caused by X

Things are becoming dicey for the CCP/PRC regime:

"A cartoon cat has been vexing China’s censors – now he says they are on his tail"

By Tessa Wong, Asia Digital Reporter, BBC (6/10/24)

Here's the dilemma faced by the Chinese communist authorities.  It would be very easy for the censors to shut down all VPNs and impose draconian internet controls that would make it impossible for netizens to communicate with the outside internet.  But that would mean that China would no longer have access to external information and communication, which the government desperately needs if it is going to continue to acquire advanced technology and science from abroad, not to mention operate its economic initiatives such as the BRI (Belt and Road Initiative).

Read the rest of this entry »


We need libraries and we need computers

Both for the flow of and access to information.

More than a week ago, the Seattle Public Library system, a large and wonderful institution that thousands rely on every day, went offline after ransomware hackers attacked it.

"Why did ransomware hackers target Seattle Public Library?", GeekWire, by Taylor Soper (May 29, 2024)

This is an excellent article that explains why the criminals went after a library, how they carried out their dirty work, and what the authorities are doing to restore services.

Read the rest of this entry »


Flash mob / drive

Placed on the countertop of the coffee corner in the dining hall at Lingnan University in Hong Kong:

Read the rest of this entry »


Overall, why do Mandarin enrollments continue to decline?

This is a problem that has been troubling colleagues across the country.

"Why fewer university students are studying Mandarin"

Learning the difficult language does not seem as worthwhile as it once did

Economist (Aug 24th 2023)

China | How do you say “not interested”?

Ten years ago Mandarin, the mother tongue of most Chinese, was being hyped as the language of the future. In 2015 the administration of Barack Obama called for 1m primary- and secondary-school students in America to learn it by 2020. In 2016 Britain followed suit, encouraging kids to study “one of the most important languages for the UK’s future prosperity”. Elsewhere, too, there seemed to be a growing interest in Mandarin, as China’s influence and economic heft increased. So why, a decade later, does Mandarin-learning appear to have declined in many places?

Read the rest of this entry »


Data, information, knowledge, insight, wisdom, and Conspiracy Theory, part 2

From Phillip Remaker:

The one that claimed authorship clipped the edge of the unicorn tail.

The only version I have found that doesn't clip the edge of the unicorn tail is this one from farhan.

I don't know if that means I found the original or if the author touched it up. The page is not archived on the Internet Archive.

It seems consistent with his other art.

Read the rest of this entry »


Information Management and Library Science

Just out today, this is one of the longest book reviews I have ever written:

Jack W. Chen, Anatoly Detwyler, Xiao Liu, Christopher M. B. Nugent, and Bruce Rusk, eds., Literary Information in China:  A History (New York:  Columbia University Press, 2021).

Reviewed by Victor H. Mair

MCLC Resource Center Publication (Copyright September, 2022)

I am calling it to your attention because the book under review, which I will refer to here as LIIC, signals a sea change in:

1. Sinology
2. Information technology
3. Academic attitudes toward the study of language and literature

Read the rest of this entry »


Language is not script and script is not language, part 2

[This is a guest post by Paul Shore.]

The 2022 book Kingdom of Characters by Yale professor Jing Tsu is currently #51,777 in Amazon's sales ranking.  (The label "Best Seller" on the Amazon search-results listing for it incorporates the amusing mouseover qualification "in [the subject of] Unicode Encoding Standard".)  I haven't read the book yet:  the Arlington, Virginia library system's four copies have a wait list, and so I have a used copy coming to me in the mail.  What I have experienced, though, is a fifty-minute National Public Radio program from their podcast / broadcast series Throughline, entitled "The Characters That Built China", that's a partial summary of the material in the book, a summary that was made with major cooperation from Jing Tsu herself, with numerous recorded remarks by her alternating with remarks by the two hosts:  https://www.npr.org/podcasts/510333/throughline (scroll down to the May 26th episode).  Based on what's conveyed in this podcast / broadcast episode, I think many people on Language Log and elsewhere who care about fostering a proper understanding of human language among the general public might agree that that ranking of 51,777 is still several million too high.  But while the influence of the book's ill-informed, misleading statements about language was until a few days ago mostly confined to those individuals who'd taken the trouble to get hold of a copy of the book or had taken the trouble to listen to the Throughline episode as a podcast (it was presumably released as such on its official date of May 26th), with the recent broadcasting of the episode on NPR proper those nocive ideas have now been splashed out over the national airwaves.  And since NPR listeners typically have their ears "open like a greedy shark, to catch the tunings of a voice [supposedly] divine" (Keats), this program seems likely to inflict an unusually high amount of damage on public knowledge of linguistics.

Read the rest of this entry »


Google Translate is even better now, part 2

"Google Translate learns 24 new languages"
Isaac Caswell, Google blog (5/11/22)

==========

Illustrated green globe with the word "hello" translated into different languages.

For years, Google Translate has helped break down language barriers and connect communities all over the world. And we want to make this possible for even more people — especially those whose languages aren’t represented in most technology. So today we’ve added 24 languages to Translate, now supporting a total of 133 used around the globe.

Over 300 million people speak these newly added languages — like Mizo, used by around 800,000 people in the far northeast of India, and Lingala, used by over 45 million people across Central Africa. As part of this update, Indigenous languages of the Americas (Quechua, Guarani and Aymara) and an English dialect (Sierra Leonean Krio) have also been added to Translate for the first time.

Read the rest of this entry »


The weirdness of typing errors

In this age of typing on computers and other digital devices, when we daily input thousands upon thousands of words, we are often amazed at the number and types of mistakes we make.  Many of them are simple and straightforward, as when our fingers stumblingly hit the wrong keys by sheer accident.  People who type on phones often alert their correspondents that their messages are likely to contain such errors by including a warning like this at the bottom:

Please forgive spelling / grammatical errors; typed on glass // sent from my phone.

Read the rest of this entry »


Cambodian voice traffic

A Rest of World article from November that I missed when it first came out, but am posting on now because it speaks to the comments on several recent Language Log posts (e.g., here and here):

"Fifty percent of Facebook Messenger’s total voice traffic comes from Cambodia. Here’s why:

Keyboards weren't designed for Khmer. So Cambodians have just decided to ignore them", By Vittoria Elliott and Bopha Phorn (12 November 2021)

The first four paragraphs of this longish article:

In 2018, the team at Facebook had a puzzle on their hands. Cambodian users accounted for nearly 50% of all global traffic for Messenger’s voice function, but no one at the company knew why, according to documents released by whistleblower Frances Haugen.

One employee suggested running a survey, according to internal documents viewed by Rest of World. Did it have to do with low literacy levels? they wondered. In 2020, a Facebook study attempted to ask users in countries with high audio use, but was only able to find a single Cambodian respondent, the same documents showed. The mystery, it seemed, stayed unsolved.

The answer, surprisingly, has less to do with Facebook, and more to do with the complexity of the Khmer language, and the way users adapt for a technology that was never designed with them in mind.

Read the rest of this entry »


Pen scanner

New product:

With the Scanmarker Air no more Retyping- Simply Scanning!

Scan any text in a document or book and it's instantly available on your PC/Mac in any program including Word, Google Docs, Evernote and more. You can also use it on your smartphone/tablet with our app.

  • Super Easy to use
  • Scanmarker Air is 30 times faster than manual retyping
  • Scans up to 3,000 characters a minute and will save hours of tedious work
  • Can read aloud any scanned text
  • Instant translation to over 70 languages- including reading the translation aloud!

Read the rest of this entry »


Language is not script and script is not language

Trying to clear up the confusion between the two is a battle we have been waging for decades, and nowhere is the problem more severe than in the study of Sinitic languages and the Sinographic script.  The crisis (not a "danger + opportunity"!) has come to the surface again this month with the appearance of a new book by Jing Tsu titled Kingdom of Characters: The Language Revolution That Made China Modern (Riverhead Books, 2022).

The publication of Tsu's book has generated a lot of excitement, publicity, and reviews.  Here I would like to call attention to the brief remarks of an anonymous correspondent (a famous, reclusive linguist) that are right on target:

Reimagining "antiquated" Chinese

Reproduced below is the text of a book review in Science that you may not have seen. It is classified as "Linguistics", though the reviewer is a historian at Cal State Poly, Pomona. Notice that Chinese is assumed to be "antiquated" and in need of being "reimagined"!  There is simply no sign of Science understanding the difference between a human language and a writing system. This is consistent with the way they have always treated linguistics; they have no idea what the subject really is.

Read the rest of this entry »
