Language Log

Archive for Computational linguistics

Hack of the year: 1980

May 21, 2023 @ 10:00 am· Filed by Mark Liberman under Computational linguistics

I recently stumbled on this 5/10/2023 Medium article by David Brock, "A Backup of Historical Proportions" — which reminded me of the Xerox Palo Alto Research Center ("PARC") and the Xerox Alto. Those were the people and the machine that invented interactive GUIs on bit-mapped displays, the computer mouse, and so on — though it took Steve Jobs to "borrow" the ideas and turn them into a social (and business) success.

But as a speech person, I always thought it was odd and unfortunate that the Alto had no provision for audio input or output — and I was impressed by the hack that Henry Thompson used to get around the audio output problem for his 1980 Berkeley thesis, "Stress and Salience in English: Theory and Practice".

Read the rest of this entry »

Permalink Comments (11)

AI Anchorman "@EdisonGPT"

May 15, 2023 @ 6:52 am· Filed by Mark Liberman under Computational linguistics, Language and politics

The future of news?

🚨 BREAKING!!! Just when we thought we'd defeated the #CensorshipIndustrialComplex and @TuckerCarlson rose again on @Twitter, @elonmusk appoints #LindaYaccarino as new #TwitterCEO.

Can an Executive Chair from the #WEF lead the revolution?#BreakingNews #EdisonThrustwell #WTF pic.twitter.com/uHt7NjCiO6

— Edison Thrustwell (@EdisonGPT) May 12, 2023

Read the rest of this entry »

Permalink Comments (12)

"The age of Socratic AI"?

April 30, 2023 @ 6:20 pm· Filed by Mark Liberman under Computational linguistics, Language and politics

Or should we call it "Delphic AI"?

Alexy Khrabrov suggested both possibilities a few days ago, in "Reasonable AI — the Golden Age of AI Programming":

The emerging techniques are all around the way you construct the prompts and also chain them. Effectively, we’re plotting dialogues.

I call it the Age of Socratic AI, or Reasonable AI. We are engaging in conversations with AI that elicit meaning. We make the most basic assumption that it has the information we need and can provide it in the form we need, e.g. as an explanation or a how-to plan of action. We consider it an imperfect oracle that has to be assuaged, and asked questions in very specific ways to get the reply we need.

Read the rest of this entry »

Permalink Comments (3)

The perils of AI (Artificial Intelligence) in the PRC

April 17, 2023 @ 6:49 am· Filed by Victor Mair under Artificial intelligence, Computational linguistics, Language and politics

Here at Language Log, for the last couple months, we've been having long, intense discussions about ChatGPT and other AI chatbots and LLM (Large Language Model) applications. Now, it seems that the battle over such AI programs has reached the level of ideological warfare.

"America, China and a Crisis of Trust"

Opinion | The New York Times (4/14/23)

Indeed, a story making the rounds in Beijing is that many Chinese have begun using ChatGPT to do their ideology homework for the local Communist Party cell, so they don’t have to waste time on it.

I have some evidence that this might well be true. Already about half-a-dozen years ago, my M.A. students from the PRC whose parents were CCP members told me that the government required daily interaction with the propaganda installed on their phones — upon pain of being demoted or dismissed. They had to read a specified amount of Xi-speak and answer questions about the content. This demanded a serious investment of time (hours). It was considered to be especially onerous for those CCP members whose day jobs (doctors, bureaucrats, stock brokers, etc., etc.) already demanded a very full work schedule in the office. So many, if not most of them, hired various human and electronic services to meet the obligations.

Read the rest of this entry »

Permalink Comments (12)

The hand of GOD GPT

April 16, 2023 @ 7:25 am· Filed by Mark Liberman under Computational linguistics, Language and religion

A VentureBeat story by Michael Kerner, "Cohere expands enterprise LLM efforts with LivePerson partnership" (4/11/2023), leads with this image:

…memetically referencing a widely-reproduced detail from Michelangelo's Sistine Chapel fresco Creazione di Adamo:

Read the rest of this entry »

Permalink Comments (5)

Hallucinations: In Xanadu did LLMs vainly fancify

April 3, 2023 @ 10:55 am· Filed by Victor Mair under Artificial languages, Computational linguistics, Metaphors, Semiotics

Bill Benzon has been our most prolific humanistic commentator about GPTs, almost as prolific as GPTs themselves. Here he introduces his latest creation in / on the genre:

"From 'Kubla Khan' through GPT and beyond", 3 Quarks Daily (3/27/23)

In a covering note to me, Bill writes:

A story about how I came to be interested in GPTs. It’s also implicitly a critique of the large language model business. You have a bunch of very smart and clever people creating engines that pump out language by the bucketful, but who seem to have little interest in or knowledge about language itself, much less linguistics, psycholinguistics, or the various cognitive sciences. It’s crazy. But the machines they’re producing are marvelous and fascinating.

Read the rest of this entry »

Permalink Comments (22)

Pablumese

March 22, 2023 @ 10:33 pm· Filed by Victor Mair under Artificial intelligence, Computational linguistics, Style and register

Knowing how much I like to invent terms for things that have no name ("topolect", "character amnesia", etc.), and needing a word for the parlance produced by ChatGPT-4 and kindred AI chatbots, Conal Boyce asked me to coin a term for it. I instantly obliged him by coming up with "pablumese" to designate the sort of language that is unremittingly neutral and takes no stance on any subject or topic it addresses.

Conal liked my invention and responded:

Here's one of the problems with ChatGPT and its brethren: Not only does it spew what Victor calls 'pablumese' but for technical questions it then mixes its pablumese with quantitative nonsense, creating a truly creepy kind of output.

I was curious to see how it would handle the question of how many copper atoms fit into the cross-section of a typical copper wire. It responded in a way that made it sound very knowledgeable, breaking everything down into tiny (sometimes condescending) steps, and yet, at the very end of its perfect logic, it botched its answer, because it was unable to do a conversion between millimeters and picometers correctly.

But here's the kicker: What makes this stuff maximally odious is that the creeps who design it will succeed in taking over the world anyway, because this week "version 4 is astonishingly better than the beta ChatGPT!!!" and version 5 next week will be astonishingly better than…. etc. etc. until they've improved it enough that it really will threaten the jobs of 3/4 of the human race. It must be an absolutely sickening time to be a young person, trying to plan one's career.

Read the rest of this entry »

Permalink Comments (25)

The mind of artificial intelligence

March 22, 2023 @ 9:39 pm· Filed by Victor Mair under Artificial intelligence, Computational linguistics

Sean Carroll's Preposterous Universe Podcast #230

Raphaël Millière on How Artificial Intelligence Thinks, March 20, 2023 / Philosophy, Technology, Thinking / Comments

Includes transcript of the two hour podcast.

Welcome to another episode of Sean Carroll's Mindscape. Today, we're joined by Raphaël Millière, a philosopher and cognitive scientist at Columbia University. We'll be exploring the fascinating topic of how artificial intelligence thinks and processes information. As AI becomes increasingly prevalent in our daily lives, it's important to understand the mechanisms behind its decision-making processes. What are the algorithms and models that underpin AI, and how do they differ from human thought processes? How do machines learn from data, and what are the limitations of this learning? These are just some of the questions we'll be exploring in this episode. Raphaël will be sharing insights from his work in cognitive science, and discussing the latest developments in this rapidly evolving field. So join us as we dive into the mind of artificial intelligence and explore how it thinks.

[The above introduction was artificially generated by ChatGPT.]

Read the rest of this entry »

Permalink Comments (6)

ChatGPT-4: threat or boon to the Great Firewall?

March 21, 2023 @ 7:57 pm· Filed by Victor Mair under Artificial intelligence, Censorship, Computational linguistics

"The practical value of LLMs is high enough that it will induce Chinese to seek out the best systems, and they will not be censored by China.”

"Yes, the Chinese Great Firewall will be collapsing"

by Tyler Cowen Marginal Revolution (March 21, 2023)

Something that the PRC censors had not predicted:

As framed from China:

Fang Bingxing, considered the father of China’s Great Firewall, has raised concerns over GPT-4, warning that it could lead to an “information cocoon” as the generative artificial intelligence (AI) service can provide answers to everything.

Fang said the rise of generative AI tools like ChatGPT, developed by Microsoft-backed OpenAI and now released as the more powerful ChatGPT-4 version, pose a big challenge to governments around the world, according to an interview published on Thursday by Red Star News, a media affiliate to state-backed Chengdu Economic Daily.

“People’s perspectives can be manipulated as they seek all kinds of answers from AI,” he was quoted as saying.

Fang, a computer scientist and former government official, is widely considered the chief designer of China’s notorious internet censorship and surveillance system. He played a key role in creating and developing the Great Firewall, a sophisticated system of internet filters and blocks that allows the Chinese government to control what its citizens can access online.

Read the rest of this entry »

Permalink Comments (4)

This is the 4th time I've gotten Jack and his beanstalk

March 15, 2023 @ 7:45 pm· Filed by Victor Mair under Artificial intelligence, Computational linguistics, Writing

Bill Benzon shares the response he got from ChatGPT to the prompt, "Tell me a story."

Read the rest of this entry »

Permalink Comments (30)

ChatGPT writes VHM

February 28, 2023 @ 5:47 am· Filed by Victor Mair under Artificial intelligence, Computational linguistics, Writing systems

[This is a guest post by ChatGPT]

Write a Language Log blog post by Victor H. Mair

Title: "The Wonders of the Chinese Language"

As one of the world's oldest and most complex languages, Chinese has captivated linguists and language enthusiasts for centuries. From its unique writing system to its vast vocabulary and intricate grammar, Chinese is a fascinating and challenging language that continues to surprise and delight those who study it.

Perhaps the most distinctive feature of Chinese is its writing system. Chinese characters are visual symbols that convey meaning, and they are used in both written and spoken language. Unlike the phonetic alphabets used in many other languages, Chinese characters do not correspond to specific sounds, but rather to concepts or ideas. This makes the learning process much more complex, as students must memorize thousands of characters and learn their meanings and pronunciations.

Read the rest of this entry »

Permalink Comments (22)

Vignettes of quality data impoverishment in the world of PRC AI

February 23, 2023 @ 8:00 am· Filed by Victor Mair under Artificial intelligence, Computational linguistics, Data bases

Some snippets:

Limited data sets a hurdle as China plays catch-up to ChatGPT

Lack of high-quality Chinese texts on Internet a barrier to training AI models.

Ryan McMorrow, Nian Liu, Eleanor Olcott, and Madhumita Murgia, FT, Ars Technica (2/21/23)

…

Baidu struggled with its previous attempt at a chatbot, known as Plato, which analysts said could not even answer a simple question such as: “When is Alibaba co-founder Jack Ma’s birthday?”

Analysts point to the lack of high-quality Chinese-language text on the Internet and in other data sets as a barrier for training AI software.

GPT, the program underlying ChatGPT, sucked in hundreds of thousands of English academic papers, news articles, books, and social media posts to learn the patterns that form language. Meanwhile, Baidu’s Ernie has been trained primarily on Chinese-language data as well as English-language data from Wikipedia and Reddit.

…

Read the rest of this entry »

Permalink Comments (11)

Uh-oh! DeepL in the classroom; it's already here

February 22, 2023 @ 1:22 pm· Filed by Victor Mair under Artificial intelligence, Computational linguistics

Yesterday in my Classical Chinese class, we were reading Ouyang Xiu's (1007-1072) "Discussion on 'Biographies of Eunuchs'" in the New History of the Five Dynasties (written 1036-1039, published 1072). Here's the relevant passage:

Móu zhī ér bùkě wéi. Wéi zhī ér bùkě chéng. Zhì qí shén zé jù shāng ér liǎng bài. ——“Xīn wǔdài shǐ huàn zhě chuán lùn”

謀之而不可為。為之而不可成。至其甚則俱傷而兩敗。 ——《新五代史宦者傳論》

[Because of the special circumstances of this post, I will not adhere to my usual custom of providing Pinyin Romanization, Hanzi transcription, and English translation all three together.]

Read the rest of this entry »

Permalink Comments (8)

« Previous Page — « Previous Entries

Next Entries » — Next Page »

Archive for Computational linguistics

Hack of the year: 1980

AI Anchorman "@EdisonGPT"

"The age of Socratic AI"?

The perils of AI (Artificial Intelligence) in the PRC

The hand of GOD GPT

Hallucinations: In Xanadu did LLMs vainly fancify

Pablumese

The mind of artificial intelligence

ChatGPT-4: threat or boon to the Great Firewall?

This is the 4th time I've gotten Jack and his beanstalk

ChatGPT writes VHM

Vignettes of quality data impoverishment in the world of PRC AI

Uh-oh! DeepL in the classroom; it's already here

Follow us on Twitter

Archives [+/–]

Blogroll [+/–]

Meta