Language Log

Archive for Linguistic history

Monosyllabism

May 7, 2025 @ 11:46 am· Filed by Mark Liberman under Linguistic history

Ever since I learned a bit of Vietnamese in 1970, I've been curious about the apparent areal feature of monosyllabism in southeast Asia. I did some poking around on Google Scholar yesterday, and came across something that's definitely worth following up on.

Read the rest of this entry »

Permalink Comments (23)

Philology vs. linguistics

March 14, 2025 @ 6:07 am· Filed by Victor Mair under Linguistic history, Linguistics as a discipline, Philology

Linguistics is a relatively young discipline, formally dating from roughly the mid-19th century. In the study of language, it was preceded by philology, which has hoary roots going all the way back to Pāṇini (520-460 BC) and beyond.

In my own lifetime, until recently I preferred to identify myself as a philologist, but that met with too many dumb stares, so I gave up on that. Now, however, I find that there is a World Philology Union to carry the torch for this venerable profession, so perhaps there's hope for reviving my lost lifework after all.

From the WPU's website:

The World Philology Union (WPU) was founded on 2 December 2021 in Oslo, Norway. The WPU is an international association whose purpose is to promote philology worldwide, in research, education, society and culture.

The first General Assembly of the WPU was held in Rome, 15 December 2022. At the same occasion, the first WPU conference was held, 14–16 December, hosted by the Sapienza University of Rome and ISMEO – The International Association of Mediterranean and Oriental Studies. This conference discussed the current state of philology at universities and other academic institutions worldwide.

…

Read the rest of this entry »

Permalink Comments (23)

Reflections on Alan Lomax and Bill Labov

December 31, 2024 @ 4:49 pm· Filed by Mark Liberman under Linguistic history

Below is a guest post by Corey Miller.

Alan Lomax was brought back to my mind through his appearance in this year’s holiday film A Complete Unknown, which is centered on Bob Dylan. I, a most unmusical linguist, wasn’t sure why the name rang a bell; my first thought was that he was (someone like) Milman Parry or Albert Lord, people who were interested in finding vestiges of the Homeric tradition in modern southeastern Europe. His portrait in the film is most unflattering (in contrast to the angelic Pete Seeger or a mute Woody Guthrie), culminating in a fistfight.

Read the rest of this entry »

Permalink Comments (6)

The evolution of verbal interpolations

July 6, 2024 @ 11:09 am· Filed by Mark Liberman under Fillers and pause words, Linguistic history

Philip Castle, "Quelles sont les expressions les plus utilisées dans la langue française courante?", Quora 6/20/2024:

On va commencer par voilà. O-bli-ga-toi-re ! Il faut parsemer votre discours de "voilà", sans trop vous préoccuper de leur place ni de leur utilité dans la phrase, bien au contraire. Exemple : "Je me suis dit que voilà ce serait bien de voilà faire des efforts pour voilà améliorer mon français". Il faut aussi garder à l'esprit que ce mot merveilleux peut tout remplacer, y compris une fin de phrase. Exemple entendu ce matin sur France Inter : "En fait, le SMIC à 1600 €, je suis patron alors voilà". Vous avez compris le principe, il n'est pas nécessaire de terminer votre phrase, votre interlocuteur la finira lui même en remplaçant le voilà par ce qu'il veut.

We'll start with "voilà". O-bli-ga-to-ry! You need to sprinkle your speech with (instances of) "voilà", without worrying much about their place or their use in the phrase, in fact the opposite. Example: "Je me suis dit que voilà ce serait bien de voilà faire des efforts pour voilà améliorer mon français". You also need to keep in mind that this marvelous word can replace anything, including the end of a phrase. An example heard this morning on France Inter: "En fait, le SMIC à 1600 €, je suis patron alors voilà". You've understood the principle, it's not necessary to end your phrase, your interlocutors will finish it for themselves, replacing the "voilà" with whatever they like.

Read the rest of this entry »

Permalink Comments (4)

Le Nouchi

December 17, 2023 @ 8:13 am· Filed by Mark Liberman under Language contact, Linguistic history

Elian Peltier, "How Africans Are Changing French — One Joke, Rap and Book at a Time", NYT 12/12/2023:

French, by most estimates the world’s fifth most spoken language, is changing — perhaps not in the gilded hallways of the institution in Paris that publishes its official dictionary, but on a rooftop in Abidjan, the largest city in Ivory Coast.

There one afternoon, a 19-year-old rapper who goes by the stage name “Marla” rehearsed her upcoming show, surrounded by friends and empty soda bottles. Her words were mostly French, but the Ivorian slang and English words that she mixed in made a new language.

To speak only French, “c’est zogo” — “it’s uncool,” said Marla, whose real name is Mariam Dosso, combining a French word with Ivorian slang. But playing with words and languages, she said, is “choco,” an abbreviation for chocolate meaning “sweet” or “stylish.”

A growing number of words and expressions from Africa are now infusing the French language, spurred by booming populations of young people in West and Central Africa.

Read the rest of this entry »

Permalink Comments (12)

Whorf invents generative phonology?

November 26, 2023 @ 6:15 pm· Filed by Mark Liberman under Linguistic history, Linguistics as a discipline

After stumbling on Benjamin Lee Whorf's affiliation with the Theosophical Society, I read two articles that he contributed to the MIT Technology Review in 1940: "Science and Linguistics" in the April issue, and "Linguistics as an Exact Science" in the December issue. Something in the second article surprised me.

Whorf gives a formal account of English syllable structure in terms of what he calls "pattern symbolics", presenting the term and a sketch of the associated formalism as if they were standard linguistic theory, like "Maxwell's equations" in physics. But I've never heard the phrase "pattern symbolics" before, and web search turns up no examples other than this article. And the formalism seems similarly idiosyncratic.

Read the rest of this entry »

Permalink Comments (12)

Radial dendrograms

July 26, 2023 @ 2:53 pm· Filed by Mark Liberman under Computational linguistics, Linguistic history

From Sarah Gao and Andrew Gao, "On the Origin of LLMs: An Evolutionary Tree and Graph for 15,821 Large Language Models", arxiv.org 7/19/2023:

That's not a vinyl — it's a "radial dendrogram" — showing the evolutionary tree of nearly 6,000 Large Language Models posted at Hugging Face. Zeroing in on one quadrant, so you can read the labels:

Read the rest of this entry »

Permalink Comments (2)

The Origin of Speeches? or just the collapse of Uruk?

June 23, 2023 @ 12:16 pm· Filed by Mark Liberman under Linguistic history

I've wondered for a long time why Biblical inerrantists have a big problem with biological evolution, which contradicts Chapter 1 of Genesis, but not so much with historical linguistics, which contradicts Chapter 11.

But in "Linguistic Confusion and the Tower of Babel", National Catholic Register 6/21/2023, Dave Armstrong argues that the usual interpretation of the Tower of Babel story is simply a mistake, due to a bad job of sense disambiguation:

[T]he Hebrew word for “earth” (eretz) can mean many things, including the entire world (e.g., Genesis 1:1, 15; 2:1, 4), but also things like the “land” or “ground” of countries, such as Egypt (eretz mitzrayim) and Canaan (eretz kana’an), the dry land (Genesis 1:10), and ground from which seeds grow (Genesis 1:12). The New American Standard Bible translates eretz: country or countries 59 times, ground 119 times, land 1638 times; compare to earth, 656 instances, and world (3).

Read the rest of this entry »

Permalink Comments (22)

"On Dialogic Speech"

February 6, 2023 @ 8:59 am· Filed by Mark Liberman under Linguistic history

Thanks to yesterday's post on "Linguistic Laws", I spent a few minutes looking into the life and works of the Russian linguist Lav Jakubinskiy (or Lev Yakubinsky, or whatever transliteration you prefer). I don't think I've heard of him before — but a couple of things (and not Jakubinskiy's Law) convinced me that I should have. The main thing was what I learned about his 1923 work О диалогической речи ("On Dialogic Speech"). I haven't been able to find any online scans of the Russian original, but there's a 1997 PMLA article by Michael Eskin that offers some translated fragments along with a "Translator's Introduction", and a 2016 book, also due to Eskin, that offers a larger translated sample.

Read the rest of this entry »

Permalink Comments (8)

Inaugural embedding depth

November 29, 2022 @ 8:30 am· Filed by Mark Liberman under Linguistic history, Style and register

Following up on yesterday's "Embedding depth" post, I've done the same analysis to the 62 Inaugural Addresses of U.S. presidents. (Actually, 61 of them — I had to omit John Adams' 1797 address, because its 35th sentence is 797 words long, which made the standard version of the Berkeley Neural Parser break down in tears…)

Read the rest of this entry »

Permalink Comments (8)

Embedding depth

November 28, 2022 @ 9:02 am· Filed by Mark Liberman under Linguistic history, Style and register

In "Trends" (3/27/2022) I compared the distributions of sentence lengths in Ernest Hemingway's A Moveable Feast and Ursula K. Le Guin's The Wave in the Mind. The background, and some of the conclusions, can be found in the slides for my SHEL12 presentation. Hemingway is known for his short and simple sentences — see e.g. "Homo Hemingwayensis", 1/9/2005, for some discussion — but as I showed, his average sentence length is actually a bit on the long side for his time. And his overall distribution of sentence lengths is essentially identical that found in (later) work by Ursula K. Le Guin, despite her hilarious discussion of an alleged difference in her 1992 essay "Introducing Myself":

Read the rest of this entry »

Permalink Comments (9)

The mysterious Yale Burma embarrassment

August 22, 2022 @ 8:19 pm· Filed by Mark Liberman under Linguistic history

Ben Zimmer just sent an update to a thread that started with a series of posts on the mobilization of American linguists during WWII:

"A tale of two societies", 3/1/2007
"Linguistics in 1940", 3/11/2007
"The Intensive Language Program", 3/20/2007
"The Chinese episode", 3/21/2007
"The Burmese Story", 3/22/2007

J. Milton Cowan's account of the Burmese Story (from American Linguistics in Peace and at War) ends with the following passage:

Things went well for about a month then one day Franklin Edgerton turned up in our office looking very embarrassed. He said that Alamon had not been entirely frank about his sources of income, and although he rather enjoyed the atmosphere at Yale and Spotty was happy and well-adjusted, he was losing money on the deal. It seems he had been running a little numbers racket in lower Manhattan. Our work was so far along and the problem of getting a replacement so great that we finally settled for doubling his salary. The unwritten history of Burmese linguistics is loaded. Alamon's successor, the other Burmese-sounding name on the Roster, gave rise to an embarrassment of the Yale linguists and the University which was as funny to outsiders as it was painful for those involved. But enough for Burmese.

Read the rest of this entry »

Permalink Comments (12)

Trends

March 27, 2022 @ 2:21 pm· Filed by Mark Liberman under Language and gender, Linguistic history, Style and register

About six weeks from now, I'm scheduled to give a (virtual) talk with the (provisional) title "Historical trends in English sentence length and syntactic complexity". The (provisional) abstract:

It's easy to perceive clear historical trends in the length of sentences and the depth of clausal embedding in published English text. And those perceptions can easily be verified quantitatively. Or can they? Perhaps the title should be "Historical trends in English punctuation practices", or "Historical trends in English conjunctions and discourse markers." The answer depends on several prior questions: What is a sentence? What is the boundary between syntactic structure and discourse structure? How is message structure encoded in speech (spontaneous or rehearsed) versus in text? This presentation will survey the issues, look at some data, and suggest some answers — or at least some fruitful directions for future work.

So I've started the "look at some data" part, so far mostly by extending some of the many relevant earlier LLOG Breakfast Experiment™ explorations, such as "Inaugural embedding", 9/9/2005, or "Real trends in word and sentence length", 10/31/2011, or "More Flesch-Kincaid grade-level nonsense", 10/23/2015.

In most cases, the extensions just provide more data to support the ideas in the earlier posts. But sometimes, further investigation turns up some twists.

Read the rest of this entry »

Permalink Comments (15)

« Previous Entries

Archive for Linguistic history

Monosyllabism

Philology vs. linguistics

Reflections on Alan Lomax and Bill Labov

The evolution of verbal interpolations

Le Nouchi

Whorf invents generative phonology?

Radial dendrograms

The Origin of Speeches? or just the collapse of Uruk?

"On Dialogic Speech"

Inaugural embedding depth

Embedding depth

The mysterious Yale Burma embarrassment

Trends

Follow us on Twitter

Archives [+/–]

Blogroll [+/–]

Meta