Archive for Linguistic history

"On Dialogic Speech"

Thanks to yesterday's post on "Linguistic Laws", I spent a few minutes looking into the life and works of the Russian linguist Lav Jakubinskiy (or Lev Yakubinsky, or whatever transliteration you prefer). I don't think I've heard of him before — but a couple of things (and not Jakubinskiy's Law) convinced me that I should have. The main thing was what I learned about his 1923 work О диалогической речи ("On Dialogic Speech"). I haven't been able to find any online scans of the Russian original, but there's a 1997 PMLA article by Michael Eskin that offers some translated fragments along with a "Translator's Introduction", and a 2016 book, also due to Eskin, that offers a larger translated sample.

Read the rest of this entry »

Comments (8)

Inaugural embedding depth

Following up on yesterday's "Embedding depth" post, I've done the same analysis to the 62 Inaugural Addresses of U.S. presidents. (Actually, 61 of them — I had to omit John Adams' 1797 address, because its 35th sentence is 797 words long, which made the standard version of the Berkeley Neural Parser break down in tears…)

Read the rest of this entry »

Comments (8)

Embedding depth

In "Trends" (3/27/2022) I compared the distributions of sentence lengths in Ernest Hemingway's A Moveable Feast and Ursula K. Le Guin's The Wave in the Mind. The background, and some of the conclusions, can be found in the slides for my SHEL12 presentation. Hemingway is known for his short and simple sentences — see e.g. "Homo Hemingwayensis", 1/9/2005, for some discussion — but as I showed, his average sentence length is actually a bit on the long side for his time. And his overall distribution of sentence lengths is essentially identical that found in (later) work by Ursula K. Le Guin, despite her hilarious discussion of an alleged difference in her 1992 essay "Introducing Myself":

Read the rest of this entry »

Comments (9)

The mysterious Yale Burma embarrassment

Ben Zimmer just sent an update to a thread that started with a series of posts on the mobilization of American linguists during WWII:

"A tale of two societies", 3/1/2007
"Linguistics in 1940", 3/11/2007
"The Intensive Language Program", 3/20/2007
"The Chinese episode", 3/21/2007
"The Burmese Story", 3/22/2007

 J. Milton Cowan's account of the Burmese Story (from American Linguistics in Peace and at War) ends with the following passage:

Things went well for about a month then one day Franklin Edgerton turned up in our office looking very embarrassed. He said that Alamon had not been entirely frank about his sources of income, and although he rather enjoyed the atmosphere at Yale and Spotty was happy and well-adjusted, he was losing money on the deal. It seems he had been running a little numbers racket in lower Manhattan. Our work was so far along and the problem of getting a replacement so great that we finally settled for doubling his salary. The unwritten history of Burmese linguistics is loaded. Alamon's successor, the other Burmese-sounding name on the Roster, gave rise to an embarrassment of the Yale linguists and the University which was as funny to outsiders as it was painful for those involved. But enough for Burmese.

Read the rest of this entry »

Comments (12)


About six weeks from now, I'm scheduled to give a (virtual) talk with the (provisional) title "Historical trends in English sentence length and syntactic complexity". The (provisional) abstract:

It's easy to perceive clear historical trends in the length of sentences and the depth of clausal embedding in published English text. And those perceptions can easily be verified quantitatively. Or can they? Perhaps the title should be "Historical trends in English punctuation practices", or "Historical trends in English conjunctions and discourse markers." The answer depends on several prior questions: What is a sentence? What is the boundary between syntactic structure and discourse structure? How is message structure encoded in speech (spontaneous or rehearsed) versus in text? This presentation will survey the issues, look at some data, and suggest some answers — or at least some fruitful directions for future work.

So I've started the "look at some data" part, so far mostly by extending some of the many relevant earlier LLOG Breakfast Experiment™ explorations, such as "Inaugural embedding", 9/9/2005, or  "Real trends in word and sentence length", 10/31/2011, or "More Flesch-Kincaid grade-level nonsense", 10/23/2015. 

In most cases, the extensions just provide more data to support the ideas in the earlier posts. But sometimes, further investigation turns up some twists.

Read the rest of this entry »

Comments (15)

Henry Lee Smith Jr.

Amazingly, it appears that Henry Lee Smith Jr. has no Wikipedia page, despite a notable career in science, public service, and the media. According to his 1972 NYT obituary:

In 1940, when Dr. Smith was 27 and a member of the Department of English at Brown University, he came to public attention on the radio program, “Where Are You From?” over WOR. He selected people from a studio audience, listened to them talk and told them where they came from. He was right in four out of five tries.

For more about that radio program, see "Dr. Smith", The New Yorker 11/22/1940 (page image here), or "Radio: Where Are You From?", Time Magazine 5/6/1940.

According to a "Flashback" by the UB Reporter ("55 Years Ago: Henry Lee Smith, Linguist", 10/27/2011):

After receiving his PhD from Princeton and lecturing at Barnard, Columbia, and Brown, Smith headed the Language Section, Information and Education Division of the U.S. Army from 1942 to 1946.

Prior to the war, there were no foreign language materials for the bulk of the military and civilian personnel, and Smith, along with linguists he recruited, produced language guides, phrase books and military and general-purpose dictionaries in many different languages. Under Smith’s direction, the linguists also developed what came to be known as the Army method of language instruction—later adopted by colleges and universities—emphasizing the use of phonograph records on which a native speaker recited the foreign words and allowed a pause for repetition by the student.

Smith founded the State Department’s School of Language and Linguistics in 1946, and served as the school’s director prior to coming to UB.

For more about the role of linguists in (what became) the Defense Language Institute, see "A tale of two societies" (3/1/2007) and "Linguistics in 1940" (3/11/2007).

My personal exposure to Smith's work was through the influential 1951 monograph that we used to call "Trager Smith"  — I remember being struck by how many of the examples in Chomsky & Halle's 1968 The Sound Pattern of English were reproduced exactly from that source. (A link to a .pdf, courtesy of the Internet Archive, is here.)

Read the rest of this entry »

Comments (10)


Making coffee this morning made me think about brewing — not the process, but the English verb brew and its semantic evolution. In particular, it made me wonder again about nativist versions of semantic atomism, which hold that word meanings are (perhaps structured) collections of innate atomic features. Versions of these ideas go back thousands of years, but their most prominent recent exponent was Jerry Fodor.

The Internet Encyclopedia of Philosophy's article puts it this way:

Fodor was also a staunch defender of nativism about the structure and contents of the human mind, arguing against a variety of empiricist theories and famously arguing that all lexical concepts are innate. Fodor vigorously argued against all versions of conceptual role semantics in philosophy and psychology, and articulated an alternative view he calls “informational atomism,” according to which lexical concepts are unstructured “atoms” that have their content in virtue of standing in certain external, “informational” relations to entities in the environment.

Read the rest of this entry »

Comments (35)

Interfaces and Interactions

Going through a box of papers from years ago, I found one of Sally Thomason's famous doodles:

I've set it aside to be framed and hung, facing the Haida frog that was a gift a decade earlier from Nicola Bessel.

Read the rest of this entry »

Comments (14)

LLOG image search

Where is this picture from?

I tried Google Image Search without useful results.

Read the rest of this entry »

Comments (25)


In our 1992 chapter "The stress and structure of modified noun phrases in English" (in Sag & Szabolcsi, Lexical Matters), Richard Sproat and I noted that the normal order in English puts a nominal modifier before its head, but "there are some cases where it appears to be necessary to assume that the head of the construction is on the left and the modifier is on the right". We gave the examples

vitamin C, route 1, brand X, exit 14, peach Melba, steak diane, Cafe Beethoven, Club Med

My email address and cell phone number have recently found their way onto some political contact lists. And as a result, I get dozens of messages a day from Team X, where X is some politician's name: Team Trump, Team Joe, Team Warren, Team Collins, …

This led me to wonder about the history of the Team NAME construction. I'm not sure that I've got it right, so please explain in the comments what I've missed or misunderstood.

Read the rest of this entry »

Comments (54)

IRCS Prosody Workshop 1992: Undoing bit rot

Recently, Antônio Simões wrote to Cynthia McLemore to ask about a 28-year-old proceedings:

I used to find on the internet the Proceedings from 1992 that you edited with Mark Liberman. I tried to find them, but they are not on the internet anymore. Do you still have that volume in pdf? Or is it accessible somewhere on the internet? This is the volume:

McLemore, Cynthia, and Mark Liberman, eds. 1992. Proceedings of the IRCS Workshop on Prosody in Natural Speech. IRCS Report No. 92-37.

"IRCS" stands for "Institute for Research in Cognitive Science", an NSF research center founded in 1990 by Lila Gleitman and Aravind Joshi. IRCS  died in 2016 after a lingering siege of academic politics, and its website seems to have been purged last year. Penn's library has some IRCS technical reports in its repository, but not the one that Antônio is looking for. Many others are clearly missing, along with event recordings and so on — I'll see whether there are backups somewhere from which things can be restored.

Meanwhile, Cindie found a paper copy of the requested proceedings, and this page provides a table of contents with links and abstracts for scanned versions of the 26 papers it contains. Most of them are still interesting and relevant today!


Comments (1)

Further v. Farther

Apparently, further and farther come from the same source, namely the verb that we retain as further meaning "to promote". The different spellings were originally due to the general diversity of English orthography in earlier times. And the spelling was apparently not regularized because the word(s) took over as the comparative form of far, which used to be farrer. But because of the similarity of meanings, both forms seem always to have been used in the full range of adjectival and adverbial meanings, though with some probabilistic influence of far's spatial sense on the vowel.

From the 1895 OED entry for farther (not revised since then):

Middle English ferþer (whence by normal phonetic development farther ) is in origin a mere variant of further n., due probably to the analogy of the verb ferþren < Old English fyrðrian to further v. The primary sense of further, farther is ‘more forward, more onward’; but this sense is practically coincident with that of the comparative of far, where the latter word refers to real or attributed motion in some particular direction. Hence further, farther came to be used as the comparative of far; first in the special application just mentioned, and ultimately in all senses, displacing the regular comparative farrer. In standard English the form farther is usually preferred where the word is intended to be the comparative of far, while further is used where the notion of far is altogether absent; there is a large intermediate class of instances in which the choice between the two forms is arbitrary.

Read the rest of this entry »

Comments (51)

Structure of Language and its Mathematical Aspects

I recently had reasons to consult a book published in 1961, "Structure of Language and its Mathematical Aspects", Proceedings of Symposia in Applied Mathematics, Volume XII, edited by Roman Jakobson.

The table of contents:

W. V. Quine – Logic as a source of syntactical insights
Noam Chomsky – On the notion “Rule of Grammar”
Hilary Putnam – Some issues in the theory of grammar
Henry Hiż – Congrammaticality, batteries of transformations and grammatical categories
Nelson Goodman – Graphs for linguistics
Haskell B. Curry – Some logical aspects of grammatical structure
Yuen Ren Chao – Graphic and phonetic aspects of linguistic and mathematical symbols
Murray Eden – On the formalization of handwriting
Morris Halle – On the role of simplicity in linguistic descriptions
Robert Abernathy – The problem of linguistic equivalence
Hans. G. Herzberger – The joints of English
Anthony G. Oettinger – Automatic syntactic analysis and the pushdown store
Victor H. Yngve – The depth hypothesis
Gorden E. Peterson and Frank Harary – Foundations in phonemic theory
Joachim Lambek – On the calculus of syntactic types
H. A. Gleason, Jr. – Genetic relationship among languages
Benoit Mandelbrot – On the theory of word frequencies and on related Markovian models of discourse
Charles F. Hockett – Grammar for the hearer
Rulon Wells – A measure of subjective information
Roman Jakobson – Linguistics and communication theory

Read the rest of this entry »

Comments (23)