Why definiteness is decreasing, part 3
Ten days ago, I documented a striking 20th-century decrease in the frequency of the definite article the ("Decreasing definiteness", 1/8/2015) — from about 6.6% to about 5.4% in the Corpus of Historical American English; from about 6.4% to 5.2% in the Google Books ngram indices; and from about 9.3% to about 4.7% in U.S. presidents' State of the Union messages.
In two follow-up posts, I offered some additional ideas about this change:
In "Why definiteness is decreasing, part 1", I suggested that it might be connected to an overall decrease in the formality of published English, starting with the observation that in contemporary English, the frequency of the varies by a large factor between very formal material (6.42% in the "Academic" genre of the Corpus of American English) and conversational speech (2.47% in the Fisher corpus).
In "Why definiteness is decreasing, part 2", I noted that both in a collection of Facebook posts and in Fisher conversational speech transcripts, older people use the more often than younger people, and men use the more often than women; and I wondered whether this is a stable life-cycle and gender-identity difference, or the result of a change in progress. (Or both…)
Today, I want to discuss a third idea about the decreasing frequency of the, suggested to me by Jamie Pennebaker.
Read the rest of this entry »