Up in ur internets, shortening all the words

« previous post | next post »

Lucy Jones, "Ralph Fiennes blames Twitter for 'eroding' language", The Telegraph 10/28/2011:

Speaking at the BFI London Film Festival awards in Old Street, London, the actor said that modern language "is being eroded" and blamed "a world of truncated sentences, soundbites and Twitter."

"Our expressiveness and our ease with some words is being diluted so that the sentence with more than one clause is a problem for us, and the word of more than two syllables is a problem for us," he said.

This sort of thing always makes me suspect that it's really our veracity and our ease with facts that is being diluted.

But  the Telegraph's reporter found an expert to join Mr. Fiennes in Viewing  With Alarm:

Fiennes, who does not use Twitter, is not alone in his theory. JP Davidson, the author of Planet Word and a linguistic expert, talked this week about longer words dying out in favour of shortened text message-style terms.

He said: “You only have to look on Twitter to see evidence of the fact that a lot of English words that are used say in Shakespeare’s plays or PG Wodehouse novels — both of them avid inventors of new words — are so little used that people don’t even know what they mean now.

“This could be viewed as regrettable, as there are some great descriptive words that are being lost and these words would make our everyday language much more colourful and fun if we were to use them.

“But it’s only natural that with people trying to fit as much information in 140 characters that words are getting shortened and are even becoming redundant as a result.”

Since I'm the old-fashioned sort of person who clings to the belief that words, whatever their length, ought to mean something, I thought I'd check whether it's really true that "words are getting shortened" by the constraints of the Twitterverse.

So I grabbed the text of Hamlet, the text of a number of P.G. Wodehouse stories (Leave it to Jeeves, Jeeves and the Unbidden Guest, Jeeves and the Hard-Boiled Egg, Absent Treatment, Helping Freddie, Rallying Round Old George, Doing Clarence a Bit of Good, and The Aunt and the Sluggard), and the 100 most recent tweets from the Daily Pennsylvanian, Penn's independent student newspaper. I figured that the DP ought to count as a good representative of the Kids Today who are responsible for the alleged word-shortening trend.

I wrote a little program to adjust these texts in appropriate ways (removing the character attributions and stage directions from Hamlet, removing the Gutenberg boilerplate from P.G. Wodehouse, removing the @'s and #'s and URLs from the DP tweets, etc.), and then to count the letters in each word.

The result? The mean word length in Hamlet (in modern spelling) was 3.99 characters; in P. G. Wodehouse's Jeeves stories, the mean word length was 4.05 characters; in the DP's tweets, the mean word length was 4.80 characters.

Here's a boxplot showing the median (horizontal line), the 25th and 75th percentiles (top and bottom of the box), and the "range" (defined by the R boxplot program's default as "the most extreme data point which is no more than 1.5 times the interquartile range from the box"):

The business about word lengths is the easiest to check of the article's assertions about linguistic decay, and it's  false. Trivially and transparently so, as such plaints usually are.

I don't have the time this evening to train up language models to check the claim that "words … are even becoming redundant as a result", but I'm willing to wager a substantial sum that the DP's tweets have significantly higher average per-word entropy than the novels of P. G. Wodehouse.

This would be logical, in fact, since "people trying to fit as much information in 140 characters" ought to lead to higher information density, and therefore less per-word redundancy. Dr. Davidson's argument on this point is so obviously  inverted, from a logical point of view, that I'm inclined to wonder whether The Telegraph's reporter might have misquoted him.

Similarly, I'm confident that the distribution of syllable-counts in the Twitterverse tends to be higher than in Wodehouse or Shakespeare, since the constraint on message lengths favors semi-telegraphic modes of expression.

As for the notion that "the sentence with more than one clause is a problem for us", it's certainly true that average depth of embedding has trended downward since the 18th century (though not monotonically so). But Shakespeare and Wodehouse are not especially good examples of hypotaxis — I suspect that investigation would show that the DP's articles exhibit a somewhat higher average depth of clausal embedding than either of them.

The Telegraph article also asserts that Mr. Fiennes's full name is Ralph Nathaniel Twisleton-Wykeham-Fiennes, which I consider implausible but too good to check. If this is true, in any case, it would certainly give him a personal interest in the matter at hand. Dr. Davidson has no such excuse, though a decade spent working for the BBC might be taken to constitute extenuating circumstances.

Update — for those who like to worry about how journalism is being eroded, some uptake: "The short and tweet of it: literacy at risk, Ralph Fiennes says", Montreal Gazette; "Twitter killing English, says actor", Melbourne Herald Sun; "Actor Ralph Fiennes blames Twitter for dumbing down the English language", Daily Mail; "Ralph Fiennes Blames Twitter for ‘Eroding’ English", International Business Times; "Ralph Fiennes: ‘Twitter is eroding our language'", Houston Chronicle; "Ralph Fiennes: We're in a world of dumbed-down English thanks to Twitter", London Evening Standard; "Est-ce que Twitter tue notre langue?", La Presse; "Verfall: Verstümmelte Sätze – Ralph Fiennes beklagt heutige Sprache", Berliner Morgenpost; "Ralph Fiennes culpa a Twitter de destrozar el inglés", El Pais; …

A contrarian view from Gawker ("Ralph Fiennes Says Twitter is Destroying English … #whatever #shutupvoldemort"):

You know what will never be flattened or truncated? An actor's ability to plug his next movie, which in this case would be a contemporary version of Shakespeare's Coriolanus, which Fiennes stars in and directed and is due in theaters this January. Don't get Ralph started on his sententious exegesis on establishmentarian perturbation! He'll gnaw your ear off for hours.



84 Comments

  1. Valentine said,

    October 28, 2011 @ 6:38 pm

    Tee hee.

  2. George Grady said,

    October 28, 2011 @ 6:42 pm

    I may be misinterpreting the box plots, but Hamlet has plenty of words longer than 8 letters, and Wodehouse used plenty of words longer than 9 letters.

    [(myl) You are, indeed, misinterpreting the box plot: "the whiskers extend to the most extreme data point which is no more than range times the interquartile range from the box", and the default value for range is 1.5. If we set range=0, so that the whiskers extend to the data extremes, we get this plot:

    (The longest word in the Jeeves stories is "Melonsquashville", used to illustrate Jeeves' omniscience by reference to a ticket seller who can instantly answer questions like ""When's the next train for Melonsquashville, Tennessee?". The longest word in the DP's tweets is "acknowledgment", used in the tweet "LETTER: A College sophomore wonders why #SteveJobs deserves so much acknowledgement in death". The longest word in Hamlet is "transformation", in Claudius's line "Something have you heard / Of Hamlet's transformation".)

    From the perspective of data visualization, it would probably have been better to use a bean plot or a violin plot... But I can add that the 95th percentile for both Hamlet and the Jeeves stories is 8 letters, whereas for the DP's tweets it's 10 letters.

    Anyhow, your misunderstanding was a natural one that I should have avoided by specifying in the OP how the whiskers were calculated -- which I've now done.]

  3. Sili said,

    October 28, 2011 @ 6:43 pm

    “But it’s only natural that with people trying to fit as much information in 140 characters that words are getting shortened and are even becoming redundant as a result.”

    Dr. Davidson's argument on this point is so obviously inverted, from a logical point of view, that I'm inclined to wonder whether the Telegraph's reporter might have misquoted him.

    I took it to mean that words we consider necessary in 'natural language' are becoming redundant in twitter language and are therefore left out of tweets (Omit Needless Words!) to the annoyance of people who need everything spelt out (Avoid Ambiguity!).

  4. teucer said,

    October 28, 2011 @ 6:57 pm

    Seems to me that the tweets should be expected to have longer words on average, since they are more apt to leave out short words such as "the." But you might also expect the content words to be shorter, giving less of a tail on that end of the graph, which is not actually seen in the data.

  5. rootlesscosmo said,

    October 28, 2011 @ 7:05 pm

    Wikipedia confirms Fiennes' full name, so it's probably pure coincidence that one of Bertie Wooster's friends was called Pongo Twistleton-Twistleton.

  6. Steve Kass said,

    October 28, 2011 @ 7:14 pm

    Teucer’s observation about words like “the” is a good one, I think For the record, I read teucer’s comment only after tweeting about this very blog post. Aside from including the LL link, I wrote, “Box plot of the day: Daily Pennsylvanian tweets contain longer words, on average, than Shakespeare, Wodehouse.” I think “The Daily Pennsylvanian’s” would have fit, though.

  7. Ø said,

    October 28, 2011 @ 7:16 pm

    Nice and old-fashioned of you to write "plaints" instead of "complaints". We don't shorten words that way so much any more.

  8. Brett said,

    October 28, 2011 @ 7:21 pm

    I wonder if "redundant" is being used in a peculiarly British way here. I know that it has a meaning in Britain this is basically "fired" (or "sacked"), although I assume there are some subtleties related to whether the vacated position will subsequently be filled by a new hire. Perhaps this use of "redundant" is influence by a figurative extension of that meaning?

  9. Lee Sullivan said,

    October 28, 2011 @ 7:42 pm

    "Wikipedia confirms Fiennes' full name, so it's probably pure coincidence that one of Bertie Wooster's friends was called Pongo Twistleton-Twistleton."

    Or that once of his erstwhile fiancees was Bobbie Wickham. Rather a good bit of coincidence, there.

  10. J Lee said,

    October 28, 2011 @ 8:14 pm

    what is ironic is his whine that we have lost some words used in shakespeare's time; obviously a great deal of extremely frequent modern words found in shakespeare are nothing but false friends.

    personally i hoped mass adoption of texting or twitter would rid us of 'would of gone' etc. or at the least result in more latinate verbs to replace phrasal verbs that require more keystrokes

  11. Ø said,

    October 28, 2011 @ 8:17 pm

    I don't believe that Pongo Twistleton has the extra "Twistleton". Not like, for example, Cyril Bassington-Bassington.

  12. seriously said,

    October 28, 2011 @ 9:58 pm

    In the same article, the erudite Mr. Fiennes used "begs the question" in the sense of "raises the question." Yes, yes, I know, that's how everyone uses it now. its descriptively coming to mean that, and it's no use having that as my pet peeve, but certainly it isn't something that a person complaining about erosion of standards, dilution of language, etc. would be expected to do.

    [(myl) Indeed.]

    Who's law is it that states that a letter to the editor complaining about grammar, usage, or syntax will invariably contain at least one error of grammar, usage, or syntax?

    [(myl) This has been called "Hartman's Law of Prescriptive Retaliation", and also the Hartman/McKean/Skitt Law, and also Muphry's Law and Murphry's Law, both intentionally different from Murphy's Law.]

  13. Boudica said,

    October 28, 2011 @ 11:06 pm

    Were there similar (incorrect) complaints with the rise of telegrams?

  14. Wu Yong said,

    October 29, 2011 @ 12:39 am

    Obviously, this story will be tweeted by all and sundry — as it should.

  15. Dan Hemmens said,

    October 29, 2011 @ 4:54 am

    What I don't understand is why so many people equate brevity with stupidity.

    I mean it's been a while since I've had to write a formal essay, but I always understood that expressing yourself succinctly was as important a skill as using lots of long words.

  16. Stefania Spina said,

    October 29, 2011 @ 5:17 am

    I did the same for Italian, and the results are similar: http://sspina.blogspot.com/2011/10/twitter-eco-e-manzoni.html
    (it's in Italian, sorry).

  17. Richard Gadsden said,

    October 29, 2011 @ 5:55 am

    That really is Ralph Fiennes' full name; he's from a notable English gentry family. His father's cousin is Sir Ranulph Fiennes, the explorer

  18. markg said,

    October 29, 2011 @ 6:03 am

    Though I don't tweet very often, I think the formal, haiku-like constraints are interesting. Surely the problem, if there is one, is not the average length of words in the Twitterverse, but the distortion of language and speech/conversation rhythms into impersonal sound bytes (though 'txt'-speak may be more of a culprit here). But I find it difficult to be alarmed about such things; any living language is bound to be relentlessly protean.

  19. mollymooly said,

    October 29, 2011 @ 6:17 am

    My interpretation of 'redundant' in

    words are getting shortened and are even becoming redundant as a result

    is similar to Brett's. I am confident what JP Davidson really meant was

    words are getting shortened and are even being omitted entirely as a result

    A rebuttal should not be predicated on the assumption that a peever using a hard word like "redundant" or "patois" actually knows what it means. This wastes my LL subscription dollars.

  20. markg said,

    October 29, 2011 @ 6:21 am

    Sorry to go off-topic, but I want to amend that lazy last statement, as it suddenly occurred to me that of course there may well be living languages that are not 'relentlessly protean', and some, such as the Amazonian Pirahã, may hardly change at all.

  21. Jarek Weckwerth said,

    October 29, 2011 @ 6:22 am

    Somewhat tangential: The horizontal lines in the boxplot are at the same level; you say they show the median, while the numerical values you give are the means. Can you confirm? I'm asking because if they show the means, they certainly don't show a mean of 4.80 for the twitters, and a difference of 0.75 would be ginormous. What ARE the medians then? Sorry to be a pest…

    [(myl) No sweat -- I should have given the medians, which are 4 characters in all cases.]

  22. Adam said,

    October 29, 2011 @ 7:08 am

    I agree with Ø about "plaints". Let's revive it. Modern kids with unnecessarily long words … get off my lawn!

  23. Steven Capsuto said,

    October 29, 2011 @ 7:39 am

    It's a shame that "serious" media such as the Telegraph give credence to actors' statements on linguistics, politics, etc. No one would think to publish a linguist's or politician's views on the finer points of Method acting.

    In any case, your experiment is fascinating but are you sure that tweets created in an Ivy League environment are a representative sample?

    [(myl) There's a listing of "Directors, Screenwriters and Actors on Twitter" here. The first one on the list is Judd Apatow. For his most recent 15 tweets, the mean word length in characters is 4.65.

    As for the terms of comparison in general, shouldn't we be comparing Elizabethan graffiti or WWI letters home? For the first few letters from the last (1917) collection, the mean word length is 3.69 characters... ]

  24. Cathy Relf said,

    October 29, 2011 @ 7:57 am

    Oh Mr Fiennes, Twitter is the single biggest language resource I have yet discovered – full of word nerds, linguists, pedants and ponderers – and, yes, some naughty txtspeakers too.

    PS. Don't worry, this hasn't stopped me fancying you.

  25. Dan Hemmens said,

    October 29, 2011 @ 7:59 am

    In any case, your experiment is fascinating but are you sure that tweets created in an Ivy League environment are a representative sample?

    It's possible that Mr Liberman was deliberately and cheekily skewing the figures, but you can make a very strong case that it was an attempt to pick a balanced sample. While neither Shakespeare nor Wodehouse had the benefit of a university education, they were both professional writers at the top of their fields, and if we are attempting to trace the effect of *twitter alone* on people's use of language, it is important that we compare like with like.

    Obviously a lot of tweets *are* written by people with poor literacy skills, but this does not mean that tweeting *causes* poor literacy, and I am *absolutely damned certain* that the average sixteen year old in 21st century London is *far* more literate than the average sixteen year old in Shakespeare's England.

  26. Andy Averill said,

    October 29, 2011 @ 8:08 am

    I wonder though whether some of the difference is accounted for by the proportion of proper nouns (especially names) in the text? I would expect that to be higher in tweets than in any work of literature. I imagine they tend to be longer than the most common English words.

    [(myl) Perhaps; but at this point I believe that you have given the matter several orders of magnitude more thought than Ralph Fiennes, J.P. Davidson, and Lucy Jones combined.]

  27. ENKI-][ said,

    October 29, 2011 @ 9:46 am

    @teucer: Constraints like those put forth by twitter tend to influence people toward leaving out low-information words, and low-information words tend to be short. However, I don't think low information words compose a significant percentage of unique short words, and I suspect that the literary forms tend toward avoiding redundant use of low-information words anyhow. However, it's easy enough to run the experiment over again with text free of articles, or text free of the top 1% most common words.

    I have a sneaking suspicion that using tweets from a 'literate' source like a university newspaper would not be convincing to a supporter of the abovementioned argument, and were this experiment to be run over again I'd recommend using the public feed as a source, rather than the DP.

  28. John said,

    October 29, 2011 @ 10:50 am

    And of course the Bard's most famous quotation:

    To be or not to be, that is the question.

    This has a mean and median word-length of 2 and, isn't exactly content-free.

  29. Josh Bowles said,

    October 29, 2011 @ 10:50 am

    Twitter really is one of those unfortunate kinds of observational sets of data that varies so widely given a dependent domain that no argument will convince anyone of what they are not already convinced of. Given twitter is social media, the tweets you tend to see are MORE THAN LIKELY clustered around socio-cultural-economic chunks. Extreme claims are better bounded by counter-extreme evidence. And don't forget about code-switching. Just 'cause you go to a good school doesn't mean you can't switch to Twitter-speak. Also, if there's an extreme claim about short words (and we assume "non-literates" use shorter words than "literates", who better to test the extreme claim against than those who you think might have a "larger" vocab… testing for data sets for which you expect to support the extreme claim is trivial). If college students confirm the extreme claim, then the likelihood that not-college-educated will confirm the claim is even higher.

    But really, it seems that many people are convinced that reading Shakespeare or Melville makes them a "better" person because they are reading all the big and interesting words. And the layperson on the facebooks and twitters will continue on in normal ignorance of Dante and Shakespeare (who, by the way, were part of pop culture in their day). Aheesh.

    Stay tuned for the next installment:
    SIRI #{insert_any_nlp_voice_response_software} IS DUMBING DOWN CONVERSATIONAL VOCABULARY AND RUINING OUR ABILITY TO CONVERSE INTELLIGENTLY

  30. eric said,

    October 29, 2011 @ 11:48 am

    Comparing creative writing and twitter (or email) may be comparing apples and oranges. I myself (at age 44) am a product of what we call the "business writing" method. Beginning in college, and continuing today in my workplace, I am encouraged to write sentences with fewer clauses. This aids easy readability and scanning, and communicates efficiently. Could it be that we really are writing shorter sentences, but that this is not the results of twitter? Rather, it may be the result of a cultural trend that values efficiency and clear communication more highly than creativity?

    [(myl) Average sentence lengths have been getting shorter, in many varieties of English prose, for several hundred years -- see e.g. "Complexity", 9/7/2005. To the extent that it's not purely a matter of fashion, this is probably due to changes in the role of classical languages in education, changes in the audience due to increased literacy rates, and (more recently) with a general stylistic trend away from ornamentation. There is no comparable trend, as far as I'm aware, towards shortening of word lengths.

    But none of this has anything whatever to do with Twitter. As far as I can tell, whenever someone tells you that Twitter has caused this or that change in the English language, you can safely bet the rent (or mortgage) money that they're talking nonsense.]

  31. J.W. Brewer said,

    October 29, 2011 @ 12:05 pm

    Since telegrams were apparently typically paid for on a by-the-word basis rather than by-the-character basis, you might expect even more than with tweets to have longer average words with more short words that would be obligatory in ordinary prose omitted. Eg., here's Orville Wright's famous telegram to his father from December 1903: "Success four flights Thursday morning all against twenty one mile wind started from level with engine power alone average speed through air thirty one miles longest flight fifty nine seconds inform press home Christmas."

  32. Dominik Lukes said,

    October 29, 2011 @ 1:11 pm

    I always associated PG Woodhouse with a rather high level of hypotaxis -particularly complex chains of parentheticals. But maybe my impression was wrong. Wonder what a thorough investigation would reveal.

  33. Theophylact said,

    October 29, 2011 @ 1:28 pm

    Actually, John, the mean word length is 3 (ten words; thirty letters).

  34. Josh Bowles said,

    October 29, 2011 @ 1:39 pm

    AAH. This stuff drives me crazy. A simple Bayesian model:

    P(t | L,e) = P(L | t, e) * P(t | e) / P(L | e).

    where t = twitter-cause, L = language-degradation-effect (shorter words), and e = evidence.

    In order to prove that twitter causes some degradation in language you have to prove that there is in fact degradation in language. This means you need to compare common language today and common language from Elizabethan England, then compare Twitterverse with common language, and Shakespeare with Elizabethan common language. There is no corpus of spoken Elizabethan English, all we have is written text. Even if you had a spoken Elizabethan corpus to compare against Elizabethan written text, Twitter and social media are entirely new genre's of language. It is not comparable. Shakespeare's constraints were not bytecode related, but they were dictated by clarity in pronunciation. Twitter and Shakespeare are not comparable, but neither is Twitter with most historical textual material! This means Twitter's impact is unpredictable. We could build models for prediction but we won't know if they are right. I think it's this intuitive sense of the unpredictability of Twitter's (et.al.) effect on culture that creates fear, panic, and unfounded claims.

    Lastly, say we had good historical data (like scientists may have 100 years from now). We would need some way to model average word length as a mean over all discourse genres for a really good sample set (i.e., diverse and large enough) for currently spoken and written language. We can't do this. So we use what is accessible and extrapolate from there: internet. But the internet is a special phenomenon (about 12 years old!!) with a crazy number of socio-cultural linguistic sub-genres. And this is where trained linguists come in handy, they already grapple with hard questions like linguistic relativity and the quality of information contained in phonetic and written characters. Linguists deal with hard problems like genres, and how language and culture relate and where the ability to use language comes from. For most people, the language-is-thought-is-culture fallacy goes un-investigated (or it's oversimplified) because people are running with their intuitions and circumstantial evidence. Anyone reading this blog is well aware of that.

    I have given up trying to explain this stuff to people at work. They see patterns in noise and no amount data removes the pattern they see; the problem is classifying a pattern as useful or not, and if the pattern reinforces some epistemic belief they have about reality and the world, then you can't convince them that they're personal belief is wrong. You just sum up the data and leave it at the doorstep of Karl Popper's house because you cannot falsify someone's belief in something, only their justification of it. Whether someone wants their beliefs to be justified is a matter of personal conviction: "The 'principle of causality' is the assertion that any event whatsoever can be causally explained — that it can be deductively [or probabilistically] predicted"

  35. jan said,

    October 29, 2011 @ 4:12 pm

    I wonder why Ralph Fiennes' first name is pronounced the way it is. It sounds like Ray or Rafe? Is that the usual pronunciation in Britain?

  36. Christopher said,

    October 29, 2011 @ 5:19 pm

    Man, we've long since passed the point where people have no idea what the hell Shakespeare was talking about.

    I would need to see some hard evidence before I'd believe that the average Baby Boomer or Gen Xer knew what a petard or a bodkin was.

  37. Robert said,

    October 29, 2011 @ 5:36 pm

    What does "interquartile" mean?

    [(myl) The quartiles are the 25th and 75th percentiles. The interquartile range is the difference between the corresponding values.]

  38. zellieh said,

    October 29, 2011 @ 5:45 pm

    @jan,

    The odd pronunciation of Ralph Fiennes name is an upper-class British thing; due to the accent, the "a" sound is more of an "ay", and the "l" is almost unspoken. Ordinary Brits pronounce Ralph "Ralf", where the "Ral" rhymes with the "Cal" in "California", just as many Americans would.

  39. Giacomo Ponzetto said,

    October 29, 2011 @ 5:52 pm

    Echoing eric's comment and Mark Liberman's reply, I am as puzzled by Ralph Fiennes's complaint about declining word and sentence length as by his attribution of responsibility to Twitter.

    As a native speaker of a Romance language, for many years I have been forcefully admonished by sundry teachers that English demands short words and short sentences, and that refusing to comply is at best a Latinism, and more likely (in my case) a clumsy Italianism.

    My high-school teacher used to tell us in so many words: "English prefers parataxis to hypotaxis." Going back to Victorian manuals—or are they really Edwardian?—the Fowlers' fourth rule is: "Prefer the short word to the long." (Fowler, H. W., and F. G. Fowler. 1906. The King's English. Oxford: Oxford University Press. P. 11).

    Both tenets are certainly impressed upon graduate students in economics in the U.S.—and by imitation the world over. A widely read usage guide (Cochrane, John H. 2005. "Writing Tips for Ph.D. Students." Mimeo, University of Chicago) instructs writers to "Use simple short words not big fancy words" (p. 7) and "Keep down the number of clauses in your sentences" (p. 8).

    Considering how difficult and at times painful it has proven for me to strive to adhere to such standards, I could not but appreciate if Mr. Fiennes were to lead a crusade in favor of Latinate polysyllabic words and subordinate-rich complex sentences. But shouldn't he address his complaints to the usage authorities preaching twofold brevity rather than to Twitter-using average writers?

  40. J Lee said,

    October 29, 2011 @ 5:58 pm

    to hell with those examples. the meanings of so many words in shakespeare's works have changed yet people who manifestly do not and indeed can not without training understand what shakespeare meant can delude themselves to the point of publically making claims like the above writer's.

    i am now more skeptical of the entire notion of a literary canon and i suspect it actually deters reading by convincing average people that if they resolve to read more it should be some 'classic' novel of ungodly length, or a shakespeare play they then have to pretend to understand.

  41. Rod Johnson said,

    October 29, 2011 @ 9:48 pm

    @zellieh: I don't think it's "due to the accent"–no one pronounces "Alf" (to pick the closest rhyme I can think of) "Afe" or has a "ay" where everyone else has "a." It's just an older pronunciation of Ralph. Ralph Vaughan Williams' name is pronounced the same way. I believe, in fact, that that [rælf] is a spelling pronunciation.

  42. Rod Johnson said,

    October 29, 2011 @ 9:53 pm

    Here's a little disquisition on the "Rafe" pronunciation of Ralph, and why it is that way: http://users.ox.ac.uk/~mert1230/Pronunciation_of_Ralph.htm He invokes compensatory lengthening with the loss of /l/, followed by the Great Vowel Shift, but, as he notes, that doesn't explain why "half" isn't pronounced "hafe."

  43. Rod Johnson said,

    October 29, 2011 @ 9:56 pm

    One last bit, for those of you who are still unsure how to pronounce Ralph Fiennes name, I commend this valuable Youtube video to you.

  44. Janice Byer said,

    October 30, 2011 @ 2:01 am

    "What I don't understand is why so many people equate brevity with stupidity."

    My guess is they're confounding fluency with articulation, forgetting that algebraically speaking the truth is simple, with straight talk being the shortest distant to one's point. It's been said the power of language is best gauged not by what it can say to us, but by what you doesn't need to say.

  45. Janice Byer said,

    October 30, 2011 @ 2:18 am

    Oops, Muphry's law means I should've written "…they're confounding fluent with articulate" not what I wrote.

  46. Spectre-7 said,

    October 30, 2011 @ 2:49 am

    @Janice Byer

    I started reading your second post and thought, "Oh good, she caught it."

    …and then you went and fixed something I'd passed over completely. :)

    It's been said the power of language is best gauged not by what it can say to us, but by what you doesn't need to say.

    I wasn't sure whether you were after what you don't need to say or what it doesn't need to say, and I imagine the error arose from editing one into the other. Lovely thought either way.

  47. Janice Byer said,

    October 30, 2011 @ 3:04 am

    AND in my last line, I should've written "…it doesn't…" NOT "…you doesn't…".

    My apologies. I guess I'm too old to stay up commenting like that XKCD comic stick figure.

  48. Colin said,

    October 30, 2011 @ 3:41 am

    (1) Have you been to a Shakespeare production lately, Christopher? My experience is that good acting can still put the plays across very well indeed. Without denying the impact of 400 years, it's possible that even in the original productions not everyone in the audience was clear on petards. Reading the plays is another matter.

    (2) Yeah, if anything teh 'net has enriched written language and given more people a feel for the difficulties of written communication. YMMV, IMHO etc. are kinda subtle in terms of the way they negotiate an exchange, and people are learning multiple written codes. The extraordinary fact is that in a place like the US, half the population is writing frequently, whether in e-mail, facebook, twitter, texting. That's gotta mean way more everyday writing by more people than at any point in the past.

    (3) Thanks to Josh Bowles for putting it so well. Part of it has to be an individual's age — we reach a point when we notice the gap between our idiolects and the current pop culture. But there's also something satisfying about a generalized narrative of decline.

  49. Janice Byer said,

    October 30, 2011 @ 3:46 am

    Spectre-7, thank you for your kind validation of my comment and for catching my error. My guilty pleasure is sometimes to "post first, then proof-read". I'm back on the wagon.

  50. Spectre-7 said,

    October 30, 2011 @ 4:27 am

    No worries, Janice. We're all guilty of shooting first and copy-editing later on occasion.

  51. Jeroen Mostert said,

    October 30, 2011 @ 6:42 am

    Completely agree with Colin that good acting can bring across the play even if you don't know what the bare bodkin is that a man could make his quietus with — or indeed what that's supposed to mean, exactly. For myself I found it impossible to appreciate Shakespeare until I saw the plays on stage. When the language is flowing and accompanied by the action, it's far easier to understand than when you're reading it, where you're continuously stumbling over the archaic nature of the text.

    That's not to say reading and analyzing the plays should be skipped in favor of watching performances — both are valuable. But people who teach Shakespeare solely by expecting a modern reader to understand and appreciate the written plays are committing a crime, the victims being the people who decide Shakespeare is "not for them" because they can't see the meaning for the text. Shakespeare is of course for everyone.

  52. meanderings » Blog Archive » Language Log »Up in ur internets, shortening all the words said,

    October 30, 2011 @ 9:43 am

    [...] "The business about word lengths [Fiennes claim that Twitter is shortening word-length] is th… language, twitter [...]

  53. Rod Johnson said,

    October 30, 2011 @ 10:53 am

    Janice, I think of that little button you push to post as the "Reveal Typos" button.

  54. linda seebach said,

    October 30, 2011 @ 12:47 pm

    Having had posts swallowed by comment software, I formed a habit of copying comments into a file I keep handy for the purpose. It has the secondary benefit of making me reread whatever I've written, before I push the Reveal Typos button. (I like that, I'm stealing it.) I still miss typos, let alone thinkos (what my son calls them) but fewer reach the eyes of other people.

    I like the 15-minute comment editing, though.

  55. Just another Peter said,

    October 30, 2011 @ 5:18 pm

    Just remember: never use a big word when a diminutive one will suffice.

  56. Taesun Moon said,

    October 30, 2011 @ 11:50 pm

    "removing the @'s and #'s": Do you mean you removed all tokens that begin with those characters or just those particular characters? Because hashtags in many cases are multiple tokens concatenated into one. You have one (from Gawker) in your post: #shutupvoldemort. That's three words masquerading as one. Also, user handles will probably be longer than the average English word. User handles and hashtags like that will skew the statistics.

    [(myl) I did it both ways. Removing just the @'s and #'s, but retaining the hashtags, gave a mean word length of 4.80. Removing all tokens beginning with @ or # gives a mean word length of 4.68.

    In my opinion, it's appropriate to leave hashtags in, since they are arguably new words of a special kind, which enrich the communications stream. But leaving them out doesn't come close to rescuing Mr. Fiennes and Dr. Davidson.]

  57. marie-lucie said,

    October 31, 2011 @ 12:23 am

    why "half" is not pronounced "hafe"

    The pronunciation of "halfpenny" as "haypny" suggests that the vowel change responsible for "Rafe" instead of "Ralf" occurred here too, before the loss of "f" caused by cluster simplification in the compound.

  58. No, Twitter Isn't Ruining the English Language - Forbes said,

    October 31, 2011 @ 7:23 am

    [...] claim that Twitter was phasing out longer words in favor of shorter ones, he decided to see if that was the case. So I grabbed the text of Hamlet, the text of a number of P.G. Wodehouse stories(Leave it to [...]

  59. Private Zydeco said,

    October 31, 2011 @ 7:23 am

    Rebus Grice, Yule's Constant…….

  60. No, Twitter Isn’t Ruining the English Language said,

    October 31, 2011 @ 11:31 am

    [...] claim that Twitter was phasing out longer words in favor of shorter ones, he decided to see if that was the case. So I grabbed the text of Hamlet, the text of a number of P.G. Wodehouse stories(Leave it to [...]

  61. Twitter Is Not the Enemy of the English Language (The Atlantic Wire) | News Bulletins said,

    October 31, 2011 @ 1:23 pm

    [...] 4.05 characters; in the DP‘s tweets, the mean word length was 4.80 characters," he found, writing on linguistic blog Language [...]

  62. Linguistics Professor Finds Average Word Length In A Tweet Is Longer Than In Shakespeare - AllTwitter said,

    October 31, 2011 @ 1:31 pm

    [...] Lieberman, a linguistics professor at the University of Pennsylvania who blogs at Language Log, was intrigued by Fiennes' comments and wanted to test his [...]

  63. Rod Johnson said,

    October 31, 2011 @ 4:37 pm

    Hey Mark–according to one of those trackbacks up there, you say Twitter is making the English language "better."

  64. jamessal said,

    October 31, 2011 @ 4:51 pm

    Eric: sentences with fewer clauses… aids easy readability

    Forgive me if someone else has already challenged that assertion, but Mark Liberman didn't in his comment beneath it (probably because it was somewhat off-topic), and it's just not true. Paratactic prose is often more difficult to read than hypotactic prose, because it leaves it up to the reader to make the semantic connections between the clauses. The children's book authors of yesteryear understood this better than today's (or at least today's editors: I know one personally who understands the value of parataxis but whose editors insist — or, more typically, the weasels, claim that their stupid bosses insist — that shorter is always easier); William Steig and Lewis Carroll both wrote more highly hypotactic prose, I'll bet, than Wodehouse or Shakespeare.

  65. jamessal said,

    October 31, 2011 @ 4:53 pm

    I'm sorry, that should have been "I know one personally who understands the value of hypotaxis…"

  66. Link love: language (36) « Sentence first said,

    October 31, 2011 @ 5:48 pm

    [...] Twitter eroding language? (Clue: [...]

  67. textiling « Arnold Zwicky's Blog said,

    October 31, 2011 @ 6:53 pm

    [...] rises — see, for example "Up in ur internets, shortening all the words" in LLog, here — it's worth looking back to other recent dire threats of this sort: texting and [...]

  68. Arnold Zwicky said,

    October 31, 2011 @ 7:06 pm

    Before Twitter, plain old texting: in The Onion, here.

  69. Twitter Is Not the Enemy of the English Language (The Atlantic Wire) - Tech | Tech Buzz Blog said,

    October 31, 2011 @ 7:35 pm

    [...] characters; in the DP‘s tweets, the mean word length was 4.80 characters," he found, writing on linguistic blog Language [...]

  70. When will people realise that technology doesn’t wreck language? « thesociallinguist said,

    November 1, 2011 @ 8:11 am

    [...] try and compete with the brilliant analysis/discussion over the the Language Log, here, here and here), but Fiennes' comments bely a deeper distrust among certain sections of society on the [...]

  71. Real trends in word and sentence length | English Teaching Daily said,

    November 1, 2011 @ 1:27 pm

    [...] couple of days ago, The Telegraph quoted an actor and a television producer emitting typically brainless "Kids Today" plaints about how modern modes of [...]

  72. Carol StJohn said,

    November 2, 2011 @ 1:09 pm

    The point of language in any form is communication. Being a pragmatist, I don't think the length of words is relevant if the message is clear. American English speakers always go for the shortened version of words, such as phone for telephone, net for internet, etc. We're also big on acronyms, such as AIDS or RAM, which become words in their own right. And who says compact disk when they can say CD? A high number of syllables doesn't necessarily convey more info. Likewise, using 4 (for) or U (you) only cause a problem if you insist on choking on them and being distracted from the idea.

    There has always been a distinction between formal and informal language, and even Shakespeare uses this in the speech of his aristocratic versus common characters. Twitter is informal. Writing a business letter is formal. I think people instinctively understand this.

    A living language flows and morphs to relate to our experience. The world is changing so fast that the language must follow. We need to communicate quickly and compactly to keep up. Maybe an Amazon native language doesn't change because their world remains the same.

  73. Rod Johnson said,

    November 2, 2011 @ 3:00 pm

    The point of language in any form is communication.

    Where has this been determined? Did someone make a decision to design language for this purpose? In what sense does any naturally evolving phenomenon have a "point"? Beware glibly functional, teleological explanations of language.

  74. Rod Johnson said,

    November 2, 2011 @ 8:59 pm

    …which, on further consideration, is pretty tangential to your actual point, with which I agree.

  75. Is English REALLY dying AGAIN ? | A journée in language said,

    November 3, 2011 @ 2:42 am

    [...] (a blog I thoroughly enjoy) has also chimed into the recent Fiennes-twitter debate.  In this article he's examined the length of words in Hamlet, a number of PG Wodehouses's stories and [...]

  76. This Week’s Language Blog Roundup | Wordnik ~ all the words said,

    November 4, 2011 @ 11:56 am

    [...] forces efficiency," but "also lends itself to some really, really fun wordplay.” (Mark Liberman at Language Log gets a shout-out in Knapp’s piece too.) Ben Zimmer went as far as to say that “Twitterology” [...]

  77. “Brevity is the soul of lingerie”*… « (Roughly) Daily said,

    November 6, 2011 @ 3:03 am

    [...] then did some research to create some (mitigating) context for his objection: A couple of days ago, The Telegraph quoted an actor and a television producer emitting typically brainless "Kids Today" plaints about how modern modes of [...]

  78. CRITICAL LINKING: November 7, 2011 | BOOK RIOT said,

    November 7, 2011 @ 7:03 am

    [...] 4.05 characters; in the DP‘s tweets, the mean word length was 4.80 characters,' he found, writing on linguistic blog Language [...]

  79. Brett Wilkes said,

    November 13, 2011 @ 1:15 am

    It's nice to read logical thoughts that bring sensible perspective to issues which the mainstream only hears of when sensationalistic bull like Fiennes' assertions pops up.

  80. De la twitterisation de la langue… | En Tous Cas… said,

    December 5, 2011 @ 12:33 pm

    [...] droit de cité. Mark Liberman, blogueur pour Language Log, s'est brillamment exprimé ici au sujet de cette prise de position de Fiennes. Liberman s'est amusé à comparer la longueur [...]

  81. Utarmar Twitter och sociala medier det moderna språkbruket? said,

    December 16, 2011 @ 9:58 am

    [...] Men hur står det till med det? En blogg tillhörande personal på Pennsylvanias universitet, Language log, matade in text från Hamlet, ett gäng P.G. Wodehouse-texter och de 100 senaste tweetsen från [...]

  82. OUPblog » Blog Archive » Verily, this tomfoolery must be quashed! said,

    December 19, 2011 @ 3:32 am

    [...] textspeak ones, I’ll leave that misconception to be elegantly squashed by Language Log’s Mark Liberman. It appears from his researches that words used in tweets are longer in terms of characters than [...]

  83. To Tweet, Perchance to Dream: Three Twitter Myths Dismissed - Scatter/Gather: a Razorfish blog about content strategy, pop culture and human behavior said,

    October 7, 2012 @ 9:18 pm

    [...] says Mark Lieberman, a linguistics professor at the University of Pennsylvania anyway. Lieberman studied 100 tweets [...]

  84. David Farkas said,

    November 1, 2012 @ 12:39 pm

    I first heard about this study through your interview with NPR. One point that stood out to me was when you described texting among children, teens, and adults. You mentioned that teens no longer shorten words to 'u', '4', 'wtvr' and instead now type 'you', 'for', 'whatever' etc. It was said that only a small population still text with these abbreviations and it was inferred that it is an age based gap.

    I am an interaction/user experience designer and would like to offer another suggestion. This is not based on age but on the demographics of phones being used. If we look at the trend of text messaging, the beginning was through feature phones where in order to type the letter C, the number 2 had to be pressed three times. This made for a very long workflow to type a simple word like 'for' (Press3-3 times for 'f', Press6-3 times for 'o', Press 7-3 times for 'r') and led to the use of simply typing '4'. As smart phones become more common, especially among teens, full keyboards are now available. It is now easy to type f-o-r on the keypad and, because numeric keys are often hidden as is the case with many touch screen devices, more difficult to navigate to the shortened numeric replacement. Carrying this thought further, many teens and young adults are embracing the smart phone market while younger children who have phones for emergencies and older adults who don't need or want the bells and whistles of smart phones are sticking with their feature phones. I would argue that age is an outcome of the type of phones people are using and it is the phone type that drives messaging-format and brevity and not age.

RSS feed for comments on this post