{"id":1890,"date":"2009-11-13T08:53:05","date_gmt":"2009-11-13T12:53:05","guid":{"rendered":"http:\/\/languagelog.ldc.upenn.edu\/nll\/?p=1890"},"modified":"2009-11-13T10:33:26","modified_gmt":"2009-11-13T14:33:26","slug":"body-loses-supreme-court-appeal","status":"publish","type":"post","link":"https:\/\/languagelog.ldc.upenn.edu\/nll\/?p=1890","title":{"rendered":"Body loses Supreme Court appeal"},"content":{"rendered":"<p>This morning, I appealed the somebody-vs.-someone story to the Supreme Court of the United States. The decision came quickly &#8212; details are below.<\/p>\n<p><!--more--><\/p>\n<p>A few days ago (\"<a href=\"http:\/\/languagelog.ldc.upenn.edu\/nll\/?p=1884\">Less body in your lexicon?<\/a>\", 11\/11\/2009), I gave evidence confirming that <em>someone<\/em> and <em>everyone<\/em> are more formal than <em>somebody<\/em> and <em>everybody<\/em> (because the<em> __body \/ __ one<\/em> ratio is an order of magnitude larger in speech than in academic writing). But I also offered evidence that the <em>__one<\/em> forms are taking over from the<em> __body<\/em> forms, in all registers of English.<\/p>\n<p>Some of this evidence came from apparent-time differences in conversational transcripts, where it seems that the younger people are, the more likely they are to choose <em>__one<\/em> forms:<\/p>\n<p><a href=\"http:\/\/languagelog.ldc.upenn.edu\/myl\/BodyAge.png\"><img decoding=\"async\" title=\"Click to embiggen\" src=\"http:\/\/languagelog.ldc.upenn.edu\/myl\/BodyAge.png\" alt=\"\" width=\"475\" \/><\/a><\/p>\n<p>And some of the evidence came from historical-time counts, for example in the BYU Time Magazine corpus:<\/p>\n<p><a href=\"http:\/\/languagelog.ldc.upenn.edu\/myl\/TimeBodyOne1.png\"><img decoding=\"async\" title=\"Click to embiggen\" src=\"http:\/\/languagelog.ldc.upenn.edu\/myl\/TimeBodyOne1.png\" alt=\"\" width=\"475\" \/><\/a><\/p>\n<p>(Note that the x-axis numbers are the first years of decade-long bins &#8212; thus the value plotted at 1940 is the value for texts published between 1940 and 1949.)<\/p>\n<p>Thanks to Jerry Goldman of oyez.org and Tim Stanley of justia.com, I've got a nearly-complete archive of U.S. Supreme Court opinions in html form, and I extracted the year-by-year lexical histograms for an earlier Breakfast Experiment in which I tracked the trends in SCOTUS use of sentence-initial conjunctions:<\/p>\n<p><a href=\"http:\/\/languagelog.ldc.upenn.edu\/myl\/SCOTUS_Coord2.png\"><img decoding=\"async\" title=\"Click to embiggen\" src=\"http:\/\/languagelog.ldc.upenn.edu\/myl\/SCOTUS_Coord2.png\" alt=\"\" width=\"475\" \/><\/a><\/p>\n<p>So it was just a couple of minutes' work to check on the evolution of the __body \/ __one ratio.<\/p>\n<p>The year-by-year ratios are quite noisy, since all of the relevant words are relatively rare in these texts, so I averaged by decade. The results since 1866 look like this:<\/p>\n<p><a href=\"http:\/\/languagelog.ldc.upenn.edu\/myl\/SCOTUS_SomeEvery.png\"><img decoding=\"async\" title=\"Click to embiggen\" src=\"http:\/\/languagelog.ldc.upenn.edu\/myl\/SCOTUS_SomeEvery.png\" alt=\"\" width=\"475\" \/><\/a><\/p>\n<p>This picture is different in various ways from the Time Magazine data. The Time -body peak seems to be in the 1940-1950 decade, while the SCOTUS  peak seems to be in 1916-1925. And compared to Time, SCOTUS has anomalously-low values for the <em>somebody\/something<\/em> ratio in the 1926-1935 and 1936-1945 decades.<\/p>\n<p>Supreme Court opinions are not an ideal source of evidence for analyzing this phenomenon, because the number of authors is small and erratically variable, and (as in formal writing in general) the counts for these particular words are low overall. So in this case, the court's opinion may not have very wide application. Still, the SCOTUS data is consistent with a general decline, over the past few decades, in the frequency of <em>somebody<\/em> and <em>everybody<\/em> relative to <em>someone<\/em> and <em>everyone<\/em>.<\/p>\n<p>Thus I'm provisionally convinced that in this case, the culture as a whole (or at least its American portion) is trending fairly strongly towards the more formal member of two related pairs of words that are roughly equivalent in meaning.<\/p>\n<p>I can't think of another documented example of a case where the language as a whole is trending away from the vernacular rather than towards it &#8212; can you?<\/p>\n<hr \/>\n<p>Here are the raw SCOTUS counts in tabular form:<\/p>\n<table style=\"text-align: center;\" border=\"1\" cellspacing=\"2\" cellpadding=\"2\">\n<tbody>\n<tr>\n<td>Years<\/td>\n<td>Total words<\/td>\n<td>Somebody<\/td>\n<td>Someone<\/td>\n<td>Everybody<\/td>\n<td>Everyone<\/td>\n<\/tr>\n<tr>\n<td>1866-1875<\/td>\n<td>4,681,448<\/td>\n<td>12<\/td>\n<td>77<\/td>\n<td>7<\/td>\n<td>66<\/td>\n<\/tr>\n<tr>\n<td>1876-1885<\/td>\n<td>6,332,333<\/td>\n<td>12<\/td>\n<td>41<\/td>\n<td>23<\/td>\n<td>64<\/td>\n<\/tr>\n<tr>\n<td>1886-1895<\/td>\n<td>10,070,882<\/td>\n<td>26<\/td>\n<td>75<\/td>\n<td>27<\/td>\n<td>97<\/td>\n<\/tr>\n<tr>\n<td>1896-1905<\/td>\n<td>8,605,007<\/td>\n<td>13<\/td>\n<td>60<\/td>\n<td>40<\/td>\n<td>93<\/td>\n<\/tr>\n<tr>\n<td>1906-1915<\/td>\n<td>6,454,454<\/td>\n<td>14<\/td>\n<td>56<\/td>\n<td>15<\/td>\n<td>47<\/td>\n<\/tr>\n<tr>\n<td>1916-1925<\/td>\n<td>4,758,703<\/td>\n<td>12<\/td>\n<td>16<\/td>\n<td>16<\/td>\n<td>22<\/td>\n<\/tr>\n<tr>\n<td>1926-1935<\/td>\n<td>4,504,824<\/td>\n<td>8<\/td>\n<td>29<\/td>\n<td>8<\/td>\n<td>17<\/td>\n<\/tr>\n<tr>\n<td>1936-1945<\/td>\n<td>5,552,858<\/td>\n<td>8<\/td>\n<td>61<\/td>\n<td>20<\/td>\n<td>33<\/td>\n<\/tr>\n<tr>\n<td>1946-1955<\/td>\n<td>5,101,032<\/td>\n<td>18<\/td>\n<td>56<\/td>\n<td>35<\/td>\n<td>61<\/td>\n<\/tr>\n<tr>\n<td>1956-1965<\/td>\n<td>6,216,798<\/td>\n<td>40<\/td>\n<td>128<\/td>\n<td>25<\/td>\n<td>79<\/td>\n<\/tr>\n<tr>\n<td>1966-1975<\/td>\n<td>8,774,322<\/td>\n<td>36<\/td>\n<td>201<\/td>\n<td>20<\/td>\n<td>101<\/td>\n<\/tr>\n<tr>\n<td>1976-1885<\/td>\n<td>13,106,854<\/td>\n<td>36<\/td>\n<td>312<\/td>\n<td>28<\/td>\n<td>125<\/td>\n<\/tr>\n<tr>\n<td>1986-1995<\/td>\n<td>10,298,169<\/td>\n<td>36<\/td>\n<td>350<\/td>\n<td>14<\/td>\n<td>97<\/td>\n<\/tr>\n<tr>\n<td>1996-2005<\/td>\n<td>7,731,353<\/td>\n<td>15<\/td>\n<td>344<\/td>\n<td>18<\/td>\n<td>122<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>You should take these numbers with a grain or two of salt, since the process of extracting text from html can be messy, and I didn't spend much time dealing with the peculiarities of this particular set of pages.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>This morning, I appealed the somebody-vs.-someone story to the Supreme Court of the United States. The decision came quickly &#8212; details are below.<\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_exactmetrics_skip_tracking":false,"_exactmetrics_sitenote_active":false,"_exactmetrics_sitenote_note":"","_exactmetrics_sitenote_category":0,"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[60,10],"tags":[],"class_list":["post-1890","post","type-post","status-publish","format-standard","hentry","category-computational-linguistics","category-linguistic-history"],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/languagelog.ldc.upenn.edu\/nll\/index.php?rest_route=\/wp\/v2\/posts\/1890","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/languagelog.ldc.upenn.edu\/nll\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/languagelog.ldc.upenn.edu\/nll\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/languagelog.ldc.upenn.edu\/nll\/index.php?rest_route=\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/languagelog.ldc.upenn.edu\/nll\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=1890"}],"version-history":[{"count":0,"href":"https:\/\/languagelog.ldc.upenn.edu\/nll\/index.php?rest_route=\/wp\/v2\/posts\/1890\/revisions"}],"wp:attachment":[{"href":"https:\/\/languagelog.ldc.upenn.edu\/nll\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=1890"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/languagelog.ldc.upenn.edu\/nll\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=1890"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/languagelog.ldc.upenn.edu\/nll\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=1890"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}