{"id":646,"date":"2008-09-27T21:39:39","date_gmt":"2008-09-28T01:39:39","guid":{"rendered":"http:\/\/languagelog.ldc.upenn.edu\/nll\/?p=646"},"modified":"2008-09-27T21:50:44","modified_gmt":"2008-09-28T01:50:44","slug":"a-correlate-of-animacy","status":"publish","type":"post","link":"https:\/\/languagelog.ldc.upenn.edu\/nll\/?p=646","title":{"rendered":"A correlate of animacy"},"content":{"rendered":"<p>For the last couple of days, I've been in Chicago at an NSF-sponsored workshop on \"animacy and information status annotation\", organized by Annie Zaenen, Cathy O'Connor and Gregory Ward.<\/p>\n<p>A traditional and characteristic example of the role of animacy in English syntax is the way it affects the choice between the two ways of expressing genitive relations, X's Y vs. the Y of X.\u00a0 In general, the apostrophe-s structure is said to be preferred for animate Xs, while inanimates tend to go with the of-phrase. I'm a believer in Yogi Berra'a dictum that you can observe a lot just by watching, especially if you count things. So during the wrap-up session this afternoon, I thought I'd try using some simple web searches to probe this animacy-genitive relationship.<\/p>\n<p><!--more--><\/p>\n<p>Since we can only do string searches &#8212; the web isn't parsed &#8212; I wanted to find a reliable string-wise proxy for finding the head of the of-phrase, and decided to try entities with single-word names. Specifically, I decided to try people, companies, countries, and chemical elements. And to pin it down further, I decided to try some contemporary American politicians and some IT companies. (I started with four members in each category, but then I added one more country and one more element.)<\/p>\n<p>Thus the search string \"google's\" got 17 million hits, while the search string \"of google\" got 13.4 million hits, for a ratio of about 1.27. I repeated the analogous searches for the other 17 names. The results are kind of cute:<\/p>\n<div id=\"mainContent\">\n<table border=\"1\" cellspacing=\"2\" cellpadding=\"2\">\n<tbody>\n<tr>\n<td><\/td>\n<td><strong>__'s<\/strong><\/td>\n<td><strong>of __<\/strong><\/td>\n<td><strong>ratio<\/strong><\/td>\n<\/tr>\n<tr>\n<td><span style=\"color: #ff0000;\">Giuliani<\/span><\/td>\n<td><span style=\"color: #ff0000;\">1.14M<\/span><\/td>\n<td><span style=\"color: #ff0000;\">140K<\/span><\/td>\n<td><span style=\"color: #ff0000;\">8.14<\/span><\/td>\n<\/tr>\n<tr>\n<td><span style=\"color: #ff0000;\">McCain<\/span><\/td>\n<td><span style=\"color: #ff0000;\">23.6M<\/span><\/td>\n<td><span style=\"color: #ff0000;\">4.42M<\/span><\/td>\n<td><span style=\"color: #ff0000;\">5.34<\/span><\/td>\n<\/tr>\n<tr>\n<td><span style=\"color: #ff0000;\">Clinton<\/span><\/td>\n<td><span style=\"color: #ff0000;\">11.6M<\/span><\/td>\n<td><span style=\"color: #ff0000;\">2.81M<\/span><\/td>\n<td><span style=\"color: #ff0000;\">4.13<\/span><\/td>\n<\/tr>\n<tr>\n<td><span style=\"color: #ff0000;\">Obama<\/span><\/td>\n<td><span style=\"color: #ff0000;\">26M<\/span><\/td>\n<td><span style=\"color: #ff0000;\">7.6M<\/span><\/td>\n<td><span style=\"color: #ff0000;\">3.42<\/span><\/td>\n<\/tr>\n<tr>\n<td>Apple<\/td>\n<td>22.6M<\/td>\n<td>9.39M<\/td>\n<td>2.41<\/td>\n<\/tr>\n<tr>\n<td>IBM<\/td>\n<td>6.97M<\/td>\n<td>4.03M<\/td>\n<td>1.73<\/td>\n<\/tr>\n<tr>\n<td>Microsoft<\/td>\n<td>35.5M<\/td>\n<td>21.3M<\/td>\n<td>1.67<\/td>\n<\/tr>\n<tr>\n<td>Google<\/td>\n<td>17M<\/td>\n<td>13.4M<\/td>\n<td>1.27<\/td>\n<\/tr>\n<tr>\n<td><span style=\"color: #0000ff;\">America<\/span><\/td>\n<td><span style=\"color: #0000ff;\">113M<\/span><\/td>\n<td><span style=\"color: #0000ff;\">131M<\/span><\/td>\n<td><span style=\"color: #0000ff;\">0.863<\/span><\/td>\n<\/tr>\n<tr>\n<td><span style=\"color: #0000ff;\">Canada<\/span><\/td>\n<td><span style=\"color: #0000ff;\">26.8M<\/span><\/td>\n<td><span style=\"color: #0000ff;\">60.5M<\/span><\/td>\n<td><span style=\"color: #0000ff;\">0.443<\/span><\/td>\n<\/tr>\n<tr>\n<td><span style=\"color: #0000ff;\">Thailand<\/span><\/td>\n<td><span style=\"color: #0000ff;\">3.96M<\/span><\/td>\n<td><span style=\"color: #0000ff;\">11.8M<\/span><\/td>\n<td><span style=\"color: #0000ff;\">0.336<\/span><\/td>\n<\/tr>\n<tr>\n<td><span style=\"color: #0000ff;\">England<\/span><\/td>\n<td><span style=\"color: #0000ff;\">10.9M<\/span><\/td>\n<td><span style=\"color: #0000ff;\">48M<\/span><\/td>\n<td><span style=\"color: #0000ff;\">0.227<\/span><\/td>\n<\/tr>\n<tr>\n<td><span style=\"color: #0000ff;\">Belgium<\/span><\/td>\n<td><span style=\"color: #0000ff;\">799K<\/span><\/td>\n<td><span style=\"color: #0000ff;\">6.31M<\/span><\/td>\n<td><span style=\"color: #0000ff;\">0.127<\/span><\/td>\n<\/tr>\n<tr>\n<td><span style=\"color: #008800;\">lithium<\/span><\/td>\n<td><span style=\"color: #008800;\">60.7K<\/span><\/td>\n<td><span style=\"color: #008800;\">1.73M<\/span><\/td>\n<td><span style=\"color: #008800;\">0.035<\/span><\/td>\n<\/tr>\n<tr>\n<td><span style=\"color: #008800;\">arsenic<\/span><\/td>\n<td><span style=\"color: #008800;\">21.7K<\/span><\/td>\n<td><span style=\"color: #008800;\">1.19M<\/span><\/td>\n<td><span style=\"color: #008800;\">0.018<\/span><\/td>\n<\/tr>\n<tr>\n<td><span style=\"color: #008800;\">silicon<\/span><\/td>\n<td><span style=\"color: #008800;\">50.9K<\/span><\/td>\n<td><span style=\"color: #008800;\">5.93M<\/span><\/td>\n<td><span style=\"color: #008800;\">0.0086<\/span><\/td>\n<\/tr>\n<tr>\n<td><span style=\"color: #008800;\">hydrogen<\/span><\/td>\n<td><span style=\"color: #008800;\">45.3K<\/span><\/td>\n<td><span style=\"color: #008800;\">9.44M<\/span><\/td>\n<td><span style=\"color: #008800;\">0.0048<\/span><\/td>\n<\/tr>\n<tr>\n<td><span style=\"color: #008800;\">cadmium<\/span><\/td>\n<td><span style=\"color: #008800;\">4.01K<\/span><\/td>\n<td><span style=\"color: #008800;\">2.2M<\/span><\/td>\n<td><span style=\"color: #008800;\">0.0018<\/span><\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>The first thing to observe is that the four categories are non-overlapping. (I'm sure that if you added more names, you'd discover some overlaps; but still.)<\/p>\n<p>Within categories, the ordering makes a certain amount of sense. Lithium is a drug as well as an element, which makes it partly a member of another class that might be expected to rank higher on this scale (whatever this scale is, exactly). Arsenic is a poison. And it's not a surprise that America comes out as apparently the most animate of the countries.<\/p>\n<p>The ranking of the politicians and the IT firms puzzled me a bit at first. But then I conjectured that perhaps the scale is measuring a sort of unpredictable agency &#8212; what you might call the \"Maverick factor\".\u00a0 Maybe by changing all their connectors every 18 months, and building laptops that <a href=\"http:\/\/forums.macrumors.com\/showthread.php?t=346649\">freeze up<\/a> every other time you plug them into a projector, Apple gains in (this measure) of animacy, just as a cantankerous old car comes to seem more alive every time you have to beg it to start. On this theory Google and Obama, by being more reliable, seem a bit less agentive.<\/p>\n<p>[Of course, there are many other factors that could (and doubtless did) skew the counts enough to change the genitive-ratio rankings &#8212; for example, \"Google\" is often used as a modifier, in phrases like \"Google Analytics\" or \"Google News\", and as a result, strings like \"new version of Google Analytics\" increase the count of the \"google of\" search. Still, it's nice that the ratio held up in a semi-sensible way over 3-4 orders of magnitude.]<\/p>\n<\/div>\n","protected":false},"excerpt":{"rendered":"<p>For the last couple of days, I've been in Chicago at an NSF-sponsored workshop on \"animacy and information status annotation\", organized by Annie Zaenen, Cathy O'Connor and Gregory Ward. A traditional and characteristic example of the role of animacy in English syntax is the way it affects the choice between the two ways of expressing [&hellip;]<\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_exactmetrics_skip_tracking":false,"_exactmetrics_sitenote_active":false,"_exactmetrics_sitenote_note":"","_exactmetrics_sitenote_category":0,"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[19],"tags":[],"class_list":["post-646","post","type-post","status-publish","format-standard","hentry","category-semantics"],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/languagelog.ldc.upenn.edu\/nll\/index.php?rest_route=\/wp\/v2\/posts\/646","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/languagelog.ldc.upenn.edu\/nll\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/languagelog.ldc.upenn.edu\/nll\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/languagelog.ldc.upenn.edu\/nll\/index.php?rest_route=\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/languagelog.ldc.upenn.edu\/nll\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=646"}],"version-history":[{"count":0,"href":"https:\/\/languagelog.ldc.upenn.edu\/nll\/index.php?rest_route=\/wp\/v2\/posts\/646\/revisions"}],"wp:attachment":[{"href":"https:\/\/languagelog.ldc.upenn.edu\/nll\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=646"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/languagelog.ldc.upenn.edu\/nll\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=646"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/languagelog.ldc.upenn.edu\/nll\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=646"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}