Automatic classification of g-dropping

Over at headsup: the blog, fev recently pondered variation in transcription practice ("Annals of g-droppin'", 6/6/2011).  He starts by noting that the same paper edited the same quote, in the same AP story, to have -in' in some but not all gerund-participles in one version, but -ing throughout in another version.  And his main concern is with the socio-political subtext of the choice to use eye-dialect in some cases and not in others:

It's worth puttin' the question to the AP (and/or your own political writers). What exactly are you trying to show about Palin's speech, and how consistent can you credibly claim to be about it — either within a single sentence of hers or among candidates who may have those or other speech features despite their necktie-wearin' formality?

Read the rest of this entry »

Comments (27)


Penultimate

Sometimes more is less. In Frazz for 6/6/2011, Caulfield is frustrated that Mrs. Olsen is among those who think that the first syllable of penultimate is some kind of emphatic particle:

Read the rest of this entry »

Comments (78)


Inventory of libfix postings

(and related material), assembled on my blog, here.

Comments (1)


Trod

Marc Lacey, "As Arizona Fire Rages, Officials Seek Its Cause", NYT 6/11/2011:

Deep within the burn zone, while trying to extinguish the more than 600-square mile Wallow Fire, firefighters have taken care not to trod on two small areas in the Bear Wallow Wilderness where smoke and flames were first spotted a mile or more apart on May 29. Those two fires quickly merged into one big, unruly, runaway blaze that eludes containment nearly two weeks later. (emphasis added)

In standard formal English, I believe, that should be "to tread on"; "trod" is the past tense form, with the past participle being "trodden" or "trod". (The editors of the New York Times agree with me, apparently, since the online article has now been changed to read "tread on" — so here is a screen shot of the original.)

Read the rest of this entry »

Comments (36)


Hockey in Punjabi

Hockey is not a popular sport in the Punjab but it is THE sport in Canada, which now has a large Punjabi population. An interesting example of cultural integration is the fact that CBC Sports now has hockey commentary in Punjabi.

Comments (40)


Max Mathews and his influence

R. Luke Dubois, "The first computer musician", NYT 6/9/2011:

If the difference between 1911 and 2011 is electricity and computation, then Max Mathews is one of the five most important musicians of the 20th Century. – Miller Puckette

Read the rest of this entry »

Comments (2)


The Navy SEALs of snowclones

David Craig writes:

I saw a post on Facebook declaring Chanticleer to be the Navy SEALs of music and got to wondering how far this snowclone had taken off.  A quick googling on "the navy seals of" and, in the first ten hits three were undeniable snowclones where "the Navy SEALs of X" had X filled with "the NFL", "cars", and "seahorses".  I can't convince my boss that future research into this is in the interests of my company so I'm passing the job on to you.

Read the rest of this entry »

Comments (17)


Gelman and Picasso

I very much enjoyed Andrew Gelman's post "Bayesian statistical pragmatism" (4/15/2011) on his blog Statistical Modeling, Causal Inference, and Social Science. And one aspect of that post struck me as especially relevant to some recent LL discussions:

I am surprised to see Kass write that scientists believe that the theoretical and real worlds are aligned. It is from acknowledging the discrepancies between these worlds that we can (a) feel free to make assumptions without being paralyzed by fear of making mistakes, and (b) feel free to check the fit of our models (those hypothesis tests again! Although I prefer graphical model checks, supplanted by p-values as necessary). All models are false, etc.

I assume that Kass is using the word "aligned" in a loose sense, to imply that scientists believe that their models are appropriate to reality even if not fully correct. But I would not even want to go that far. Often in my own applied work I have used models that have clear flaws, models that are at best "phenomenological" in the sense of fitting the data rather than corresponding to underlying processes of interest–and often such models don't fit the data so well either. But these models can still be useful: they are still a part of statistics and even a part of science (to the extent that science includes data collection and description as well as deep theories).

Read the rest of this entry »

Comments (9)


Speech-based lie detection in Russia

Andrew E. Kramer, "Russian A.T.M. With an Ear for the Truth", NYT 6/8/2011:

Russia’s biggest retail bank is testing a machine that the old K.G.B. might have loved, an A.T.M. with a built-in lie detector intended to prevent consumer credit fraud.

Consumers with no previous relationship with the bank could talk to the machine to apply for a credit card, with no human intervention required on the bank’s end.

The machine scans a passport, records fingerprints and takes a three-dimensional scan for facial recognition. And it uses voice-analysis software to help assess whether the person is truthfully answering questions that include “Are you employed?” and “At this moment, do you have any other outstanding loans?”

The voice-analysis system was developed by the Speech Technology Center, a company whose other big clients include the Federal Security Service — the Russian domestic intelligence agency descended from the Soviet K.G.B.

Dmitri V. Dyrmovsky, director of the center’s Moscow offices, said the new system was designed in part by sampling Russian law enforcement databases of recorded voices of people found to be lying during police interrogations.

Read the rest of this entry »

Comments (12)


Working together in all multilateral orifices

A couple of months ago, an Italian friend brought to my attention a quirk of Italian usage. Like English, Italian has adopted the Latin word forum to mean "a place for public discussion".  In English, as usual with borrowed Latin words, the plural is sometimes "fora" (the original Latin plural) and sometimes "forums" (the regular English form). In Italian, borrowed words are often treated as invariabile ("invariant"), so that the plural is the same as the singular. For forum this yields the plural "forum", and (for example) "in tutti i forum" (meaning "in all forums") is common on the web.

But there are also two Italian words foro, plural "fori": one means "hole, opening", and the other means "court, tribunal".

Read the rest of this entry »

Comments (33)


Little words

Molly Ireland, Richard Slatcher, Paul Eastwick, Lauren Scissors, Eli Finkel, and James Pennebaker, "Language Style Matching Predicts Relationship Initiation and Stability", Psychological Science 12/13/2010.

Previous relationship research has largely ignored the importance of similarity in how people talk with one another. Using natural language samples, we investigated whether similarity in dyads’ use of function words, called language style matching   (LSM), predicts outcomes for romantic relationships. In Study 1, greater LSM in transcripts of 40 speed dates predicted increased   likelihood of mutual romantic interest (odds ratio = 3.05). Overall, 33.3% of pairs with LSM above the median mutually desired   future contact, compared with 9.1% of pairs with LSM at or below the median. In Study 2, LSM in 86 couples’ instant messages   positively predicted relationship stability at a 3-month follow-up (odds ratio = 1.95). Specifically, 76.7% of couples with LSM   greater than the median were still dating at the follow-up, compared with 53.5% of couples with LSM at or below the median.   LSM appears to reflect implicit interpersonal processes central to romantic relationships.

This is interesting stuff, in my opinion. But so far, it's gotten relatively little uptake in the popular press. And in one of the few — rather obscure — pieces to mention this research, it was spun as a guide to dating success.

Read the rest of this entry »

Comments (29)


What caused the texting tsunami?

Comments (35)


More on the history of comprised of meaning "composed of"

Following up on my post "Counterfeit cultural capital" (5/11/2011), David Russinoff sent some additional information about the early history of expressions like "angles comprised of equal right lines" in English translations of Euclid.  I reproduce his note in full below, in order to make his efforts available to other interested scholars, while adding a warning to others that this may go a bit deep in the historical-lexicography weeds even for hardened LL readers.

Read the rest of this entry »

Comments (17)