Obituary for Fred Jelinek at Computational Linguistics

Back on September 15, when I posted the news of Fred Jelinek's death, I promised to say more when I'd had a chance to think about it. Then, a few days later, Robert Dale asked me to write an obituary for Fred to be published in the Computational Linguistics journal. The December 2010 issue is now out, and Fred's obituary is here.

Following Robert's suggestion, I aimed at a broad assessment of Fred's impact on the field, since CL recently published Fred's own detailed account of his professional life ("The Dawn of Statistical ASR and MT", CL 35(4):483-494 , 2009).

Comments (13)


Help! I'm trapped in a ???

For the past couple of weeks, I've been getting a bunch of curious email messages that start like these:

Thank you for contacting the comics and features department at The Washington Post.  Even though this is an automatic reply to inform you that we have received your comment, we still want you to know that we read every comment individually.

Thank you for contacting the Death Notices Advertising Department of the Washington Post and allowing us to serve you.  Your email has been received.  Listed below you will find general and required information that you may find useful.

Every day I get four or five similar acknowledgments from the comics department or the death notices department over at the Washington Post, although I've never sent any messages to either entity, or to any other WaPo address.

Read the rest of this entry »

Comments (10)


'The' culture war

As we've discussed from time to time, some English proper names take a definite article ("the Times", "the Bronx") and others don't ("Language Log", "Brooklyn"). The public transport system in Boston is called "the T"; the public transport system in Philadelphia is called "SEPTA".

But sometimes, the same name for the same (in some sense) entity gets a definite article in one speech community, and not in another. Apparently people in the Los Angeles area generally use definite articles with freeway numbers ("the 101", "the 405"), although people elsewhere in the U.S. generally don't. (See Language Hat, "'The' + Freeway", 8/1/2010, for some discussion and scholarly references.)

Yesterday, JC Dill sent in the picture on the right, along with an interesting sociolinguistic commentary:

As you may know, there's a war of definite articles between San Francisco (SF Bay Area aka SFBA) and Los Angeles (SoCal).  In the SF Bay Area we talk about taking 101 to San Jose, in SoCal they talk about taking the 101 to Ventura.

So it was with some surprise that I saw the Bank of America (formerly Bank of Italy, a SF company) ad in a MUNI bus stop today.  Clearly this company has lost their SF roots.

Read the rest of this entry »

Comments (130)


Comprehend this!

Perhaps the most illiterate phishing spam yet: ignoring the incompetence of having Velez Restrepo as the sender, jg_van88 (at a Chinese address) as the reply-to, and Mr(.) John Galvan as the alleged sender, with the X-Accept-Language set to Spanish, this message has at least 20 linguistic errors in the text, which is roughly one for each four words.

From gvelez@une.net.co
Wed Dec 15 11:11:57 2010
Date: Wed, 15 Dec 2010 03:11:43 -0800
From: velez restrepo guillermo <gvelez@une.net.co>
Subject: Comprehend This Proposal
Bcc:
Reply-to: jg_van88@w.cn
X-Mailer: Sun Java(tm) System Messenger Express 7.3-11.01 64bit (built Sep 1 2009)
X-Accept-Language: es
Priority: normal

Good day,

I am Mr John Galvan a staff of a private offshore AIG Private bank united kingdom.

I have a great proposal that we interest and benefit you, this proposal of mine is worth of £15,500,000.00 Million Pounds.I intend to give Four thy Percent of the total funds as compensation for your assistance. I will notify you on the full transaction on receipt of your response if interested, and I shall send you the details.

Kind Regards,
Mr. John Galvan

Read the rest of this entry »

Comments (72)


Words and things

Comments (46)


Language and Thought at the Economist

A new motion is open for debate today in the Economist's online series: "This house believes that the language we speak shapes how we think".  Lera Boroditsky is the designated defender of the motion, and I was recruited to be the designated opponent.

In this format, each side submits an opening statement, a rebuttal, and a closing statement. Readers get to comment, and also to vote on the motion. Our opening statements are now live.

Read the rest of this entry »

Comments (70)


Disintermediating the dustbin

Saturday's Dilbert:

Digital media offer wonderful opportunities for the study of language, communication, and culture. So despite short-term problems, both internal and external, I'm optimistic.

Comments (4)


"Rice positivists" vs. "contextualized popular epistemologies"

In discussing yesterday's post on the American Anthropological Association's removal of (the word) science from its long-range plan, several commenters were puzzled or skeptical about the foundations of the debate. One wrote:

The depiction of the different sides – both in the NYT article, the earlier Higher Ed article, and by many of the proponents – strike me as utter nonsense. Despite working in the field, I've yet to meet one of these anti-science cultural anthropologists.

And another wrote:

I can't quite grasp an academic yet non-scientific way to study these things.

It may help to recognize that the "anti-science cultural anthropologists" have traditionally framed their views as an opposition to "positivism".

Read the rest of this entry »

Comments (71)


"Marginalization is never a welcome experience"

From Nicholas Wade, "Anthropology a Science? Statement Deepens a Rift", NYT 12/9/2010:

Anthropologists have been thrown into turmoil about the nature and future of their profession after a decision by the American Anthropological Association at its recent annual meeting to strip the word “science” from a statement of its long-range plan.

The decision has reopened a long-simmering tension between researchers in science-based anthropological disciplines — including archaeologists, physical anthropologists and some cultural anthropologists — and members of the profession who study race, ethnicity and gender and see themselves as advocates for native peoples or human rights. […]

The association’s president, Virginia Dominguez of the University of Illinois, said in an e-mail that the word had been dropped because the board sought to include anthropologists who do not locate their work within the sciences, as well as those who do. She said the new statement could be modified if the board received any good suggestions for doing so.

Read the rest of this entry »

Comments (33)


X much

A couple of days ago, Language Hat hosted an interesting discussion of the "X much?" construction:

I have been asked about the history of the construction "X much?" as a rhetorical response (e.g., "Bitter much? Overanalzye much? Ad hominem much?").

Read the rest of this entry »

Comments (35)


Indo-European borrowing

The abstract of Shijulal Nelson-Sathi et al., "Networks uncover hidden lexical borrowing in Indo-European language evolution", Proc. Roy. Soc. B, published online 11/24/2010:

Language evolution is traditionally described in terms of family trees with ancestral languages splitting into descendent languages. However, it has long been recognized that language evolution also entails horizontal components, most commonly through lexical borrowing. For example, the English language was heavily influenced by Old Norse and Old French; eight per cent of its basic vocabulary is borrowed. Borrowing is a distinctly non-tree-like process—akin to horizontal gene transfer in genome evolution—that cannot be recovered by phylogenetic trees. Here, we infer the frequency of hidden borrowing among 2346 cognates (etymologically related words) of basic vocabulary distributed across 84 Indo-European languages. The dataset includes 124 (5%) known borrowings. Applying the uniformitarian principle to inventory dynamics in past and present basic vocabularies, we find that 1373 (61%) of the cognates have been affected by borrowing during their history. Our approach correctly identified 117 (94%) known borrowings. Reconstructed phylogenetic networks that capture both vertical and horizontal components of evolutionary history reveal that, on average, eight per cent of the words of basic vocabulary in each Indo-European language were involved in borrowing during evolution. Basic vocabulary is often assumed to be relatively resistant to borrowing. Our results indicate that the impact of borrowing is far more widespread than previously thought.

Read the rest of this entry »

Comments (45)


400 words for "your cute friend is next"?

Adding, ironically, to our "words for X" file, Scott Adams at the Dilbert Blog writes:

Here's a list of three things that you are unlikely to do, at least in this order:

1.       Watch a Swedish movie called The Girl with the Dragon Tattoo
2.       Read about the Swedish sex charges against Julian Assange
3.       Book a vacation to Sweden

I am always amused by the strange impact of unintended consequences. Julian Assange simply wanted to release some embarrassing information, have hot sex with a Swedish babe then have hot sex with an acquaintance of that same babe one day later. That's just one example of why the Swedish language has 400 words that all mean "and your cute friend is next."

But things didn't turn out as Assange hoped.  The unintended consequence of his actions is that he managed to make Sweden look like a country that's governed by congenital idiots and populated with nothing but crazy sluts and lawyers. And don't get me started about the quality of their condoms.

Read the rest of this entry »

Comments off


Word Weirding

Rick Rubenstein asks

Has there been any research done on the familiar phenomenon wherein a word which is repeated over and over begins to look misspelled, or even like complete gibberish?

Read the rest of this entry »

Comments (41)