Comments on: "They called for more structure"

By: David J. Littleboy

David J. Littleboy — Wed, 25 Feb 2015 23:25:23 +0000

"It seemed (at least with 20/20 hindsight on top of admittedly fuzzy recollection) like the model of natural language being used for examples in the AI class was, if not straight up generative semantics, ..." That's exactly right. We thought the wrong side won the linguistics wars. (Basically, Chomsky thought that it wouldn't be possible to deal with meaning in language scientifically, so meaning had to be ignored. The scruffy AI party line was that language is about meaning, so was the central concern, so we ended up reinventing generative semantics.) I took an intro course to linguistics from a generative semantics type (who posts here occasionally (hi, Larry!)) back then, and nearly everything he'd say would strike me as ridiculous, and I'd say so, and he'd say "you're right, but I'm teaching intro transformational, not generative semantics." The textbook used had one of the most egregious examples of academic dishonesty I had ever seen. As an exercise to show how useful phrase structure grammar was, it used fudged data (they used formal sentences when the ordinary forms wouldn't fit the phrase structure they were putting together) from Japanese. ROFL. But I learned to never, ever, even think of believing a linguist or anthropologist who tells you something about a language you don't speak. And that rule has proved correct several times over the last few years.

By: Mara K

Mara K — Wed, 25 Feb 2015 18:13:48 +0000

I don't even know what linguists have or haven't jettisoned, because my graduate syntax professor last semester insisted on teaching us the Chomskyan way. I took issue with the existence of covert movement and had an epic argument in with her in the last month of class about whether anything that happened after Spellout was really syntax (I argued it should be semantics or pragmatics). Her response: "Sure, but let's do it this way for the sake of the argument." Argh! tl;dr Where can I, as a graduate student who expected to learn about cutting-edge developments in theoretical linguistics, go to learn about anything that's happened in linguistics since Minimalism? Someone please recommend papers.

By: J. W. Brewer

J. W. Brewer — Wed, 25 Feb 2015 16:40:30 +0000

Schank was very big name around campus (at least if you spent any time socializing with with dweeby people interested in computer-related stuff) when I was an undergrad back in the '80's, and I recall hearing at the time (and wikipedia confirms it) that (understandably, since he was of the generation when "computer science" wasn't a thing that you could have gotten a Ph.D. in) his own doctorate was in linguistics, rather than in the more-typical-for-CS math or applied math or EE. However, I think my prior comment was inspired in part by memories of taking an AI-for-non-CS-majors class (not with Schank but with a younger colleague) the same semester I was taking a class on then-orthodox (but now badly out of date) transformational syntax. It seemed (at least with 20/20 hindsight on top of admittedly fuzzy recollection) like the model of natural language being used for examples in the AI class was, if not straight up generative semantics, at least heavily reliant on some very naive version of deep structure that the orthodox Chomskyans had already jettisoned a decade or so earlier (although they had not yet jettisoned the whole D v. S distinction yet). Although I guess it could be argued that getting a computer to simulate fluency in a highly impoverished version of a natural language (like the simplification characteristic of pidgins, but more so) that was simple and orderly enough to be accurately modeled by early/naive/superseded Chomskyanism would still have been a massively impressive accomplishment.

By: David J. Littleboy

David J. Littleboy — Wed, 25 Feb 2015 14:12:57 +0000

"A scruffy-AI researcher may want to enrich the current system to make more use of syntax, but will be perfectly happy to use a "big hairy four-by-four" approximation of syntax that is nailed onto the rest of the system with railroad spikes. The goal is to improve the end results by any expedient method." Hmm. That's not what I take "scruffy" to mean. "Scruffy" means having a cognitive theory of how people do things and attempting to implement that theory. Neat AI is more "scruffy" in your sense. In my sense of "scruffy", if you asked someone to name every museum they'd ever visited, they'd be slow and forget some. A "neat" AI program to respond to that question would be a simple database query, would be fast, and would never forget a museum. A program that had to work to justify traversing a given link (and had trouble finding links that might get to museum memories in the first place) would be slow, make mistakes, and get you a PhD from Schank in the early 1980s. Neat AI is about persuading a computer to do something impressive with no concern for whether or not people do it that way. E.g. Deep Blue, corpora-based MT, contemporary "machine learning" stuff. And pretty much everything else in AI for the last 25 years. To the best I can tell, no one's doing scruffy AI any more. Or at least that's what the terms look like to the average bloke who's passed the AI quals under Roger Schank. (Note that there's some argument as to whether neural network models are neat or scruffy. I take them to be neat in the extreme, because despite being vaguely reminiscent of tangles of neurons, they're based on praying that intelligence will emerge from doing the same stupid thing over and over again in parallel with no model of what intelligence is. Folks who like neural networks think they're modeling brains. Go figure.)

By: Mara K

Mara K — Tue, 24 Feb 2015 03:18:32 +0000

@zizoz Dictionary.com says a fastigium is "the highest point of a fever or disease; the period of greatest development of an infection." This makes me think the city itself is a disease, and the moment of its completion is the peak of the infection.

By: Zizoz

Zizoz — Tue, 24 Feb 2015 02:58:19 +0000

What is the significance of the word "fastigium"? It seems to mean "gable", which is of no help to me...

By: J. W. Brewer

J. W. Brewer — Mon, 23 Feb 2015 18:41:16 +0000

Mara K.: I don't know enough about either MT or the history of Esperanto to say, but that certainly seems like a *plausible* intuition. And perhaps as with Esperanto, once people actually started living in Brasilia they came up with various spontaneous/improvisational ways to make it more livable in practice than it would have been had the planners' aridly rationalistic vision not been diluted in that way. Although probably there's a difference in degree because using Esperanto in the first place probably self-selects for basic sympathy with the planners' original vision in a way that living in Brasilia probably doesn't. I think some of the early "neat AI" failures at getting computers to deal with natural language were trying to construct software around academic approaches to language (e.g. generative semantics) that have themselves subsequently fallen out of favor in linguistics departments. But the MT business may still be scruffy enough that inability-to-be-implemented-via-neat-AI may not be a good way to tell better academic theories of language from worse ones, because even the better ones may not (yet, at least) be susceptible of notably successful implementation in that context.

By: John Lawler

John Lawler — Mon, 23 Feb 2015 17:43:38 +0000

A somewhat dated (ca 1998) account of the two CL/NLP approaches -- which were quite distinct and even antagonistic at the time -- can be found in the last two chapters of Using Computers in Linguistics. The first of these, by Jim Hoard (then with Boeing), is definitely the neat approach, and its title heralds the present synthesis. The second one, by Sam Bayer and his group at MITRE, is about how far you can get picking low-hanging fruit, and how you can build ladders. Plus it has a rather nice summary of the history of the field.

By: Mara K

Mara K — Mon, 23 Feb 2015 16:59:18 +0000

@J.W the problem with "rationally-designed" languages is that once real people start using them they evolve and become messy and unplanned. Here is my intuition: MT between Loglan and the original Esperanto would probably be easier; MT between Lojban and today's Esperanto might be easier than, say, MT between French and Mandarin, which have both been evolving in unplanned ways for thousands of years, but not easy. Is this a good/correct intuition?

By: J.W. Brewer

J.W. Brewer — Mon, 23 Feb 2015 12:26:02 +0000

The failures and inhumane brutality of planned utopias like Brasilia and Chandigarh are contrasted with the unplanned but well-functioning complexity of natural language in James Scott's interesting book Seeing Like a State. From which I wonder if it follows that MT between two artificial rationally-designed languages (Esperanto to Lojban, or whatever) would be easier to implement via "neat" AI?

By: Mara K

Mara K — Mon, 23 Feb 2015 07:13:27 +0000

@ethan *looks up "fastigium"* It sounds to me more like the city is a disease. What does that say about machine translation?

By: Ethan

Ethan — Mon, 23 Feb 2015 07:01:40 +0000

@MaraK: "Fastigium" is not a nonsense word. Its meaning may be unknown to the narrator, but I take it either as irony on the part of the unknown city designers or a meta comment that breaks the fourth wall of the vignette and addresses the reader directly. To me it adds a connotation that the whole passage is sort of a fever dream. I leave it to others to speculate how much this contributed to Kevin Knight's choice of analogy.

By: Mara K

Mara K — Mon, 23 Feb 2015 05:02:00 +0000

@Jason this makes sense. Now what about the part where we discover that the city has a plan behind it, but that that plan is based around a nonsense word that simply looked pretty to the designers?