Latent trees
There's been some buzz recently about how syntactic structures are implicit in Large Language Models — most recently, the Liu et al. paper noted yesterday by Victor, and an accepted ms by Futrell and Mahowald at Behavioral and Brain Sciences, "How Linguistics Learned to Stop Worrying and Love the Language Models". Futrell and Mahowald recognize something that Liu et al. mostly ignore, namely that constituent structure is obviously implicit in statistical patterns of sequential data, at least if the sequences were generated by a constituency-sensitive process — and that algorithms taking advantage of that fact have been Out There for 70 years or more.
Read the rest of this entry »