Archive for Linguistics in the comics

Perils of topic modeling

Today's xkcd illustrates why topic modeling can be tricky, for people as well as for machines:

The mouseover title: "As the 'exotic animals in homemade aprons hosting baking shows' YouTube craze reached its peak in March 2020, Andrew Cuomo announced he was replacing the Statue of Liberty with a bronze pangolin in a chef's hat."

Read the rest of this entry »

Comments (11)

Learning empiricism

Comments (20)

New word of the week

Comments (15)

Cumulative punctuation

Comments (34)

Barstool punctuation

Comments (10)

The Pythagorean catastrophe

Comments (7)

Budgets on Mars

Comments (1)

Security guaranteed by the laws of physics

Comments (2)

The standard deduckling

Comments (5)

Alignment charts and other low-dimensional visualizations

Comments (12)

The League of Disappointing Authors

Comments (133)

New Years party themes

Today's xkcd:

The mouseover title: ""Off-by-one errors" isn't the easiest theme to build a party around, but I've seen worse."

Read the rest of this entry »

Comments (10)

Long ago, in a narratology far away…

Louisa Shepard, "‘May the force be with you’ and other fan fiction favorites", Penn Today 12/18/2019:

Starting with Star Wars, Penn researchers create a unique digital humanities tool to analyze the most popular phrases and character connections in fan fiction. […]

The Penn team started with the script of “Star Wars: The Force Awakens” and created algorithms to analyze the words in the script against those in millions of fan fiction stories. The unique program identifies the most popular phrases, characters, scenes, and connections that are repurposed by these writers and then displays them in a simple graph format.

The results are now available on their “fan engagement meter” at https://fanengagement.org.

Serendipitously, today's xkcd:

Read the rest of this entry »

Comments (1)