Debate words
As I mentioned a few days ago ("More political text analytics", 4/15/2016), I've now got more-or-less cleaned-up text from the 21 debates held so far in the current U.S. presidential campaign.
[Update — with some help from Chris Culy, I've done additional clean-up on the debate texts, and therefore have revised the numbers in this post slightly, as of 4/23/2016. None of the numbers have changed a lot, and none of the qualitative implications have changed at all.]
If we focus on the contributions to those 21 debates of the five remaining U.S. presidential candidates, we get 199,188 words in total, divided up like this:
Clinton | 56,989 |
Sanders | 50,649 |
Trump | 41,039 |
Cruz | 32,654 |
Kasich | 28,772 |
This morning I'll add a few small examples of the kind of information that can be derived from a dataset of this type.
Read the rest of this entry »