Thursday, August 04, 2016

Does Twitter Tell Who Will Be the Next President? Comparing Donald Trump’s and Hillary Clinton’s Twitter Influence

Over the last year Donald Trump has been doing a brilliant job kindling his initially highly unlikely candidacy as US Presidential applicant. Following the principle that there is no good or bad PR, that any news is good news as long as it is in the news, he has acted as a master provocateur. He has been a genius in hitting the soft spots of US society, constantly provoking increasingly broader parts of society with over 32,000 (and rapidly growing) racist, sexist, and religiously offensive tweets.

I was curious to see how his Twitter behavior would compare with the articulations on Twitter by his Democrat competitor for the job, Hillary Clinton. Therefore I used Condor’s Twitter EgoFetcher (thanks, Joao, for coming up with the idea) to collect the most recent 10,000 tweets about each candidate on August 4, 2016 at 10.00AM.  The EgoFetcher works in four steps: in step 1 it takes the last N (for example 10,000) tweets about the search term or Twitter handle (e.g. “Donald Trump”). Note that normally – except for an individual’s own tweets -  the  search API of Twitter only returns last week’s tweets.  This is not a problem for this search, as people send thousands of tweets about each candidate per hour. In step 2 it constructs a network with a link from twitterer B to twitterer A if B retweets A, or B mentions A in a tweet. In step 3 it takes the timelines of the 480 most influential people in the search results, the influence of these people is measured through their degree in the retweet network from step 2. For twitter users, their timeline is all their tweets, sorted from newest to oldest. In step 4 it adds for each tweet collected in the previous steps the first 100 retweets.


The picture below shows the combined Twitter network, with the tweets about Donald in yellow, and about Hillary in green.


When combining the search results of the queries for “Donald Trump” and “Hillary Clinton”, and measuring their betweenness centrality, which in social network analysis is commonly taken as a metric of influence, Donald easily beats Hillary. The pie chart below shows the betweenness centralities of both the search queries for the candidates, as well as the betweenness of their Twitter personalities (realDonaldTrump and HillaryClinton).



So will Donald become the next president? Not so fast.

When looking at each of the network separately, a different picture emerges. The picture below shows the network for Hillary at left, and for Donald at right. The green twitterers are the ones returned in the original search about the search term (“Hillary Clinton” or “Donald Trump”). The yellow twitterers are the people tweeting about the most influential 480 twitterers among the green people.


The first thing we note is that not only is the timeline crowd of Hillary more numerous than Donald’s (18,363 people retweeting about Hillary compared to 17,295 for Donald), but they are also much more retweeted (91,681 retweets instead of 85,890).
When calculating the six honest signals of collaboration described for example in my new book manuscript, the picture becomes even more pronounced.

The metrics shown above are calculated using the dynamic social network analysis features of Condor. Activity is the total number of tweets in the EgoFetcher network above originating from each candidate, where Hillary beats Donald. Emotionality, sentiment, and complexity of language are calculated using the machine learning natural language processing features of Condor, based on the language used in the Tweets. There is not much difference here, although Hillary’s fan base uses slightly more complex language, while Donald’s constituency is slightly more positive, which at least might be partially explained by his hashtag #makeAmericaGreatAgain. The large difference is in creativity, passion, and respect. Creativity is measured as average number of changes in network position among all the twitterers, from being central to be peripheral in the network. In earlier work we have shown that the more people rotate in their network position, the more creative they are. Passion is measured as the time it takes a person until he or she responds to a tweet. Respect is measured as the time it takes all other people to respond to her or his tweets. And in these three scores Hillary’s community beats Donald’s handily.

I will be curious to see if increased creativity, passion and respect translates to the election results on November 8.


1 comment:

  1. As I believe evening and late night will be the best times when your tweets will receive the most favourites and retweets, on average.

    ReplyDelete