Weekly Posts about Political Data Science

In 2019, I’m writing weekly posts about how we use techniques of data science to answer questions about politics.

G. Elliott Morris

Jan. 03, 2019

Categories: Rstats, R for Political Data
Tags: R, Tidyverse, Politics, Data Science, Probability, Statistics



Introduction

This is a collection of short posts that answer relevant questions about politics using political data. Most posts are short and sweet, getting at my curiosities for the week in just a few paragraphs and corresponding lines of code. I like to think we’re answering some salient, yet simple, questions here. Because you can show and hide the computer code used for the analyses, I think the posts are pretty accessible to anyone interested in either politics or data science, but you will probably get more out of them if you have an interest in (or tolerance of!) data-related subjects.

Some of you will recall that about a year ago I began a project that attempted a similar task. I got four posts in and then stopped. Why? I wasn’t holding myself accountable. This year, I’ve resolved to embark on a weekly journey to write simple introductions to concepts and datasets (yes, they’ll sometimes be repetitive; data science is too!) that I have found useful as a data journalist and political analyst. My mission is cheesy, I know, but who doesn’t like a good brie? (Stata users, probably.)

For your reference, I have also published my own course about political data analysis in R, titled “Analyzing Polling and Election Data with R” at DataCamp.com. If you’re looking for a more formal and structured guide to learning data science, that’s one place to start.

I am also making all the data and code easily available for you all via GitHub. Don’t be afraid to create issues, make pull requests, etc.

Layout

The organization of this weekly series will be straightforward: there’s no organization. Why does there need to be? Apart from the introductory information on this page, each post should be a standalone answer to a simple question that techniques of data science can help us answer. The posts will be linked together, but don’t think of this as a workshop or short course on R and politics. Again, for that, consult my DataCamp course.

Posts

Without further ado…

Heads up: you can also find a list of all the posts at the “R for Political Data” category page. It might be helpful if I forget to update the contents here.

  1. Week 1: Polarization in the 115th Congress
  2. Week 2: This Early Before 2020, It’s All About Name Recognition
  3. Week 3: How Marginal Tax Rates Work


Don’t forget to check out the source code and data on GitHub!