Difficult dialogues:

communicating data analyses effectively

Jo Hardin, Pomona College

8.5.2020

Land Acknowledgement

With our deepest respects to the Tongva and Serrano Peoples, past, present, and emerging.

http://www.laalmanac.com/history/hi05.php

Stanford Policing Data

https://openpolicing.stanford.edu/

https://openpolicing.stanford.edu/

Stanford Policing Data

Just under 100 datasets, ~1 TB of data. https://openpolicing.stanford.edu/data/

Just under 100 datasets, ~1 TB of data. https://openpolicing.stanford.edu/data/

Why this project?

Step 1: get data

con <- dbConnect(
  MySQL(), host = "XXX", user = "XXX",
  password = "XXX", dbname = "XXX")

raleigh_df <- DBI::dbGetQuery(con, "SELECT * FROM NCraleigh") 

Step 2: data viz

raleigh_df %>%
  
  # remove missing data
  filter(!is.na(sex) & !is.na(race)) %>%
  
  # use group_by and summarize to count number of stops per category
  group_by(sex, race) %>%
  summarize(count = n()) %>%
  ungroup() %>%
  
  # find the percentage of age/race stops
  mutate(percentage = round(prop.table(count), digits = 2)) %>%

Step 2: data viz

… continued

  # plot percentages
  ggplot(mapping = aes(x = sex, y = percentage, 
                       fill = race,
                       label = scales::percent(percentage))) +
  geom_bar(position = "dodge", stat = "identity") +
  
  # adjust labels
  geom_text(position = position_dodge(width = .9),    
            vjust = -0.5,   
            size = 3) + 
  scale_y_continuous(labels = scales::percent) +

  # provide labels
  ggtitle("Race & gender breakdown, % out of total")

Step 2: data viz

Step 2: advanced data viz

Accessing, joining, and wrangling all the datasets.

Step 2: advanced data viz

With facet_geo

Step 3: data viz of model

Language around sensitive data

Data Feminism by Catherine D’Ignazio & Lauren F. Kleinhttps://datafeminism.io/

Data Feminism by Catherine D’Ignazio & Lauren F. Kleinhttps://datafeminism.io/

Bias or Oppression?

Machine Bias ProPublica https://www.propublica.org/article/machine-bias-risk-assessments-in-criminal-sentencing

Machine Bias ProPublica https://www.propublica.org/article/machine-bias-risk-assessments-in-criminal-sentencing

Language around sensitive data

Data Feminism by Catherine D’Ignazio & Lauren F. Klein, https://datafeminism.io/

Data Feminism by Catherine D’Ignazio & Lauren F. Klein, https://datafeminism.io/

Equity vs Equality

Image credit: Interaction Institute for Social Change | Artist: Angus Maguire.

Image credit: Interaction Institute for Social Change | Artist: Angus Maguire.

Traffic stops

What are the systems that have created structural racial discrepancies in the US?

Traffic stops

Our results indicate that police stops and search decisions suffer from persistent racial bias and point to the value of policy interventions to mitigate these disparities.

While the research shows that stops are neither equal nor equitable, a discussion on equity belongs in the conversation around racial discrepancies in traffic stops.

Evidence of racial disparities in traffic stops.

Evidence of racial disparities in traffic stops.

Significance Magazine (August 2020)

Measures for determining racial disparities:

GitHub repo: https://github.com/bakuninpr/traffic-stops-and-racial-disparity

GitHub repo: https://github.com/bakuninpr/traffic-stops-and-racial-disparity

Traffic Stop

Motorcycle with Montana plates vs. Van with Colorado plates, traffic stop on Sunday, Aug 2, 2020.

Family with young children handcuffed.

Family with young children handcuffed.

8am weekly zoom calls

The undergraduates who did the complete analysis and wrote the report are Pomona College students: Amber Lee (‘22), Arm Wonghirundacha (‘22), Emma Godfrey (‘21), Ethan Ong (‘21), Ivy Yuan (‘21), Oliver Chang (‘22), and Will Gray (‘22).

Thank you!