In [1]:
import pandas as pd
In [3]:
# Raw 1.3 GB file downloaded from Chicago's data portal.
# To get the latest copy, go to
# https://data.cityofchicago.org/Public-Safety/Crimes-2001-to-present/ijzp-q8t2
# and export the dataset as CSV.
raw_crimes = pd.read_csv('../data/Crimes_-_2001_to_present.csv')
In [4]:
crime_type_set = set(raw_crimes['Primary Type'])
crime_type_set
Out[4]:
From here, I dumped the crime types into crime_bins.csv and categorized them by hand, using this breakdown as a guide.
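The dump step is not shown in the notebook, so here is a minimal sketch of one way it could look. The helper `build_bins_table` and the column names `primary_type` and `bin` are hypothetical, not the actual layout of crime_bins.csv, and a small stand-in Series replaces `raw_crimes['Primary Type']` so the cell runs without the 1.3 GB file.

```python
import pandas as pd

# Stand-in for raw_crimes['Primary Type']; the real column comes from the full CSV.
primary_types = pd.Series(["THEFT", "BATTERY", "THEFT", "NARCOTICS", "BATTERY"])


def build_bins_table(types):
    """Return one row per distinct crime type plus an empty column to fill in by hand.

    The column names are assumptions made for this sketch.
    """
    unique_types = sorted(set(types.dropna()))
    return pd.DataFrame({"primary_type": unique_types, "bin": ""})


bins = build_bins_table(primary_types)

# Writing the file is left commented out; the path is an assumption.
# bins.to_csv("crime_bins.csv", index=False)
print(bins)
```

Sorting the types keeps the file stable between exports, which makes it easier to spot any new type that still needs a bin when the data is refreshed.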