## import libraries
import pandas as pd
import numpy as np

#allows us to show plots in notebook
%matplotlib inline

filename = 'Data/SHEEF2010.txt' #created a variable that it would be easy to call

sheef = pd.read_csv(filename, sep=r'\s+', header=1, index_col='Date')
#loaded the file into the dataframe sheef, skipped the extra first header line, set Date as the index

I used the pandas function read_csv because it can read delimited txt files, not just csv files.

I used \s+ as the separator because it treats any run of white space as the delimiter. Using sep = None caused an error, which showed that the columns were separated by white space.

The file starts with an extra header line that describes the data. It was being read as the column titles, so I skipped it with header=1 and used the second line as the column titles instead.

I wanted the dataframe to be organized by time, so I set the Date column as the index.
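The loading choices described above can be tried on a small in-memory sample. This is only a sketch: the file contents, column names other than Date and MW, and the variable `demo` are made up for illustration and are not taken from SHEEF2010.txt.

```python
import io
import pandas as pd

# Hypothetical file contents: a descriptive first line, then the real header row.
raw = (
    "Catalogue description line (not column names)\n"
    "Date    Lat    Lon    MW\n"
    "2001-03-14   45.1   7.2   2.1\n"
    "2004-07-02   44.8   7.9   3.4\n"
)

# header=1 takes the second line as the column names and skips the line above it.
# sep=r"\s+" treats any run of spaces or tabs as a single delimiter.
demo = pd.read_csv(io.StringIO(raw), sep=r"\s+", header=1, index_col="Date")

print(demo.columns.tolist())
print(demo.index.name)
```

Writing the separator as a raw string (`r"\s+"`) keeps Python from treating `\s` as a string escape.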

sheef_drop = sheef.drop(sheef.columns[5:], axis=1)
#dropped columns with unwanted data

#plotted MW column

[Plot output: MW column plotted against the Date index]
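The plotting call itself does not appear in this export. A minimal sketch of one way to plot a magnitude column against a date index is shown here; the sample data, the `parse_dates=True` option, and the marker style are assumptions and may differ from what was actually run.

```python
import io
import pandas as pd

# Hypothetical stand-in for the catalogue: only Date and MW matter here.
sample = io.StringIO(
    "Date MW\n"
    "2001-03-14 2.1\n"
    "2004-07-02 3.4\n"
    "2009-11-23 4.0\n"
)

# parse_dates=True (an assumption) turns the index into real dates,
# so the x axis is spaced by time rather than by row order.
demo = pd.read_csv(sample, sep=r"\s+", index_col="Date", parse_dates=True)

# Series.plot() puts the index on the x axis and returns a matplotlib Axes.
ax = demo["MW"].plot(style=".")
ax.set_xlabel("Date")
ax.set_ylabel("MW")
```

Plotting points instead of a connected line avoids implying values between separate earthquakes.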

sheef_mag = sheef[sheef.MW >= 3.0]
#kept only the rows that match the restriction MW >= 3.0, which means I only want to see earthquakes
#that were a 3.0 or greater on the Richter Scale

For my three operations, I chose to drop some columns, create a plot, and filter a dataframe.

1) By dropping the last three columns I got rid of unnecessary data and streamlined the information we see.

2) By creating a plot I showed an interesting relationship. I wanted to see the magnitude of earthquakes over time, so I needed the date as my index (which I set when I first loaded the data) to get the date on the x axis. The plot suggests that both the number and the magnitude of recorded earthquakes increase over time. This is really interesting: there could be several reasons for it, but primarily it suggests that something is changing in the region!

3) I wanted to create a new dataframe that showed only earthquakes with a magnitude of 3 or greater. I chose this threshold because around 3 on the Richter scale is when an earthquake can be 'felt' by people.
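Operations 1 and 3 can be checked on a toy dataframe. This is a sketch under assumptions: the column names `Lat` and `Extra`, the values, and the slice position are invented for illustration (the notebook slices from position 5).

```python
import pandas as pd

# Hypothetical data with one row exactly at the 3.0 threshold.
demo = pd.DataFrame(
    {"Lat": [45.1, 44.8, 46.0], "MW": [2.1, 3.0, 4.2], "Extra": ["a", "b", "c"]},
    index=pd.Index(["2001-03-14", "2004-07-02", "2009-11-23"], name="Date"),
)

# Operation 1: drop every column from position 2 onward.
trimmed = demo.drop(demo.columns[2:], axis=1)

# Operation 3: a boolean mask keeps rows where MW is 3.0 or greater.
# As in the notebook, the filter is applied to the full dataframe.
strong = demo[demo["MW"] >= 3.0]

print(trimmed.columns.tolist())
print(strong["MW"].tolist())
```

Because the comparison is `>=`, an earthquake of exactly 3.0 is kept. Neither step modifies `demo`; each returns a new dataframe.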

