In [1]:
#importing the graphlab
import graphlab as gl


ACTION REQUIRED: Dependencies libstdc++-6.dll and libgcc_s_seh-1.dll not found.

1. Ensure user account has write permission to C:\tools\Anaconda3\envs\gl-env\lib\site-packages\graphlab
2. Run graphlab.get_dependencies() to download and install them.
3. Restart Python and import graphlab again.

By running the above function, you agree to the following licenses.

* libstdc++: https://gcc.gnu.org/onlinedocs/libstdc++/manual/license.html
* xz: http://git.tukaani.org/?p=xz.git;a=blob;f=COPYING
    

In [2]:
#importing the input data into graphlab SFrame dataset
crimeData = gl.SFrame.read_csv("Philadelphia_Crime_Rate_noNA.csv")
crimeData


---------------------------------------------------------------------------
AttributeError                            Traceback (most recent call last)
<ipython-input-2-dc9ac8af02ff> in <module>()
      1 #importing the input data into graphlab SFrame dataset
----> 2 crimeData = gl.SFrame.read_csv("Philadelphia_Crime_Rate_noNA.csv")
      3 crimeData

AttributeError: 'module' object has no attribute 'SFrame'

In [ ]:
# setting plot canvas to this IPython notebook
gl.canvas.set_target('ipynb')

# plotting scatter plot 
crimeData.show(view="Scatter Plot", x="CrimeRate", y="HousePrice")

In [ ]:
# Calculating Linear Regression model

crimeData_model = gl.linear_regression.create(crimeData, target='HousePrice', features=['CrimeRate'],validation_set=None,verbose=False)

In [ ]:
crimeData_model.coefficients

In [ ]:
#importing matplotlib library for plotting

import matplotlib.pyplot as plt
%matplotlib inline

In [ ]:
plt.plot(crimeData['CrimeRate'],crimeData['HousePrice'],'.',
        crimeData['CrimeRate'],crimeData_model.predict(crimeData),'-')

In [ ]:
# Tryting the same linear model withput high influencial point

crimeData_woHI = crimeData[crimeData['HousePrice'] != 96200]

In [ ]:
crimeData_woHI_model = gl.linear_regression.create(crimeData_woHI, target='HousePrice', features=['CrimeRate'],validation_set=None,verbose=False)

In [ ]:
# getting linear model coefficients
crimeData_woHI_model.get('coefficients')

In [ ]:
plt.plot(crimeData_woHI['CrimeRate'],crimeData_woHI['HousePrice'],'.',
        crimeData_woHI['CrimeRate'],crimeData_woHI_model.predict(crimeData_woHI),'-')

In [4]:
gl.get_dependencies()


By running this function, you agree to the following licenses.

* libstdc++: https://gcc.gnu.org/onlinedocs/libstdc++/manual/license.html
* xz: http://git.tukaani.org/?p=xz.git;a=blob;f=COPYING
    
Downloading xz.
Extracting xz.
Downloading gcc-libs.
Extracting gcc-libs.
Copying gcc-libs into the installation directory.

In [ ]: