Exercise difficulty rating:
[1] easy, ~1 minute
[2] medium, ~2-3 minutes
[3] difficult, suitable for a standalone project
NumPy is imported and aliased to np by convention:
In [1]:
import numpy as np
The basic data structure provided by NumPy is the ndarray (n-dimensional array). Each array stores items of the same datatype (e.g. integers or floats). Arrays can be created in a number of ways:
In [2]:
a = np.array([0, 1, 2]) # a rank 1 array of integers
In [3]:
a = np.arange(start=0., stop=1., step=.1) # a rank 1 array of floats
# note that start is included but stop is not
b = np.arange(10) # a rank 1 array of ints from 0 to 9
In [4]:
a = np.linspace(start=0, stop=5, num=10) # a rank 1 array of 10 evenly-spaced floats between 0 and 5
# this can be thought of as a linear axis of a graph with evenly-spaced ticks
b = np.logspace(-6, -1, 6) # same as above but on logarithmic scale
In [5]:
zeros = np.zeros(5) # rank 1 array of 5 zeros
ones = np.ones(3) # it's in the name
eye = np.eye(8) # 8x8 identity matrix
In [6]:
# answer
Basic array indexing is similar to Python lists. However, arrays can be indexed along multiple dimensions.
In [7]:
a = np.array([[0, 1, 2],
              [3, 4, 5],
              [6, 7, 8]])
print(a[0])     # prints [0 1 2]
print(a[0, 0])  # prints 0
print(a[:, 1])  # prints [1 4 7]
Here : denotes all elements along the axis. For a 2D array (i.e. a matrix) axis 0 is along rows and axis 1 along columns.
In [8]:
a = np.linspace(0, 9, 10)
print(a[0:5])  # prints [0. 1. 2. 3. 4.]
b = np.array([[0, 1, 2],
              [3, 4, 5],
              [6, 7, 8],
              [9, 10, 11]])
print('-'*12)
print(b[:, 0:2])  # prints [[ 0  1]
                  #         [ 3  4]
                  #         [ 6  7]
                  #         [ 9 10]]
In [9]:
a = np.arange(10)
print(a[a < 5])       # prints [0 1 2 3 4]
print(a[a % 2 == 0])  # prints [0 2 4 6 8]
In [10]:
a = np.array([0, 1, 2])
print(a.shape)  # prints (3,)
b = np.array([[[0], [1], [2]],
              [[3], [4], [5]],
              [[6], [7], [8]]])
print(b.shape) # prints (3, 3, 1)
The shape of the array can be changed using the reshape method.
In [11]:
a = np.arange(9).reshape((3, 3))
print(a) # prints [[0 1 2]
# [3 4 5]
# [6 7 8]]
Note that the total number of elements in the new array has to be equal to the number of elements in the initial array. The code below will throw an error.
In [12]:
try:
    a = np.arange(5).reshape((3, 3))
except ValueError as e:
    print(e)
In [13]:
a = np.array([[ 23, 324,  21, 116],
              [  0,  55, 232, 122],
              [ 42,  43,  44,  45],
              [178,  67, 567,  55]])
In [14]:
# answer
In [15]:
# answer
In [16]:
a = np.arange(9).reshape(3, 3)
b = np.array([10, 11, 12])
print(a + 1) # operation is performed on the whole array
print('-'*12)
print(a - 5)
print('-'*12)
print(b * 3)
print('-'*12)
print(b / 2)
In [17]:
print(a * a) # multiply each element of a by itself
print('-'*12)
print(a + b) # add b elementwise to every row of a
print('-'*12)
print(a[:, 0] + b) # add b only to the first column of a
Notice that in the last two examples above we could perform the operations even though the arrays did not have the same shape. This mechanism is called broadcasting and is one of the most powerful features of NumPy, allowing for very efficient computations. You can read more about broadcasting in the official documentation.
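As a minimal sketch of how broadcasting aligns shapes (the arrays below are illustrative, not part of the exercises):
a = np.arange(12).reshape(4, 3)       # shape (4, 3)
row = np.array([10, 20, 30])          # shape (3,): broadcast across the 4 rows
col = np.array([[1], [2], [3], [4]])  # shape (4, 1): broadcast across the 3 columns
print(a + row)  # row is added to every row of a
print('-'*12)
print(a + col)  # col is added to every column of a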
In [18]:
a = np.random.normal(0, 1, (4, 3)) # np.random.normal creates an array of normally-distributed random numbers
# with given mean and standard deviation (e.g. 0 and 1) and in given shape.
print(np.mean(a)) # a number close to 0
print('-'*12)
print(np.mean(a) == a.mean()) # many functions are also implemented as methods in the ndarray class
print('-'*12)
print(np.std(a)) # close to 1
print('-'*12)
print(np.sum(a)) # Compute sum of all elements
print('-'*12)
print(np.sum(a, axis=0)) # Compute sum of each column
print('-'*12)
print(np.sum(a, axis=1)) # Compute sum of each row
In [19]:
# answer
In [20]:
# answer
In [21]:
b = np.array([0, 1, 0, 1, 0, 1])
In [22]:
# answer
… (you may find the eye function useful) [1].
In [23]:
# answer
… (use the axis
argument) [1].
In [24]:
# person              1       2        3       4       month
spending = np.array([[450.55, 340.67, 1023.98, 765.30],  # 1
                     [430.46, 315.99,  998.48, 760.78],  # 2
                     [470.30, 320.34, 1013.67, 774.50],  # 3
                     [445.62, 400.60, 1020.20, 799.45],  # 4
                     [432.01, 330.13, 1011.76, 750.91]]) # 5
In [25]:
# answer
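The prompt above is truncated, so the choice of statistic below is an assumption; a minimal sketch of using the axis argument on the spending data:
print(spending.sum(axis=0))   # total spending of each person over the 5 months
print('-'*12)
print(spending.mean(axis=1))  # average spending across the 4 people, per month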
The outer product of two vectors u and v can be obtained by multiplying each element of u by each element of v. Compute the outer product of the given vectors (note: use the reshape function) [2].
In [26]:
v = np.array([1,2,3])
w = np.array([4,5])
In [27]:
# answer
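For reference, one possible sketch: reshape v into a column so that broadcasting against w produces the outer product.
outer = v.reshape(-1, 1) * w  # shape (3, 1) * shape (2,) -> shape (3, 2)
print(outer)                  # prints [[ 4  5]
                              #         [ 8 10]
                              #         [12 15]]
print(np.allclose(outer, np.outer(v, w)))  # sanity check against the built-in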
NumPy has extensive support for linear algebra operations.
In [28]:
u = np.array([4, 2, 5]) # a row vector in R^3, shape (3,)
v = np.array([.5, .3, .87])
a = np.array([[3, -2, 1],
              [9, 6, 10],
              [6, -4, 3]])  # a 3x3 matrix
b = np.array([[.25, .2],
              [4.3, .1],
              [ 1., .82]])  # a 3x2 matrix
print(u.dot(v)) # the dot product of vectors
print('-'*12)
print(a.dot(u)) # the matrix vector product
print('-'*12)
print(a.dot(b)) # matrix multiplication
print('-'*12)
print(np.linalg.norm(u)) # norm aka magnitude of a vector
print('-'*12)
print(np.outer(u, v)) # uv^T
print('-'*12)
print(np.transpose(b)) # equivalent to b.T
print('-'*12)
print(u.T) # note that this does not turn a row vector into column vector
# i.e. the shape of u.T is still (3,)
print('-'*12)
print(np.linalg.inv(a)) # find the inverse of a matrix, a^-1
print('-'*12)
print(np.linalg.solve(a, u)) # solve the linear system ax = u for x
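Since transposing a 1-D array is a no-op, here is a minimal sketch of obtaining a true column vector with reshape (for illustration only):
u_col = u.reshape(-1, 1)  # shape (3, 1): a genuine column vector
print(u_col.shape)        # prints (3, 1)
print(u_col.T.shape)      # prints (1, 3): the transpose now has an effect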
In [29]:
u = np.array([.4, .23, .01])
v = np.array([.12, 1.1, .5])
In [30]:
# answer
The closed-form solution for the linear regression problem can be written as
$W = (X^{T}X)^{-1}X^{T}y$
Compute the weight matrix $W$ for the given data. Check if your weight and bias are close to the true values (2, 5). Compute the estimated values $\hat{y}$ for the test dataset [2].
In [31]:
def f_true(x):
    return 2 * x + 5

def f(x):
    # noisy observations of f_true
    return f_true(x) + np.random.normal(0, 1, x.shape[0])

xx = np.linspace(-10, 10)
X = np.ones((xx.shape[0], 2))  # design matrix with a bias column of ones
X[:, 1] = xx
y = f(xx)
test = np.array([-5.5, 1.2, 4.8, 9.])
In [32]:
# answer
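For reference, a minimal sketch of the closed-form computation under the setup above (np.linalg.solve is used instead of an explicit inverse for numerical stability):
W = np.linalg.solve(X.T.dot(X), X.T.dot(y))  # solves (X^T X) W = X^T y
print(W)  # should be close to [5, 2], i.e. bias 5 and weight 2

X_test = np.ones((test.shape[0], 2))
X_test[:, 1] = test
y_hat = X_test.dot(W)  # estimated values for the test dataset
print(y_hat)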
The functions in NumPy, including the mathematical operators, are implemented in efficient C code operating on whole arrays. Therefore, it is usually a good idea to avoid element-by-element computations in a for or while loop. Functions that operate on whole arrays are referred to as vectorized.
In [33]:
# a non-vectorized function
def log_pos_p1(x):
    """Compute the natural log + 1 of positive elements in x.
    Return 0 for all other elements."""
    result = np.zeros_like(x)  # create an array of zeros with the same shape and type as x
    for i, e in np.ndenumerate(x):  # ndenumerate yields tuples (index, element)
        if e > 0:
            result[i] = np.log(e) + 1
    return result

def log_pos_p1_vectorized(x):
    # same as above but faster
    result = np.zeros_like(x)
    result[x > 0] = np.log(x[x > 0]) + 1
    return result
In [34]:
x = np.arange(-1000., 1000.)
In [35]:
%timeit log_pos_p1(x)
In [36]:
%timeit log_pos_p1_vectorized(x)
In [37]:
# check if the results are correct
result_slow = log_pos_p1(x)
result_vectorized = log_pos_p1_vectorized(x)
np.allclose(result_slow, result_vectorized)
Out[37]:
True
The relu activation function below is implemented with an explicit loop. Vectorize it and compare the performance of the two implementations using the %timeit command on the data below [2].
In [38]:
def relu(x):
    result = np.zeros_like(x)  # create an array of 0s with the same shape and type as x
    for i, e in np.ndenumerate(x):  # ndenumerate yields tuples (index, element)
        if e > 0:
            result[i] = e
    return result
x = np.arange(-10000., 10000.)
In [39]:
# answer
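One possible vectorized version, shown as a sketch (np.maximum performs the elementwise comparison in C):
def relu_vectorized(x):
    return np.maximum(x, 0)

print(np.allclose(relu(x), relu_vectorized(x)))  # check both versions agree
# compare with: %timeit relu(x) and %timeit relu_vectorized(x)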
Write a function that numerically integrates a function $f$ from $a$ to $b$ using $n$ steps, avoiding a for-loop for the summation. Use your function to integrate $x^3$ from 0 to 1 using $n=1000$ (the true value of this integral is $\frac{1}{4}$) [2].
In [40]:
def f(x):
    return x ** 3

a = 0
b = 1
n = 1000
In [41]:
# answer
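A sketch of one possible approach; the midpoint rule is an assumption, since the original prompt is truncated:
def integrate(f, a, b, n):
    # midpoint rule: evaluate f at the centre of each of the n sub-intervals
    width = (b - a) / n
    midpoints = a + width * (np.arange(n) + 0.5)
    return np.sum(f(midpoints)) * width  # no explicit loop over elements

print(integrate(f, a, b, n))  # prints a value very close to 0.25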
As its name conveys, matplotlib is a plotting library inspired by the MATLAB plotting API. seaborn is a wrapper library for matplotlib, encapsulating some of its low-level functionality.
This is a beginner-level tutorial to matplotlib, using seaborn palettes for "prettifying" the plots. Similar results can be replicated using matplotlib alone.
In [42]:
# show plots inline without needing to call `.show()`
%matplotlib inline
In [43]:
# scientific computing library
import numpy as np
# visualization tools
import matplotlib.pyplot as plt
import seaborn as sns
# prettify plots
plt.rcParams['figure.figsize'] = [20.0, 5.0]
sns.set_palette(sns.color_palette("muted"))
sns.set_style("ticks")
# suppress warnings
import warnings
warnings.filterwarnings('ignore')
# set random seed
np.random.seed(0)
In [44]:
# x-axis variable
x = np.linspace(-2*np.pi, 2*np.pi)
# y-axis variable
y = np.cos(x)
# make sure the lengths of the two variables match
assert(len(x) == len(y))
# visualize
plt.plot(x, y);
# WARNING: Don't forget this line in `script` mode
# plt.show()
In [45]:
# x-axis variable
x = np.linspace(-2*np.pi, 2*np.pi)
# y-axis variable
y_cos = np.cos(x)
y_sin = np.sin(x)
# visualize
plt.plot(x, y_cos)
plt.plot(x, y_sin);
# WARNING: Don't forget this line in `script` mode
# plt.show()
In [46]:
# x-axis variable
x = np.linspace(-2*np.pi, 2*np.pi)
# y-axis variable
y_cos = np.cos(x)
y_sin = np.sin(x)
# visualize
plt.plot(x, y_cos, label="cos(x)") # `label` argument is used in the plot legend to represent this curve
plt.plot(x, y_sin, label="sin(x)")
# meta data
plt.title("Simple Plot")
plt.xlabel("independent variable")
plt.ylabel("function values")
plt.legend();
# WARNING: Don't forget this line in `script` mode
# plt.show()
In [47]:
# x-axis variable
x = np.linspace(-2*np.pi, 2*np.pi)
# y-axis variable
y_cos = np.cos(x)
y_sin = np.sin(x)
y_mix = y_cos + y_sin
# visualize
plt.plot(x, y_cos, linestyle="-", marker="o", label="cos(x)", linewidth=1)
plt.plot(x, y_sin, linestyle="--", marker="x", label="sin(x)", linewidth=3)
plt.plot(x, y_mix, linestyle="-.", marker="*", label="cos(x) + sin(x)", color='orange')
# meta data
plt.title("Simple Plot")
plt.xlabel("independent variable")
plt.ylabel("function values")
plt.legend();
# WARNING: Don't forget this line in `script` mode
# plt.show()
In neural networks, various activation functions are used, some of which are illustrated below!
TODO
Choose 2 activation functions and plot them using plt.plot.
Make sure you provide a title, axis labels and a legend!
Don't forget to put your personal tone into this task and style the plot!
In [48]:
## SOLUTION
# activation function definitions
def sigmoid(z):
    return 1 / (1 + np.exp(-z))
# x-axis variable
x = np.linspace(-10, 10)
# y-axis variable
y_sigm = sigmoid(x)
y_tanh = np.tanh(x)
# visualize
plt.plot(x, y_sigm, linestyle="-", marker="o", label="sigm(x)", linewidth=1)
plt.plot(x, y_tanh, linestyle="--", marker="x", label="tanh(x)", linewidth=3)
# meta data
plt.title("Activation Functions Plots")
plt.xlabel("independent variable")
plt.ylabel("function values")
plt.legend();
# WARNING: Don't forget this line in `script` mode
# plt.show()
In the ICDSS Machine Learning Workshop: Linear Models, we plotted the histograms of the daily returns for three instruments (AAPL, GOOG, SPY) in order to get an idea of the underlying distribution of our stochastic processes. This is not the only use-case!
When working with random variables in a stochastic setting, histograms are a very representative way to visualize the random variables' PDFs.
Warning
Histograms visualize the distribution of a single random variable, so make sure you notice the difference between:
plt.plot(x, y)
plt.hist(z)
In [49]:
# uniform random variable
u = np.random.uniform(-1, 1, 1000)
# visualize
plt.hist(u, label="Uniform Distribution")
# metadata
plt.title("Matplotlib Histogram")
plt.xlabel("random variable (binned)")
plt.ylabel("frequency")
plt.legend();
# WARNING: Don't forget this line in `script` mode
# plt.show()
In [50]:
# uniform random variable
u = np.random.uniform(-1, 1, 1000)
# visualize
sns.distplot(u, label="Uniform Distribution")
# metadata
plt.title("Seaborn Distribution Plot")
plt.xlabel("random variable (binned)")
plt.ylabel("frequency")
plt.legend();
# WARNING: Don't forget this line in `script` mode
# plt.show()
We covered only the uniform distribution, but there are many more already implemented in NumPy. Generate similar plots for other distributions of your choice.
TODO
Choose 2 distributions and plot them using plt.hist or sns.distplot.
Make sure you provide a title, axis labels and a legend!
Don't forget to put your personal tone into this task and style the plot!
In [51]:
## SOLUTION
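A sketch of one possible solution; the normal and exponential distributions are an arbitrary choice:
# normal and exponential random variables
norm_rv = np.random.normal(0, 1, 1000)
exp_rv = np.random.exponential(1, 1000)
# visualize
sns.distplot(norm_rv, label="Normal Distribution")
sns.distplot(exp_rv, label="Exponential Distribution")
# metadata
plt.title("Seaborn Distribution Plots")
plt.xlabel("random variable (binned)")
plt.ylabel("frequency")
plt.legend();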
In [52]:
# scientific computing library
import numpy as np
# optimization package
from scipy.optimize import minimize
# visualization tools
import matplotlib.pyplot as plt
import seaborn as sns
# show plots inline without needing to call `.show()`
%matplotlib inline
# prettify plots
plt.rcParams['figure.figsize'] = [20.0, 5.0]
sns.set_palette(sns.color_palette("muted"))
sns.set_style("ticks")
# suppress warnings
import warnings
warnings.filterwarnings('ignore')
# set random seed
np.random.seed(0)
Find the global minimizer of the function $f$: $$f(x) = x^{2} - 2x + 5\sin(x), \quad x \in \mathbb{R}$$
In [53]:
# function definition
def f(x):
    return x**2 - 2*x + 5*np.sin(x)
In [54]:
"""
`minimize(fun, x0, method, tol)`:
Parameters
----------
fun: callable
Objective function
x0: numpy.ndarray
Initial guess
str: string or callable
Type of solver, consult
[docs](https://docs.scipy.org/doc/scipy-0.19.1/reference/generated/scipy.optimize.minimize.html)
tol: float
Tolerance for termination
Returns
-------
result: OptimizeResult
x: numpy.ndarray
Minimum
success: bool
Status of optomization
message: string
Description of cause of termination
"""
result = minimize(fun=f, x0=np.random.randn())
# report
print("Minimizer:", result.x)
print("Message:", result.message)
print("Success:", result.success)
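# note: with a random x0 the solver may converge to a local minimum of f
# rather than the global one, since f has more than one critical point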
# visualization
t = np.linspace(-5, 5)
plt.plot(t, f(t), label='Objective Function')
plt.plot(result.x, f(result.x), 'ro', label='Global Minimum')
plt.legend();
In [55]:
## SOLUTION
Find a function $f: \mathcal{X} \rightarrow \mathcal{Y}$, such that: $$f(\mathbf{X}) = \mathbf{X} w \approx \mathbf{y}$$
In [56]:
# target function
def g_true(x):
    return 7*x - 3
In [57]:
# observation function
def g(x):
    return g_true(x) + np.random.normal(0, 1, len(x))
In [58]:
x = np.linspace(-10, 10)
y = g(x)
In [59]:
# add a column of ones to x
# x -> X
# [1] -> [1, 1]
# [3] -> [1, 3]
# [7] -> [1, 7]
X = np.ones((len(x), 2))
X[:, 1] = x
In [60]:
def single_add(x, y, z):
    # do stuff with x, y, z
    return x + y + z

print("Single Addition: 1 + 2 + 3 =", single_add(1, 2, 3))

def high_order_add(x, y):
    def currying_add(z):
        return x + y + z
    return currying_add

print("Currying Addition: 1 + 2 + 3 =", high_order_add(1, 2)(3))
In [61]:
# high-order function
def loss(X, y):
    # mean squared error loss function
    def _mse(w):
        assert(len(X) == len(y))
        k = len(X)
        total = 0
        for (xi, yi) in zip(X, y):
            total += (yi - np.dot(xi, w))**2
        return total / (2 * k)
    return _mse
In [62]:
result = minimize(loss(X, y), [0,0])
w = result.x
print("Optimal weights:", w)
print("True weights:", [-3, 7])
plt.title("Linear Model")
plt.xlabel("x")
plt.ylabel("y")
plt.plot(x, y, 'o', label='Observations')
plt.plot(x, np.dot(X, w), '--', label='Linear Model')
plt.plot(x, g_true(x), label='True Target', alpha=0.5)
plt.legend();
In [63]:
from mpl_toolkits.mplot3d import Axes3D
__loss = loss(X, y)
b, m = np.meshgrid(np.linspace(-10, 30, 50), np.linspace(-10, 30, 50))
zs = np.array([__loss([_b, _m]) for _b, _m in zip(np.ravel(b), np.ravel(m))])
Z = zs.reshape(m.shape)
fig = plt.figure()
ax = fig.add_subplot(111, projection='3d')
ax.plot_surface(m, b, Z, rstride=1, cstride=1, alpha=0.5)
ax.scatter(w[1], w[0], __loss(w), color='red')
ax.set_title('Optimization Surface')
ax.set_xlabel('$w_{1}$')
ax.set_ylabel('$w_{0}$')
ax.set_zlabel('error');
Mean Squared Error (MSE) is often also called L2-Norm Error, since the square (L2-Norm) of the prediction error is used. The L1-Norm Error term is used for the absolute value of the prediction error, such that: $$E_{L1}(\mathbf{X}, \mathbf{y}, w) = \frac{1}{2k} \sum_{i=1}^{k} \lvert y_{i} - \mathbf{x}_{i} \cdot w \rvert$$
TODO
Use the scipy.optimize.minimize function to minimize the L1-Norm Error of the provided data (X, y).
Hint: Reimplement the loss function to calculate the L1-Norm Error instead of the MSE.
In [64]:
## SOLUTION
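A sketch of one possible solution, mirroring the MSE loss above but with the absolute error (the solver choice is an assumption; a gradient-free method copes better with the non-smooth objective):
def l1_loss(X, y):
    # L1-Norm error loss function
    def _l1(w):
        assert(len(X) == len(y))
        k = len(X)
        total = 0
        for (xi, yi) in zip(X, y):
            total += np.abs(yi - np.dot(xi, w))
        return total / (2 * k)
    return _l1

result_l1 = minimize(l1_loss(X, y), [0, 0], method='Nelder-Mead')
print("Optimal weights:", result_l1.x)
print("True weights:", [-3, 7])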