Plotting Arrays Using matplotlib



In [ ]:

    
%matplotlib inline

The argument after the ipython magic is called the backend for plotting. There are several available, also for creating their own zoomable windows. But we also can zoom within the notebook, see below.



In [ ]:

    
import numpy as np
import matplotlib.pyplot as pl # import this for plotting routines

Refresher -- acceleration with no initial velocity or displacement



In [83]:

    
a = 9.8       # Acceleration m s^{-2}
count = 101   # Number of numbers

timeArray = np.linspace(0, 10, count)    # Create an array of 101 times between 0 and 10 (inclusive)
distArray = 0.5 * a * timeArray**2       # Create an array of distances calculate from the times

Q. What do these arrays (distArray and timeArray) contain?



In [84]:

    
print(timeArray)
print
print(distArray)









    



[  0.    0.1   0.2   0.3   0.4   0.5   0.6   0.7   0.8   0.9   1.    1.1
   1.2   1.3   1.4   1.5   1.6   1.7   1.8   1.9   2.    2.1   2.2   2.3
   2.4   2.5   2.6   2.7   2.8   2.9   3.    3.1   3.2   3.3   3.4   3.5
   3.6   3.7   3.8   3.9   4.    4.1   4.2   4.3   4.4   4.5   4.6   4.7
   4.8   4.9   5.    5.1   5.2   5.3   5.4   5.5   5.6   5.7   5.8   5.9
   6.    6.1   6.2   6.3   6.4   6.5   6.6   6.7   6.8   6.9   7.    7.1
   7.2   7.3   7.4   7.5   7.6   7.7   7.8   7.9   8.    8.1   8.2   8.3
   8.4   8.5   8.6   8.7   8.8   8.9   9.    9.1   9.2   9.3   9.4   9.5
   9.6   9.7   9.8   9.9  10. ]
[  0.00000000e+00   4.90000000e-02   1.96000000e-01   4.41000000e-01
   7.84000000e-01   1.22500000e+00   1.76400000e+00   2.40100000e+00
   3.13600000e+00   3.96900000e+00   4.90000000e+00   5.92900000e+00
   7.05600000e+00   8.28100000e+00   9.60400000e+00   1.10250000e+01
   1.25440000e+01   1.41610000e+01   1.58760000e+01   1.76890000e+01
   1.96000000e+01   2.16090000e+01   2.37160000e+01   2.59210000e+01
   2.82240000e+01   3.06250000e+01   3.31240000e+01   3.57210000e+01
   3.84160000e+01   4.12090000e+01   4.41000000e+01   4.70890000e+01
   5.01760000e+01   5.33610000e+01   5.66440000e+01   6.00250000e+01
   6.35040000e+01   6.70810000e+01   7.07560000e+01   7.45290000e+01
   7.84000000e+01   8.23690000e+01   8.64360000e+01   9.06010000e+01
   9.48640000e+01   9.92250000e+01   1.03684000e+02   1.08241000e+02
   1.12896000e+02   1.17649000e+02   1.22500000e+02   1.27449000e+02
   1.32496000e+02   1.37641000e+02   1.42884000e+02   1.48225000e+02
   1.53664000e+02   1.59201000e+02   1.64836000e+02   1.70569000e+02
   1.76400000e+02   1.82329000e+02   1.88356000e+02   1.94481000e+02
   2.00704000e+02   2.07025000e+02   2.13444000e+02   2.19961000e+02
   2.26576000e+02   2.33289000e+02   2.40100000e+02   2.47009000e+02
   2.54016000e+02   2.61121000e+02   2.68324000e+02   2.75625000e+02
   2.83024000e+02   2.90521000e+02   2.98116000e+02   3.05809000e+02
   3.13600000e+02   3.21489000e+02   3.29476000e+02   3.37561000e+02
   3.45744000e+02   3.54025000e+02   3.62404000e+02   3.70881000e+02
   3.79456000e+02   3.88129000e+02   3.96900000e+02   4.05769000e+02
   4.14736000e+02   4.23801000e+02   4.32964000e+02   4.42225000e+02
   4.51584000e+02   4.61041000e+02   4.70596000e+02   4.80249000e+02
   4.90000000e+02]

To plot distArray vs. timeArray with a scatter plot:



In [85]:

    
pl.scatter(timeArray, distArray, color = 'k')









    Out[85]:





<matplotlib.collections.PathCollection at 0x119210b70>

To plot just a section to see the discrete nature (and add labels):



In [86]:

    
pl.scatter(timeArray, distArray, color = 'k')

pl.xlim(4, 6)
pl.ylim(50, 200)

pl.xlabel('time (s)')
pl.ylabel('distance (m)')









    Out[86]:





<matplotlib.text.Text at 0x119303a20>

Now with the notebook backend:



In [87]:

    
%matplotlib notebook



In [88]:

    
pl.scatter(timeArray, distArray, color = 'k')

pl.xlim(4, 6)
pl.ylim(50, 200)

pl.xlabel('time (s)')
pl.ylabel('distance (m)')









    














    











    Out[88]:





<matplotlib.text.Text at 0x118e07e80>



In [89]:

    
%matplotlib inline

To plot distArray vs. timeArray with a blue solid line:



In [94]:

    
pl.plot(timeArray, distArray, color='b', ls='-')
pl.xlabel('time (s)')           # xlabel is the abscissa
pl.ylabel('distance (m)')       # ylabel is the ordinate









    Out[94]:





<matplotlib.text.Text at 0x1190f4d30>

To save the figure, use savefig('filename') and the .pdf, or .eps, or .png, or ... extension (which Python interprets for you!):



In [96]:

    
pl.xlabel('time1 (s)')
pl.plot(timeArray, distArray, color='b', ls='-')

pl.ylabel('distance (m)')
pl.title('Position vs. Time')

pl.savefig('position_v_time.pdf')        # In the same cell as pl.plot
pl.savefig('position_v_time.eps')
pl.savefig('position_v_time.png')

Q. Where will these files be saved on our computer?



In [98]:

    
ls position_v_time*









    



position_v_time.eps  position_v_time.pdf  position_v_time.png

More array methods

Three topics today:

Array slicing vs. copying
"Allocating" or "initializing" arrays
Boolean logic on arrays

Making copies of arrays



In [99]:

    
yArray = np.linspace(0, 5, 6)   # take care of differences of interval determination here!
zArray = yArray[1:4]

print(yArray, zArray)

# Q. What will y and z contain?









    



[ 0.  1.  2.  3.  4.  5.] [ 1.  2.  3.]



In [100]:

    
yArray[3] = 10

Q. What does the next command yield?



In [101]:

    
print(yArray, zArray)









    



[  0.   1.   2.  10.   4.   5.] [  1.   2.  10.]



In [ ]:

zArray is not a copy of yArray, it is a slice of yArray!

AND:

All arrays generated by basic slicing are always views of the original array.

In other words, the variable zArray is a reference to three elements within yArray, elements 1, 2, and 3.

If this is not the desired behavior, copy arrays:



In [102]:

    
yArray = np.linspace(0, 5, 6)
zArray = yArray.copy()
print(yArray, zArray)









    



[ 0.  1.  2.  3.  4.  5.] [ 0.  1.  2.  3.  4.  5.]



In [104]:

    
zArray = yArray.copy()[1:4]   # you only `catch` the slice into new variable, rest of copy NOT
print(yArray, zArray)









    



[ 0.  1.  2.  3.  4.  5.] [ 1.  2.  3.]



In [105]:

    
yArray[3] = 10
print(yArray, zArray)









    



[  0.   1.   2.  10.   4.   5.] [ 1.  2.  3.]

"copy" is an attribute of every numpy array, as are "shape", "size", "min", "max", etc.

Allocating Arrays

If we want an array with the same "shape" as another array, we've seen that we can copy an array with:



In [106]:

    
xArray = np.array([1, 2, 3])
aArray = xArray.copy()
aArray









    Out[106]:





array([1, 2, 3])

then fill the array with the appropriate values.

However, we could also use numpy.zeros with the attributes xArray.shape and xArray.dtype:



In [108]:

    
print(xArray.shape)  # this is a 1D vector
print(xArray.ndim)
xArray









    



(3,)
1






    Out[108]:





array([1, 2, 3])



In [109]:

    
xArray.shape = (3,1)



In [110]:

    
print(xArray.shape)  # Now it's a 3x1 2D matrix!
print(xArray.ndim)
xArray









    



(3, 1)
2






    Out[110]:





array([[1],
       [2],
       [3]])



In [112]:

    
xArray.shape = (3,)



In [113]:

    
aArray = np.zeros(xArray.shape, xArray.dtype)
print(aArray.shape)
aArray









    



(3,)






    Out[113]:





array([0, 0, 0])

Which gives aArray the same "shape" and data type as xArray.

Q. What do I mean by the "shape" of the array?



In [116]:

    
np.zeros((2,3,4))









    Out[116]:





array([[[ 0.,  0.,  0.,  0.],
        [ 0.,  0.,  0.,  0.],
        [ 0.,  0.,  0.,  0.]],

       [[ 0.,  0.,  0.,  0.],
        [ 0.,  0.,  0.,  0.],
        [ 0.,  0.,  0.,  0.]]])

Q. And what is the data type (dtype)?

Alternatively we could do:



In [117]:

    
aArray = np.zeros_like(xArray)



In [119]:

    
np.zeros??



In [120]:

    
bArray = np.ones_like(xArray)



In [121]:

    
print(aArray, bArray)









    



[0 0 0] [1 1 1]

Generalized Indexing

Subarrays can be sliced too, with or without range:



In [123]:

    
# remember, we already imported numpy (as np)!
xArray = np.linspace(1, 10, 10)  
xArray









    Out[123]:





array([  1.,   2.,   3.,   4.,   5.,   6.,   7.,   8.,   9.,  10.])

Q. What will xArray contain?



In [124]:

    
# Note the double brackets indicating a subarray
xArray[[1, 5, 6]] = -1
xArray









    Out[124]:





array([  1.,  -1.,   3.,   4.,   5.,  -1.,  -1.,   8.,   9.,  10.])



In [125]:

    
# Using range instead:

xArray = np.linspace(1, 10, 10)
xArray[range(3, 10, 3)] = -1 
xArray









    Out[125]:





array([ 1.,  2.,  3., -1.,  5.,  6., -1.,  8.,  9., -1.])

Q. What will xArray contain?



In [126]:

    
# Compare
xArray = np.linspace(1, 10, 10)
xArray[[3, 6, 9]] = -1
xArray









    Out[126]:





array([ 1.,  2.,  3., -1.,  5.,  6., -1.,  8.,  9., -1.])

Boolean Logic

When do I use that?

missing or invalid data
investigating subset of a dataset
masking/filtering etc.

Complementary methods for dealing with missing or invalid data: numpy masked arrays

http://docs.scipy.org/doc/numpy/reference/maskedarray.html

(masked arrays are a bit harder to use, but offer more powerful features)

For example, return a slice of the array consisting of negative elements only:



In [127]:

    
xArray









    Out[127]:





array([ 1.,  2.,  3., -1.,  5.,  6., -1.,  8.,  9., -1.])



In [128]:

    
myArray = xArray < 0
myArray









    Out[128]:





array([False, False, False,  True, False, False,  True, False, False,  True], dtype=bool)



In [129]:

    
xArray[xArray < 0]









    Out[129]:





array([-1., -1., -1.])

This will replace the elements of a new xArray with values less than zero with the maximum of xArray:



In [130]:

    
xArray = np.arange(-5, 5)
xArray









    Out[130]:





array([-5, -4, -3, -2, -1,  0,  1,  2,  3,  4])



In [131]:

    
xArray[xArray < 0] = xArray.max()
xArray









    Out[131]:





array([4, 4, 4, 4, 4, 0, 1, 2, 3, 4])

Compound Conditionals & Arrays

numpy has routines for doing boolean logic:



In [132]:

    
xArray = np.arange(-5, 5)
xArray









    Out[132]:





array([-5, -4, -3, -2, -1,  0,  1,  2,  3,  4])

"and"



In [133]:

    
np.logical_and(xArray > 0, xArray % 2 == 1)

# % is the modulus:  x % 2 == 1 means the remainder of x/2 is 1

# Q. So, what should running this cell give us?









    Out[133]:





array([False, False, False, False, False, False,  True, False,  True, False], dtype=bool)

"or"



In [134]:

    
np.logical_or(xArray == xArray.min(), xArray == xArray.max())









    Out[134]:





array([ True, False, False, False, False, False, False, False, False,  True], dtype=bool)

#### "not"



In [135]:

    
np.logical_not(xArray == xArray.min())









    Out[135]:





array([False,  True,  True,  True,  True,  True,  True,  True,  True,  True], dtype=bool)

#### "any" or "all"



In [136]:

    
print(np.any(xArray > 10))
print(np.any(xArray < -2))









    



False
True



In [137]:

    
print(np.all(xArray > -10))
print(np.all(xArray > -2))









    



True
False



In [ ]: