In this book, we will use Python 3. For a good introduction, see e.g., the free books Whirlwind tour of Python by Jake Vanderplas or Dive into Python 3 by Mark Pilgrim.
This document is an example of a Jupyter notebook, which mixes code and results. When developing software, it is often better to use an IDE (interactive development environment), which keeps the code in separate files. I recommend Spyder Spyder, although many people use JupyterLab for a browser-based solution.
We will leverage many standard libraries from Python's "data science stack", listed in the table below. For a good introduction to these, see e.g., the free book Python Datascience Handbook by Jake Vanderplas, or the class Computational Statistics in Python by Cliburn Chen at Duke University. For an excellent book on scikit-learn, see Hands-on Machine Learning with Scikit-Learn, Keras and TensorFlow v2 by Aurelion Geron.
| Name | Functionality |
|---|---|
| Numpy | Vector and matrix computations |
| Scipy | Various scientific / math / stats / optimization functions |
| Matplotlib | Plotting |
| Seaborn | Extension of Matplotlib |
| Pandas | Manipulating tabular data and time series |
| Scikit-learn | Implements many "Classical" ML methods |
In [0]: