Wes McKinney, primary developer of pandas:
pandas is an open source, BSD-licensed library providing high-performance, easy-to-use data structures and data analysis tools for the Python programming language.
In more tangible terms, pandas is a Python package that streamlines common data analysis tasks and makes handling large datasets more efficient and manageable. Probably the best way to demonstrate this is by example. For these examples it's not so important to understand exactly what's going on in terms of the syntax of the code, but more about what the code is doing.
The examples in this tutorial series use the netCDF dataset which can be downloaded by clicking here. This is a surface meteorology file which contains a number of variables which could be interesting for study. In addition, all the notebooks may be found here, they are the files with the *.ipynb file extension.
Series Objects - pandas.Series()
This section walks through reading and plotting a simple timeseries from a netCDF file using pandas. General topics include reading a netCDF file, writing a function, working with datetime objects, and plotting.
DataFrame Objects - pandas.DataFrame()
This section is not yet written, but will cover working with multiple variables and datasets. General topics will include joining and concatenating frames from different sources, working with multiple Series objects, and plotting multiple variables.
Importing Data - pandas.read_csv and pandas.read_table()
Also not written. Will talk about importing data from text/tabular/csv type files natively using pandas.
In [1]:
import pandas
help(pandas)