Let's play with Pandas!

Simple Analysis of the Cantareira reservoir hydroclimatic data

Import pandas, numpy and matplotlib.pyplot
Create a dataframe from the file "DataCantareira.csv".
Print only the data from 2005 to 2010
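A minimal sketch of the date selection, run on a small synthetic dataframe rather than on "DataCantareira.csv". The column names `date` and `rain` are assumptions made for illustration; the real file may use different names.

```python
import pandas as pd

# Synthetic stand-in for the CSV file; the column names are hypothetical.
toy = pd.DataFrame({
    "date": pd.to_datetime(
        ["2004-12-31", "2005-01-01", "2008-06-15", "2010-12-31", "2011-01-01"]
    ),
    "rain": [0.0, 12.5, 3.1, 7.4, 1.2],
})

# Use the dates as a sorted index so that rows can be sliced by label.
toy = toy.set_index("date").sort_index()

# Partial-string slicing on a DatetimeIndex keeps both end years in full.
subset = toy.loc["2005":"2010"]
print(subset)
```

With the real file, the same slicing should work once the date column has been parsed (for example with the `parse_dates` and `index_col` arguments of `pd.read_csv`), assuming the file does contain a date column.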
Clean your dataframe by dropping the rows that contain NaN values.
What is the minimum daily accumulated rainfall? Does it seem a reasonable observed value? Remove every event (row) with negative rainfall from the dataframe. Hint: use boolean indexing.
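The two cleaning steps can be sketched as follows on synthetic data. The column name `rain` and the values are made up for illustration and do not come from the Cantareira file.

```python
import numpy as np
import pandas as pd

# Synthetic data; "rain" is a hypothetical column name.
toy = pd.DataFrame({"rain": [0.0, 5.2, np.nan, -1.0, 13.7]})

# Drop every row that contains at least one NaN.
toy = toy.dropna()

# A negative minimum points to an invalid record: rainfall cannot be negative.
print(toy["rain"].min())

# Boolean indexing: the comparison builds a True/False mask,
# and only the rows where the mask is True are kept.
clean = toy[toy["rain"] >= 0]
print(clean)
```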
When did the Cantareira reservoir experience its lowest level? Hint:
- Select the volume column
- Find the lowest value
- Perform boolean indexing and select the corresponding index
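The three steps of the hint might look like this on a synthetic series. The column name `volume` and the dates are hypothetical and chosen only to make the example run.

```python
import pandas as pd

# Synthetic data indexed by date; values and dates are invented.
toy = pd.DataFrame(
    {"volume": [55.0, 12.3, 4.9, 30.1]},
    index=pd.to_datetime(["2000-01-01", "2000-01-02", "2000-01-03", "2000-01-04"]),
)

# Step 1: select the volume column.
volume = toy["volume"]

# Step 2: find the lowest value.
lowest = volume.min()

# Step 3: boolean indexing keeps the rows equal to the minimum;
# the index of the result holds the matching date(s).
lowest_dates = volume[volume == lowest].index
print(lowest, lowest_dates)
```

As a shortcut, `volume.idxmin()` returns the index label of the first minimum directly.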
Make a bar plot of the average annual precipitation.
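One possible reading of this task is the mean of the daily rainfall values within each year. The sketch below follows that assumption on synthetic data, with a hypothetical `rain` column; use `.sum()` instead of `.mean()` if annual totals are wanted.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend, so the sketch runs without a display
import matplotlib.pyplot as plt
import pandas as pd

# Synthetic data indexed by date; values are invented.
toy = pd.DataFrame(
    {"rain": [2.0, 4.0, 10.0, 0.0]},
    index=pd.to_datetime(["2000-01-01", "2000-07-01", "2001-01-01", "2001-07-01"]),
)

# Group the rows by the year of their date and average within each group.
annual_mean = toy["rain"].groupby(toy.index.year).mean()

# One bar per year.
ax = annual_mean.plot(kind="bar")
ax.set_xlabel("Year")
ax.set_ylabel("Mean daily rainfall")
```

In a notebook, the `matplotlib.use("Agg")` line can be dropped so that the figure is displayed inline.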
Create a new column with the reservoir volume in m^3, knowing that the maximum capacity of the reservoir is approximately 1000 billion liters.

Hint:
- Conversion: volume (m^3) = (volume (%) / 100) * 10^9
- Create a function which returns the volume in m^3.
- Use the apply method
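A minimal sketch of the conversion, assuming the volume is stored as a percentage of capacity in a column named `volume` (a hypothetical name). The function name `percent_to_m3` and the new column name `volume_m3` are also made up for illustration.

```python
import pandas as pd

# 1000 billion liters = 10^12 L = 10^9 m^3, as stated in the exercise.
MAX_CAPACITY_M3 = 1e9


def percent_to_m3(volume_percent):
    """Convert a volume in percent of capacity to cubic meters."""
    return volume_percent / 100 * MAX_CAPACITY_M3


# Synthetic data; the column name is hypothetical.
toy = pd.DataFrame({"volume": [100.0, 50.0, 12.5]})

# apply calls the function once per value of the column.
toy["volume_m3"] = toy["volume"].apply(percent_to_m3)
print(toy)
```

For a simple formula like this one, the vectorized expression `toy["volume"] / 100 * MAX_CAPACITY_M3` gives the same result without `apply`; the function version is shown because the hint asks for it.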
On average, in which month of the year does the reservoir have the highest volume? Similarly, in which month of the year is the accumulated rainfall highest? (Is there a delay?) Hint:
- Use groupby
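The groupby hint can be sketched as below. The data are synthetic and deliberately built so that the two months differ; they say nothing about the real reservoir. The column names `rain` and `volume` are assumptions, and the monthly mean of the daily values is used here as one possible measure of rainfall accumulation.

```python
import pandas as pd

# Synthetic data indexed by date; values are invented.
toy = pd.DataFrame(
    {
        "rain": [10.0, 2.0, 0.5, 12.0, 3.0, 1.0],
        "volume": [40.0, 60.0, 50.0, 42.0, 64.0, 52.0],
    },
    index=pd.to_datetime(
        ["2000-01-15", "2000-03-15", "2000-07-15",
         "2001-01-15", "2001-03-15", "2001-07-15"]
    ),
)

# Group rows by calendar month (1 to 12), pooling all years, then average.
monthly = toy.groupby(toy.index.month).mean()

# idxmax returns the index label (here the month number) of the largest value.
wettest_month = monthly["rain"].idxmax()
fullest_month = monthly["volume"].idxmax()
print(monthly)
print(wettest_month, fullest_month)
```

Comparing the two month numbers on the real data is one way to look for a delay between rainfall and reservoir volume.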


