IOOS System Test: [Extreme Events Theme]

(https://github.com/ioos/system-test/wiki/Development-of-Test-Themes# wiki-theme-2-extreme-events): Inundation

Compare modeled water levels with observations for a specified bounding box

and time period using IOOS recommended service standards for catalog search (CSW) and data retrieval (OPeNDAP & SOS).

Query CSW to find datasets that match criteria
Extract OPeNDAP data endpoints from model datasets and SOS endpoints from observational datasets
OPeNDAP model datasets will be granules
SOS endpoints may be datasets (from ncSOS) or collections of datasets (from NDBC, CO-OPS SOS servers)
Filter SOS services to obtain datasets
Extract data from SOS datasets
Extract data from model datasets at locations of observations
Compare time series data on same vertical datum



In [1]:

    
# Standard Library.
from warnings import warn
from datetime import datetime, timedelta

# Scientific stack.
import iris
iris.FUTURE.netcdf_promote = True

import numpy as np
import cartopy.crs as ccrs
import matplotlib.pyplot as plt
from matplotlib.transforms import offset_copy
from cartopy.io.img_tiles import MapQuestOpenAerial
from pandas import DataFrame, date_range, read_csv, concat

from iris.exceptions import CoordinateNotFoundError, ConstraintMismatchError

# Custom IOOS/ASA modules (available at PyPI).
from owslib import fes
from owslib.csw import CatalogueServiceWeb
from pyoos.collectors.coops.coops_sos import CoopsSos

# Local imports
from utilities import name_list, sos_name
from utilities import (fes_date_filter, coops2df, find_timevar, find_ij, nearxy,
                       service_urls, mod_df)

Specify a time range and bounding box of interest:



In [2]:

    
dates = {'Hurricane sandy':
         [datetime(2012, 10, 26), datetime(2012, 11, 2)],
         '2014 Feb 10-15 Storm':
         [datetime(2014, 2, 10), datetime(2014, 2, 15)],
         '2014 Recent': [datetime(2014, 3, 8), datetime(2014, 3, 11)],
         '2011': [datetime(2013, 4, 20), datetime(2013, 4, 24)]}

jd_now = datetime.utcnow()
jd_start,  jd_stop = jd_now - timedelta(days=3), jd_now + timedelta(days=3)

start_date = jd_start.strftime('%Y-%m-%d %H:00')
stop_date = jd_stop.strftime('%Y-%m-%d %H:00')

jd_start = datetime.strptime(start_date, '%Y-%m-%d %H:%M')
jd_stop = datetime.strptime(stop_date, '%Y-%m-%d %H:%M')

print('%s to %s' % (start_date, stop_date))









    



2014-06-10 17:00 to 2014-06-16 17:00



In [3]:

    
# Bounding Box [lon_min, lat_min, lon_max, lat_max]
area = {'Hawaii': [-160.0, 18.0, -154., 23.0],
        'Gulf of Maine': [-72.0, 41.0, -69.0, 43.0],
        'New York harbor region': [-75., 39., -71., 41.5]}

box = area['New York harbor region']

Search CSW for datasets of interest



In [4]:

    
if False:
    from IPython.core.display import HTML
    url = 'http://www.ngdc.noaa.gov/geoportal/'
    HTML('<iframe src=%s width=950 height=400></iframe>' % url)



In [5]:

    
# Connect to CSW, explore it's properties.
CSW = {'NGDC Geoportal':
       'http://www.ngdc.noaa.gov/geoportal/csw',
       'USGS WHSC Geoportal':
       'http://geoport.whoi.edu/geoportal/csw',
       'NODC Geoportal: granule level':
       'http://www.nodc.noaa.gov/geoportal/csw',
       'NODC Geoportal: collection level':
       'http://data.nodc.noaa.gov/geoportal/csw',
       'NRCAN CUSTOM':
       'http://geodiscover.cgdi.ca/wes/serviceManagerCSW/csw',
       'USGS Woods Hole GI_CAT':
       'http://geoport.whoi.edu/gi-cat/services/cswiso',
       'USGS CIDA Geonetwork':
       'http://cida.usgs.gov/gdp/geonetwork/srv/en/csw',
       'USGS Coastal and Marine Program':
       'http://cmgds.marine.usgs.gov/geonetwork/srv/en/csw',
       'USGS Woods Hole Geoportal':
       'http://geoport.whoi.edu/geoportal/csw',
       'CKAN testing site for new Data.gov':
       'http://geo.gov.ckan.org/csw',
       'EPA':
       'https://edg.epa.gov/metadata/csw',
       'CWIC':
       'http://cwic.csiss.gmu.edu/cwicv1/discovery'}

endpoint = CSW['NGDC Geoportal']
csw = CatalogueServiceWeb(endpoint, timeout=60)
csw.version









    Out[5]:





'2.0.2'



In [6]:

    
for oper in csw.operations:
    if oper.name == 'GetRecords':
        cnstr = oper.constraints['SupportedISOQueryables']['values']
        print('\nISO Queryables:%s\n' % '\n'.join(cnstr))









    



ISO Queryables:apiso:Subject
apiso:Title
apiso:Abstract
apiso:AnyText
apiso:Format
apiso:Identifier
apiso:Modified
apiso:Type
apiso:BoundingBox
apiso:CRS.Authority
apiso:CRS.ID
apiso:CRS.Version
apiso:RevisionDate
apiso:AlternateTitle
apiso:CreationDate
apiso:PublicationDate
apiso:OrganizationName
apiso:HasSecurityConstraints
apiso:Language
apiso:ResourceIdentifier
apiso:ParentIdentifier
apiso:KeywordType
apiso:TopicCategory
apiso:ResourceLanguage
apiso:GeographicDescriptionCode
apiso:Denominator
apiso:DistanceValue
apiso:DistanceUOM
apiso:TempExtent_begin
apiso:TempExtent_end
apiso:ServiceType
apiso:ServiceTypeVersion
apiso:Operation
apiso:OperatesOn
apiso:OperatesOnIdentifier
apiso:OperatesOnName
apiso:CouplingType

Convert User Input into FES filters.



In [7]:

    
start, stop = fes_date_filter(start_date, stop_date)
bbox = fes.BBox(box)



In [8]:

    
or_filt = fes.Or([fes.PropertyIsLike(propertyname='apiso:AnyText',
                                     literal=('*%s*' % val),
                                     escapeChar='\\',
                                     wildCard='*',
                                     singleChar='?') for val in name_list])

ROMS model output often has Averages and History files. The Averages files are usually averaged over a tidal cycle or more, while the History files are snapshots at that time instant. We are not interested in averaged data for this test, so in the cell below we remove any Averages files here by removing any datasets that have the term "Averages" in the metadata text. A better approach would be to look at the cell_methods attributes propagated through to some term in the ISO metadata, but this is not implemented yet, as far as I know



In [9]:

    
val = 'Averages'
not_filt = fes.Not([fes.PropertyIsLike(propertyname='apiso:AnyText',
                                       literal=('*%s*' % val),
                                       escapeChar='\\',
                                       wildCard='*',
                                       singleChar='?')])

filter_list = [fes.And([bbox, start, stop, or_filt, not_filt])]

Try request using multiple filters "and" syntax: [[filter1,filter2]].



In [10]:

    
csw.getrecords2(constraints=filter_list, maxrecords=1000, esn='full')
print(len(csw.records.keys()))

Now print out some titles



In [11]:

    
for rec, item in csw.records.items():
    print(item.title)









    



NYHOPS Forecast Model Results
ROMS ESPRESSO Real-Time Operational IS4DVAR Forecast System Version 2 (NEW) 2013-present FMRC History (Best)
NECOFS GOM3 (FVCOM) - Northeast US - Latest Forecast
NECOFS GOM3 Wave - Northeast US - Latest Forecast
DBOFS - Delaware Bay Operational Forecast System - NOAA CO-OPS - POM
NYOFS - New York and New Jersey Operational Forecast System - NOAA CO-OPS - POM
COAWST Forecast System : USGS : US East Coast and Gulf of Mexico (Experimental)
Barotropic Tide Model for the Pacific Basin
CBOFS - Chesapeake Bay Operational Forecast System - NOAA CO-OPS - POM
ESTOFS Storm Surge Model - Atlantic - v1.0.0 - NOAA - NCEP - ADCIRC
NECOFS Massachusetts (FVCOM) - Massachusetts Coastal - Latest Forecast
NDBC Standard Meteorological Buoy Data
HYbrid Coordinate Ocean Model (HYCOM): Global

Print out all the OPeNDAP Data URL endpoints



In [12]:

    
dap_urls = service_urls(csw.records,
                        service='odp:url')
print("\n".join(dap_urls))









    



http://colossus.dl.stevens-tech.edu/thredds/dodsC/latest/Complete_gcmplt.nc
http://tds.marine.rutgers.edu/thredds/dodsC/roms/espresso/2013_da/his_Best/ESPRESSO_Real-Time_v2_History_Best_Available_best.ncd
http://www.smast.umassd.edu:8080/thredds/dodsC/FVCOM/NECOFS/Forecasts/NECOFS_GOM3_FORECAST.nc
http://www.smast.umassd.edu:8080/thredds/dodsC/FVCOM/NECOFS/Forecasts/NECOFS_WAVE_FORECAST.nc
http://opendap.co-ops.nos.noaa.gov/thredds/dodsC/DBOFS/fmrc/Aggregated_7_day_DBOFS_Fields_Forecast_best.ncd
http://opendap.co-ops.nos.noaa.gov/thredds/dodsC/NYOFS/fmrc/Aggregated_7_day_NYOFS_Fields_Forecast_best.ncd
http://geoport.whoi.edu/thredds/dodsC/coawst_4/use/fmrc/coawst_4_use_best.ncd
http://oos.soest.hawaii.edu/thredds/dodsC/hioos/tide_pac
http://opendap.co-ops.nos.noaa.gov/thredds/dodsC/CBOFS/fmrc/Aggregated_7_day_CBOFS_Fields_Forecast_best.ncd
http://geoport-dev.whoi.edu/thredds/dodsC/estofs/atlantic
http://www.smast.umassd.edu:8080/thredds/dodsC/FVCOM/NECOFS/Forecasts/NECOFS_FVCOM_OCEAN_MASSBAY_FORECAST.nc
http://oos.soest.hawaii.edu/thredds/dodsC/pacioos/hycom/global

Print out all the SOS Data URL endpoints



In [13]:

    
sos_urls = service_urls(csw.records,
                        service='sos:url')
print("\n".join(sos_urls))

1. Get observations from SOS

Here we are using a custom class from pyoos to read the CO-OPS SOS. This is definitely unsavory, as the whole point of using a standard is avoid the need for custom classes for each service. Need to examine the consequences of removing this and just going with straight SOS service using OWSLib.



In [14]:

    
collector = CoopsSos()

collector.set_datum('NAVD')  # MSL
collector.server.identification.title
collector.start_time = jd_start
collector.end_time = jd_stop
collector.variables = [sos_name]



In [15]:

    
ofrs = collector.server.offerings
print(len(ofrs))
for p in ofrs[700:710]:
    print(p)









    



1027
Offering id: station-8411060, name: urn:ioos:station:NOAA.NOS.CO-OPS:8411060
Offering id: station-8413320, name: urn:ioos:station:NOAA.NOS.CO-OPS:8413320
Offering id: station-8418150, name: urn:ioos:station:NOAA.NOS.CO-OPS:8418150
Offering id: station-8419317, name: urn:ioos:station:NOAA.NOS.CO-OPS:8419317
Offering id: station-8423898, name: urn:ioos:station:NOAA.NOS.CO-OPS:8423898
Offering id: station-8443970, name: urn:ioos:station:NOAA.NOS.CO-OPS:8443970
Offering id: station-8447386, name: urn:ioos:station:NOAA.NOS.CO-OPS:8447386
Offering id: station-8447387, name: urn:ioos:station:NOAA.NOS.CO-OPS:8447387
Offering id: station-8447435, name: urn:ioos:station:NOAA.NOS.CO-OPS:8447435
Offering id: station-8447930, name: urn:ioos:station:NOAA.NOS.CO-OPS:8447930

Find the SOS stations within our bounding box and time extent

We would like to just use a filter on a collection to get a new collection, but PYOOS doesn't do that yet. So we do a GetObservation request for a collection, including a bounding box, and asking for one value at the start of the time period of interest. We use that to do a bounding box filter on the SOS server, which returns 1 point for each station found. So for 3 stations, we get back 3 records, in CSV format. We can strip the station ids from the CSV, and then we have a list of stations we can use with pyoos. The template for the GetObservation query for the bounding box filtered collection was generated using the GUI at http://opendap.co-ops.nos.noaa.gov/ioos-dif-sos/



In [16]:

    
iso_start = jd_start.strftime('%Y-%m-%dT%H:%M:%SZ')
print(iso_start)
box_str = ','.join(str(e) for e in box)
print(box_str)









    



2014-06-10T17:00:00Z
-75.0,39.0,-71.0,41.5



In [17]:

    
url = ('http://opendap.co-ops.nos.noaa.gov/ioos-dif-sos/SOS?'
       'service=SOS&request=GetObservation&version=1.0.0&'
       'observedProperty=%s&offering=urn:ioos:network:NOAA.NOS.CO-OPS:'
       'WaterLevelActive&featureOfInterest=BBOX:%s&responseFormat='
       'text/csv&eventTime=%s' % (sos_name, box_str, iso_start))

print(url)
obs_loc_df = read_csv(url)









    



http://opendap.co-ops.nos.noaa.gov/ioos-dif-sos/SOS?service=SOS&request=GetObservation&version=1.0.0&observedProperty=water_surface_height_above_reference_datum&offering=urn:ioos:network:NOAA.NOS.CO-OPS:WaterLevelActive&featureOfInterest=BBOX:-75.0,39.0,-71.0,41.5&responseFormat=text/csv&eventTime=2014-06-10T17:00:00Z



In [18]:

    
obs_loc_df.head()









    Out[18]:






  
    
      
      station_id
      sensor_id
      latitude (degree)
      longitude (degree)
      date_time
      water_surface_height_above_reference_datum (m)
      datum_id
      vertical_position (m)
    
  
  
    
      0
       urn:ioos:station:NOAA.NOS.CO-OPS:8461490
       urn:ioos:sensor:NOAA.NOS.CO-OPS:8461490:A1
       41.3614
      -72.0900
       2014-06-10T17:00:00Z
       0.263
       urn:ioos:def:datum:noaa::MLLW
       1.074
    
    
      1
       urn:ioos:station:NOAA.NOS.CO-OPS:8465705
       urn:ioos:sensor:NOAA.NOS.CO-OPS:8465705:A1
       41.2833
      -72.9083
       2014-06-10T17:00:00Z
       0.910
       urn:ioos:def:datum:noaa::MLLW
       5.618
    
    
      2
       urn:ioos:station:NOAA.NOS.CO-OPS:8467150
       urn:ioos:sensor:NOAA.NOS.CO-OPS:8467150:A1
       41.1733
      -73.1817
       2014-06-10T17:00:00Z
       1.048
       urn:ioos:def:datum:noaa::MLLW
       0.604
    
    
      3
       urn:ioos:station:NOAA.NOS.CO-OPS:8510560
       urn:ioos:sensor:NOAA.NOS.CO-OPS:8510560:A1
       41.0483
      -71.9600
       2014-06-10T17:00:00Z
       0.211
       urn:ioos:def:datum:noaa::MLLW
       1.177
    
    
      4
       urn:ioos:station:NOAA.NOS.CO-OPS:8516945
       urn:ioos:sensor:NOAA.NOS.CO-OPS:8516945:A1
       40.8103
      -73.7649
       2014-06-10T17:00:00Z
       1.236
       urn:ioos:def:datum:noaa::MLLW
       3.928



In [19]:

    
stations = [sta.split(':')[-1] for sta in obs_loc_df['station_id']]
obs_lon = [sta for sta in obs_loc_df['longitude (degree)']]
obs_lat = [sta for sta in obs_loc_df['latitude (degree)']]
print(stations)









    



['8461490', '8465705', '8467150', '8510560', '8516945', '8518750', '8531680', '8534720', '8539094', '8548989']

Generate a uniform 6-min time base for model/data comparison:



In [20]:

    
ts_rng = date_range(start=jd_start, end=jd_stop, freq='6Min')
ts = DataFrame(index=ts_rng)
print(jd_start, jd_stop)
print(len(ts))









    



(datetime.datetime(2014, 6, 10, 17, 0), datetime.datetime(2014, 6, 16, 17, 0))
1441

Create a list of obs dataframes, one for each station:



In [21]:

    
obs_df = []
sta_names = []
sta_failed = []
for sta in stations:
    b = coops2df(collector, sta, sos_name)
    name = b.name
    sta_names.append(name)
    print(name)
    if b.empty:
        sta_failed.append(name)
        b = DataFrame(np.arange(len(ts)) * np.NaN, index=ts.index, columns=['Observed Data'])
        b.name = name
    # Limit interpolation to 10 points (10 @ 6min = 1 hour).
    col = 'Observed Data'
    concatenated = concat([b, ts], axis=1).interpolate(limit=10)[col]
    obs_df.append(DataFrame(concatenated))
    obs_df[-1].name = b.name









    



New London, CT
New Haven, CT
Bridgeport, CT
Montauk, NY





    



utilities.py:93: UserWarning: Station New Haven, CT is not NAVD datum. 'Wrong Datum for this station: The correct Datum values are: MHHW, MHW, MTL, MSL, MLW, MLLW, STND'
  warn("Station %s is not NAVD datum. %s" % (long_name, e))
utilities.py:93: UserWarning: Station Kings Point, NY is not NAVD datum. 'Wrong Datum for this station: The correct Datum values are: MHHW, MHW, MTL, MSL, MLW, MLLW, STND'
  warn("Station %s is not NAVD datum. %s" % (long_name, e))






    



Kings Point, NY
The Battery, NY
Sandy Hook, NJ
Atlantic City, NJ
Burlington, Delaware River, NJ





    



utilities.py:93: UserWarning: Station Burlington, Delaware River, NJ is not NAVD datum. 'Wrong Datum for this station: The correct Datum values are: MHHW, MHW, MTL, MSL, MLW, MLLW, STND'
  warn("Station %s is not NAVD datum. %s" % (long_name, e))
utilities.py:93: UserWarning: Station Newbold, PA is not NAVD datum. 'Wrong Datum for this station: The correct Datum values are: MHHW, MHW, MTL, MSL, MLW, MLLW, STND'
  warn("Station %s is not NAVD datum. %s" % (long_name, e))






    



Newbold, PA



In [22]:

    
geodetic = ccrs.Geodetic(globe=ccrs.Globe(datum='WGS84'))
tiler = MapQuestOpenAerial()
fig, ax = plt.subplots(figsize=(8, 8), subplot_kw=dict(projection=tiler.crs))
# Open Source Imagery from MapQuest (max zoom = 16?)
zoom = 8
extent = [box[0], box[2], box[1], box[3]]
ax.set_extent(extent, geodetic)
ax.add_image(tiler, zoom)

ax.scatter(obs_lon, obs_lat, marker='o', s=30,
           color='cyan', transform=ccrs.PlateCarree())
geodetic_transform = ccrs.Geodetic()._as_mpl_transform(ax)
text_transform = offset_copy(geodetic_transform, units='dots', x=-7, y=+7)

for x, y, label in zip(obs_lon, obs_lat, sta_names):
    ax.text(x, y, label, horizontalalignment='left',
            transform=text_transform, color='white')
gl = ax.gridlines(draw_labels=True)
gl.xlabels_top = gl.ylabels_right = False
ax.set_title('Water Level Gauge Locations')









    Out[22]:





<matplotlib.text.Text at 0x7123d90>

Get model output from OPeNDAP URLS

Try to open all the OPeNDAP URLS using Iris from the British Met Office. If 1D, assume dataset is data, if 2D assume dataset is an unstructured grid model, and if 3D, assume it's a structured grid model.

Construct an Iris contraint to load only cubes that match the std_name_list:



In [23]:

    
print('\n'.join(name_list))
name_in_list = lambda cube: cube.standard_name in name_list
constraint = iris.Constraint(cube_func=name_in_list)









    



water level
sea_surface_height
sea_surface_elevation
sea_surface_height_above_geoid
sea_surface_height_above_sea_level
water_surface_height_above_reference_datum
sea_surface_height_above_reference_ellipsoid

Use only data within 0.04 degrees (about 4 km).



In [24]:

    
max_dist = 0.04

Use only data where the standard deviation of the time series exceeds 0.01 m (1 cm) this eliminates flat line model time series that come from land points that should have had missing values.



In [25]:

    
min_var = 0.01



In [26]:

    
for url in dap_urls:
    try:
        a = iris.load_cube(url, constraint)
        # convert to units of meters
        # a.convert_units('m')     # this isn't working for unstructured data
        # take first 20 chars for model name
        mod_name = a.attributes['title'][0:20]
        r = a.shape
        timevar = find_timevar(a)
        lat = a.coord(axis='Y').points
        lon = a.coord(axis='X').points
        jd = timevar.units.num2date(timevar.points)
        start = timevar.units.date2num(jd_start)
        istart = timevar.nearest_neighbour_index(start)
        stop = timevar.units.date2num(jd_stop)
        istop = timevar.nearest_neighbour_index(stop)

        # Only proceed if we have data in the range requested.
        if istart != istop:
            nsta = len(obs_lon)
            if len(r) == 3:
                print('[Structured grid model]:', url)
                d = a[0, :, :].data
                # Find the closest non-land point from a structured grid model.
                if len(lon.shape) == 1:
                    lon, lat = np.meshgrid(lon, lat)
                j, i, dd = find_ij(lon, lat, d, obs_lon, obs_lat)
                for n in range(nsta):
                    # Only use if model cell is within 0.1 degree of requested
                    # location.
                    if dd[n] <= max_dist:
                        arr = a[istart:istop, j[n], i[n]].data
                        if arr.std() >= min_var:
                            c = mod_df(arr, timevar, istart, istop,
                                       mod_name, ts)
                            name = obs_df[n].name
                            obs_df[n] = concat([obs_df[n], c], axis=1)
                            obs_df[n].name = name
            elif len(r) == 2:
                print('[Unstructured grid model]:', url)
                # Find the closest point from an unstructured grid model.
                index, dd = nearxy(lon.flatten(), lat.flatten(),
                                   obs_lon, obs_lat)
                for n in range(nsta):
                    # Only use if model cell is within 0.1 degree of requested
                    # location.
                    if dd[n] <= max_dist:
                        arr = a[istart:istop, index[n]].data
                        if arr.std() >= min_var:
                            c = mod_df(arr, timevar, istart, istop,
                                       mod_name, ts)
                            name = obs_df[n].name
                            obs_df[n] = concat([obs_df[n], c], axis=1)
                            obs_df[n].name = name
            elif len(r) == 1:
                print('[Data]:', url)
    except (ValueError, RuntimeError, CoordinateNotFoundError,
            ConstraintMismatchError) as e:
        warn("\n%s\n" % e)
        pass









    



/home/filipe/.virtualenvs/iris/lib/python2.7/site-packages/iris/fileformats/cf.py:1038: UserWarning: Ignoring formula terms variable u'h' referenced by data variable u'v' via variable u's_rho': Dimensions (u'eta_rho', u'xi_rho') do not span (u'time', u's_rho', u'eta_v', u'xi_v')
  warnings.warn(msg)
/home/filipe/.virtualenvs/iris/lib/python2.7/site-packages/iris/fileformats/cf.py:1038: UserWarning: Ignoring formula terms variable u'zeta' referenced by data variable u'v' via variable u's_rho': Dimensions (u'time', u'eta_rho', u'xi_rho') do not span (u'time', u's_rho', u'eta_v', u'xi_v')
  warnings.warn(msg)
/home/filipe/.virtualenvs/iris/lib/python2.7/site-packages/iris/fileformats/cf.py:1038: UserWarning: Ignoring formula terms variable u'h' referenced by data variable u'u' via variable u's_rho': Dimensions (u'eta_rho', u'xi_rho') do not span (u'time', u's_rho', u'eta_u', u'xi_u')
  warnings.warn(msg)
/home/filipe/.virtualenvs/iris/lib/python2.7/site-packages/iris/fileformats/cf.py:1038: UserWarning: Ignoring formula terms variable u'zeta' referenced by data variable u'u' via variable u's_rho': Dimensions (u'time', u'eta_rho', u'xi_rho') do not span (u'time', u's_rho', u'eta_u', u'xi_u')
  warnings.warn(msg)
/home/filipe/.virtualenvs/iris/lib/python2.7/site-packages/iris/fileformats/_pyke_rules/compiled_krb/fc_rules_cf_fc.py:1290: UserWarning: Gracefully filling 'time' dimension coordinate masked points
  warnings.warn(msg.format(str(cf_coord_var.cf_name)))






    



('[Structured grid model]:', 'http://colossus.dl.stevens-tech.edu/thredds/dodsC/latest/Complete_gcmplt.nc')
('[Structured grid model]:', 'http://tds.marine.rutgers.edu/thredds/dodsC/roms/espresso/2013_da/his_Best/ESPRESSO_Real-Time_v2_History_Best_Available_best.ncd')
('[Unstructured grid model]:', 'http://www.smast.umassd.edu:8080/thredds/dodsC/FVCOM/NECOFS/Forecasts/NECOFS_GOM3_FORECAST.nc')





    



/home/filipe/.virtualenvs/iris/lib/python2.7/site-packages/iris/fileformats/cf.py:1004: UserWarning: Ignoring variable u'siglay' referenced by variable u'v': Dimensions (u'siglay', u'node') do not span (u'time', u'siglay', u'nele')
  warnings.warn(msg)
/home/filipe/.virtualenvs/iris/lib/python2.7/site-packages/iris/fileformats/cf.py:1004: UserWarning: Ignoring variable u'siglay' referenced by variable u'ww': Dimensions (u'siglay', u'node') do not span (u'time', u'siglay', u'nele')
  warnings.warn(msg)
/home/filipe/.virtualenvs/iris/lib/python2.7/site-packages/iris/fileformats/cf.py:1004: UserWarning: Ignoring variable u'siglay' referenced by variable u'u': Dimensions (u'siglay', u'node') do not span (u'time', u'siglay', u'nele')
  warnings.warn(msg)
-c:59: UserWarning: 
'Expected to find exactly 1  coordinate, but found none.'







    



('[Structured grid model]:', 'http://opendap.co-ops.nos.noaa.gov/thredds/dodsC/DBOFS/fmrc/Aggregated_7_day_DBOFS_Fields_Forecast_best.ncd')





    



/home/filipe/.virtualenvs/iris/lib/python2.7/site-packages/iris/fileformats/cf.py:1038: UserWarning: Ignoring formula terms variable u'h' referenced by data variable u'Cs_r' via variable u's_rho': Dimensions (u'eta_rho', u'xi_rho') do not span (u's_rho',)
  warnings.warn(msg)
/home/filipe/.virtualenvs/iris/lib/python2.7/site-packages/iris/fileformats/cf.py:1038: UserWarning: Ignoring formula terms variable u'zeta' referenced by data variable u'Cs_r' via variable u's_rho': Dimensions (u'time', u'eta_rho', u'xi_rho') do not span (u's_rho',)
  warnings.warn(msg)
/home/filipe/.virtualenvs/iris/lib/python2.7/site-packages/iris/fileformats/cf.py:1038: UserWarning: Ignoring formula terms variable u'h' referenced by data variable u'Cs_w' via variable u's_w': Dimensions (u'eta_rho', u'xi_rho') do not span (u's_w',)
  warnings.warn(msg)
/home/filipe/.virtualenvs/iris/lib/python2.7/site-packages/iris/fileformats/cf.py:1038: UserWarning: Ignoring formula terms variable u'zeta' referenced by data variable u'Cs_w' via variable u's_w': Dimensions (u'time', u'eta_rho', u'xi_rho') do not span (u's_w',)
  warnings.warn(msg)
/home/filipe/.virtualenvs/iris/lib/python2.7/site-packages/iris/fileformats/_pyke_rules/compiled_krb/fc_rules_cf_fc.py:1195: UserWarning: Ignoring netCDF variable 'mask' invalid units 'nondimensional'
  warnings.warn(msg.format(msg_name, msg_units))






    



('[Structured grid model]:', 'http://opendap.co-ops.nos.noaa.gov/thredds/dodsC/NYOFS/fmrc/Aggregated_7_day_NYOFS_Fields_Forecast_best.ncd')
('[Structured grid model]:', 'http://geoport.whoi.edu/thredds/dodsC/coawst_4/use/fmrc/coawst_4_use_best.ncd')
('[Structured grid model]:', 'http://oos.soest.hawaii.edu/thredds/dodsC/hioos/tide_pac')
('[Structured grid model]:', 'http://opendap.co-ops.nos.noaa.gov/thredds/dodsC/CBOFS/fmrc/Aggregated_7_day_CBOFS_Fields_Forecast_best.ncd')





    



/home/filipe/.virtualenvs/iris/lib/python2.7/site-packages/iris/fileformats/_pyke_rules/compiled_krb/fc_rules_cf_fc.py:1195: UserWarning: Ignoring netCDF variable 'sigma' invalid units 'sigma_level'
  warnings.warn(msg.format(msg_name, msg_units))
/home/filipe/.virtualenvs/iris/lib/python2.7/site-packages/iris/fileformats/_pyke_rules/compiled_krb/fc_rules_cf_fc.py:1195: UserWarning: Ignoring netCDF variable 'nbdv' invalid units 'nondimensional'
  warnings.warn(msg.format(msg_name, msg_units))






    



('[Unstructured grid model]:', 'http://geoport-dev.whoi.edu/thredds/dodsC/estofs/atlantic')
('[Unstructured grid model]:', 'http://www.smast.umassd.edu:8080/thredds/dodsC/FVCOM/NECOFS/Forecasts/NECOFS_FVCOM_OCEAN_MASSBAY_FORECAST.nc')
('[Structured grid model]:', 'http://oos.soest.hawaii.edu/thredds/dodsC/pacioos/hycom/global')






    



/home/filipe/.virtualenvs/iris/lib/python2.7/site-packages/iris/fileformats/_pyke_rules/compiled_krb/fc_rules_cf_fc.py:1195: UserWarning: Ignoring netCDF variable 'neta' invalid units 'nondimensional'
  warnings.warn(msg.format(msg_name, msg_units))
/home/filipe/.virtualenvs/iris/lib/python2.7/site-packages/iris/fileformats/_pyke_rules/compiled_krb/fc_rules_cf_fc.py:1195: UserWarning: Ignoring netCDF variable 'ibtypee' invalid units 'nondimensional'
  warnings.warn(msg.format(msg_name, msg_units))
/home/filipe/.virtualenvs/iris/lib/python2.7/site-packages/iris/fileformats/_pyke_rules/compiled_krb/fc_rules_cf_fc.py:1195: UserWarning: Ignoring netCDF variable 'nvell' invalid units 'nondimensional'
  warnings.warn(msg.format(msg_name, msg_units))
/home/filipe/.virtualenvs/iris/lib/python2.7/site-packages/iris/fileformats/_pyke_rules/compiled_krb/fc_rules_cf_fc.py:1195: UserWarning: Ignoring netCDF variable 'ibtype' invalid units 'nondimensional'
  warnings.warn(msg.format(msg_name, msg_units))
/home/filipe/.virtualenvs/iris/lib/python2.7/site-packages/iris/fileformats/_pyke_rules/compiled_krb/fc_rules_cf_fc.py:1195: UserWarning: Ignoring netCDF variable 'nbvv' invalid units 'nondimensional'
  warnings.warn(msg.format(msg_name, msg_units))
/home/filipe/.virtualenvs/iris/lib/python2.7/site-packages/iris/fileformats/_pyke_rules/compiled_krb/fc_rules_cf_fc.py:1195: UserWarning: Ignoring netCDF variable 'nvel' invalid units 'nondimensional'
  warnings.warn(msg.format(msg_name, msg_units))
/home/filipe/.virtualenvs/iris/lib/python2.7/site-packages/iris/fileformats/_pyke_rules/compiled_krb/fc_rules_cf_fc.py:1195: UserWarning: Ignoring netCDF variable 'nvdll' invalid units 'nondimensional'
  warnings.warn(msg.format(msg_name, msg_units))



In [27]:

    
for df in obs_df:
    ax = df.plot(figsize=(14, 6), title=df.name, legend=False)
    plt.setp(ax.lines[0], linewidth=4.0, color='0.7', zorder=1)
    ax.legend()
    ax.set_ylabel('m')

Plot again, but now remove the mean offset (relative to data) from all plots.



In [28]:

    
for df in obs_df:
    amean = df[jd_start:jd_now].mean()
    name = df.name
    df = df - amean + amean.ix[0]
    df.name = name
    ax = df.plot(figsize=(14, 6), title=df.name, legend=False)
    plt.setp(ax.lines[0], linewidth=4.0, color='0.7', zorder=1)
    ax.legend()
    ax.set_ylabel('m')
    print(amean.ix[0] - amean)









    



Observed Data           0.000000
NYHOPS Forecast Mode   -0.235059
NECOFS GOM3 (FVCOM)    -0.312789
dtype: float64
Observed Data          NaN
NYHOPS Forecast Mode   NaN
NECOFS GOM3 (FVCOM)    NaN
ESTOFS Storm Surge M   NaN
dtype: float64
Observed Data           0.000000
NYHOPS Forecast Mode   -0.280171
ROMS ESPRESSO Real-T    0.309511
NECOFS GOM3 (FVCOM)    -0.306929
ESTOFS Storm Surge M    0.159833
dtype: float64
Observed Data           0.000000
NYHOPS Forecast Mode   -0.244448
NECOFS GOM3 (FVCOM)    -0.313148
COAWST Forecast Syst   -0.082960
ESTOFS Storm Surge M   -0.063505
dtype: float64
Observed Data          NaN
NYHOPS Forecast Mode   NaN
NECOFS GOM3 (FVCOM)    NaN
NYOFS - New York and   NaN
ESTOFS Storm Surge M   NaN
dtype: float64
Observed Data           0.000000
NYHOPS Forecast Mode   -0.153475
ROMS ESPRESSO Real-T    0.291613
NECOFS GOM3 (FVCOM)    -0.294186
NYOFS - New York and   -0.034242
dtype: float64
Observed Data           0.000000
NYHOPS Forecast Mode   -0.147237
ROMS ESPRESSO Real-T    0.379440
NECOFS GOM3 (FVCOM)    -0.217136
NYOFS - New York and    0.025665
COAWST Forecast Syst   -0.038497
ESTOFS Storm Surge M   -0.108329
dtype: float64
Observed Data           0.000000
NYHOPS Forecast Mode   -0.224575
ROMS ESPRESSO Real-T    0.307620
NECOFS GOM3 (FVCOM)    -0.274459
DBOFS - Delaware Bay   -0.099669
COAWST Forecast Syst   -0.131332
ESTOFS Storm Surge M   -0.166251
dtype: float64
Observed Data          NaN
DBOFS - Delaware Bay   NaN
dtype: float64
Observed Data          NaN
DBOFS - Delaware Bay   NaN
dtype: float64

	station_id	sensor_id	latitude (degree)	longitude (degree)	date_time	water_surface_height_above_reference_datum (m)	datum_id	vertical_position (m)
0	urn:ioos:station:NOAA.NOS.CO-OPS:8461490	urn:ioos:sensor:NOAA.NOS.CO-OPS:8461490:A1	41.3614	-72.0900	2014-06-10T17:00:00Z	0.263	urn:ioos:def:datum:noaa::MLLW	1.074
1	urn:ioos:station:NOAA.NOS.CO-OPS:8465705	urn:ioos:sensor:NOAA.NOS.CO-OPS:8465705:A1	41.2833	-72.9083	2014-06-10T17:00:00Z	0.910	urn:ioos:def:datum:noaa::MLLW	5.618
2	urn:ioos:station:NOAA.NOS.CO-OPS:8467150	urn:ioos:sensor:NOAA.NOS.CO-OPS:8467150:A1	41.1733	-73.1817	2014-06-10T17:00:00Z	1.048	urn:ioos:def:datum:noaa::MLLW	0.604
3	urn:ioos:station:NOAA.NOS.CO-OPS:8510560	urn:ioos:sensor:NOAA.NOS.CO-OPS:8510560:A1	41.0483	-71.9600	2014-06-10T17:00:00Z	0.211	urn:ioos:def:datum:noaa::MLLW	1.177
4	urn:ioos:station:NOAA.NOS.CO-OPS:8516945	urn:ioos:sensor:NOAA.NOS.CO-OPS:8516945:A1	40.8103	-73.7649	2014-06-10T17:00:00Z	1.236	urn:ioos:def:datum:noaa::MLLW	3.928