IOOS System Test: Extreme Events Theme: Inundation

Compare modeled water levels with observations for a specified bounding box and time period, using the IOOS-recommended service standards for catalog search (CSW) and data retrieval (OPeNDAP and SOS).

Query CSW to find datasets that match criteria
Extract OPeNDAP data endpoints from model datasets and SOS endpoints from observational datasets
OPeNDAP model datasets will be granules
SOS endpoints may be datasets (from ncSOS) or collections of datasets (from NDBC, CO-OPS SOS servers)
Filter SOS services to obtain datasets
Extract data from SOS datasets
Extract data from model datasets at locations of observations
Compare time series data on same vertical datum



In [1]:

    
from pylab import *
from owslib.csw import CatalogueServiceWeb
from owslib import fes
import random
import netCDF4
import pandas as pd
import datetime as dt
from pyoos.collectors.coops.coops_sos import CoopsSos
import cStringIO
import iris
import urllib2
import parser
from lxml import etree
import cartopy.crs as ccrs
import cartopy.feature as cfeature
from cartopy.io.img_tiles import MapQuestOpenAerial, MapQuestOSM, OSM
iris.FUTURE.netcdf_promote = True
%matplotlib inline

Specify a time range and bounding box of interest:



In [2]:

    
# specify specific times (UTC) ...

# hurricane sandy
jd_start = dt.datetime(2012,10,26)
jd_stop = dt.datetime(2012,11,2)

# 2014 feb 10-15 storm
jd_start = dt.datetime(2014,2,10)
jd_stop = dt.datetime(2014,2,15)

# 2014 recent
jd_start = dt.datetime(2014,3,8)
jd_stop = dt.datetime(2014,3,11)

# 2013 apr 20-24
#jd_start = dt.datetime(2013,4,20)
#jd_stop = dt.datetime(2013,4,24)

# ... or relative to now
jd_now = dt.datetime.utcnow()
jd_start = jd_now - dt.timedelta(days=3)
jd_stop = jd_now + dt.timedelta(days=3)

start_date = jd_start.strftime('%Y-%m-%d %H:00')
stop_date  = jd_stop.strftime('%Y-%m-%d %H:00')

jd_start = dt.datetime.strptime(start_date,'%Y-%m-%d %H:%M')
jd_stop = dt.datetime.strptime(stop_date,'%Y-%m-%d %H:%M')

print start_date,'to',stop_date









    



2015-02-20 21:00 to 2015-02-26 21:00
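The strftime/strptime round trip in the cell above has the effect of truncating both times to the whole hour. As a minimal sketch (plain Python 3, with a made-up example instant and a hypothetical helper name `truncate_to_hour`), the same truncation can be written with `datetime.replace`:

```python
import datetime as dt

def truncate_to_hour(when):
    """Drop minutes, seconds and microseconds from a datetime."""
    return when.replace(minute=0, second=0, microsecond=0)

# Hypothetical example instant, not taken from the notebook run
example = dt.datetime(2015, 2, 23, 21, 37, 12)
via_replace = truncate_to_hour(example)

# The round trip used in the cell above, applied to the same instant
via_roundtrip = dt.datetime.strptime(example.strftime('%Y-%m-%d %H:00'),
                                     '%Y-%m-%d %H:%M')

print(via_replace, via_roundtrip)
```

Both approaches give the same datetime; the string version has the side benefit of producing the `start_date` and `stop_date` literals that the CSW filters need later.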



In [3]:

    
# Bounding Box [lon_min, lat_min, lon_max, lat_max]
#box=[-75., 39., -71., 41.5]  # new york harbor region
box=[-72.0, 41.0, -69.0, 43.0]   # gulf of maine
#box=[-160.0, 18.0, -154., 23.0] #hawaii

Next, list every name we know for water level. These names are used both in the CSW search and to locate the water level variable in the datasets that are returned. This approach is ugly and fragile; hopefully there will be a better way in the future.



In [4]:

    
name_list=['water_surface_height_above_reference_datum',
    'sea_surface_height_above_geoid','sea_surface_elevation',
    'sea_surface_height_above_reference_ellipsoid','sea_surface_height_above_sea_level',
    'sea_surface_height','water level']

sos_name = 'water_surface_height_above_reference_datum'

Search CSW for datasets of interest



In [5]:

    
#from IPython.core.display import HTML
#HTML('<iframe src=http://www.ngdc.noaa.gov/geoportal/ width=950 height=400></iframe>')



In [6]:

    
# connect to CSW, explore its properties

endpoint = 'http://www.ngdc.noaa.gov/geoportal/csw' # NGDC Geoportal
#endpoint = 'http://geoport.whoi.edu/geoportal/csw'  # USGS WHSC Geoportal

#endpoint = 'http://www.nodc.noaa.gov/geoportal/csw'   # NODC Geoportal: granule level
#endpoint = 'http://data.nodc.noaa.gov/geoportal/csw'  # NODC Geoportal: collection level   
#endpoint = 'http://geodiscover.cgdi.ca/wes/serviceManagerCSW/csw'  # NRCAN CUSTOM
#endpoint = 'http://geoport.whoi.edu/gi-cat/services/cswiso' # USGS Woods Hole GI_CAT
#endpoint = 'http://cida.usgs.gov/gdp/geonetwork/srv/en/csw' # USGS CIDA Geonetwork
#endpoint = 'http://cmgds.marine.usgs.gov/geonetwork/srv/en/csw' # USGS Coastal and Marine Program
#endpoint = 'http://geoport.whoi.edu/geoportal/csw' # USGS Woods Hole Geoportal 
#endpoint = 'http://geo.gov.ckan.org/csw'  # CKAN testing site for new Data.gov
#endpoint = 'https://edg.epa.gov/metadata/csw'  # EPA
#endpoint = 'http://cwic.csiss.gmu.edu/cwicv1/discovery'  # CWIC

csw = CatalogueServiceWeb(endpoint,timeout=60)
csw.version









    Out[6]:





'2.0.2'



In [7]:

    
# hopefully something like this will be implemented in fes soon
def dateRange(start_date='1900-01-01',stop_date='2100-01-01',constraint='overlaps'):
    if constraint == 'overlaps':
        start = fes.PropertyIsLessThanOrEqualTo(propertyname='apiso:TempExtent_begin', literal=stop_date)
        stop = fes.PropertyIsGreaterThanOrEqualTo(propertyname='apiso:TempExtent_end', literal=start_date)
    elif constraint == 'within':
        start = fes.PropertyIsGreaterThanOrEqualTo(propertyname='apiso:TempExtent_begin', literal=start_date)
        stop = fes.PropertyIsLessThanOrEqualTo(propertyname='apiso:TempExtent_end', literal=stop_date)
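    else:
        # fail early instead of hitting an undefined variable below
        raise ValueError("constraint must be 'overlaps' or 'within', got %r" % constraint)
    return start,stop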
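To make the two constraint modes concrete, here is a small sketch in plain Python 3 that applies the same comparisons to a record's temporal extent directly. The function name `matches` and the example extents are assumptions for illustration; date strings in the same `YYYY-MM-DD` format compare correctly as text.

```python
def matches(rec_begin, rec_end, start_date, stop_date, constraint='overlaps'):
    """Apply the dateRange comparisons to one record's time extent."""
    if constraint == 'overlaps':
        # record starts before the request ends AND ends after it starts
        return rec_begin <= stop_date and rec_end >= start_date
    elif constraint == 'within':
        # record lies entirely inside the requested period
        return rec_begin >= start_date and rec_end <= stop_date
    raise ValueError("constraint must be 'overlaps' or 'within'")

# Requested period (same days as the example run, dates only)
req = ('2015-02-20', '2015-02-26')

# Hypothetical record extents
long_archive = ('2013-01-01', '2016-01-01')
short_run = ('2015-02-21', '2015-02-25')

print(matches(*long_archive, *req, constraint='overlaps'))
print(matches(*long_archive, *req, constraint='within'))
print(matches(*short_run, *req, constraint='within'))
```

A long-running archive overlaps the request without lying within it, which is why 'overlaps' is the sensible default when searching for model aggregations.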



In [8]:

    
print start_date,stop_date









    



2015-02-20 21:00 2015-02-26 21:00



In [9]:

    
# convert User Input into FES filters
start,stop = dateRange(start_date,stop_date)
bbox = fes.BBox(box,crs='urn:ogc:def:crs:OGC:1.3:CRS84')



In [10]:

    
or_filt = fes.Or([fes.PropertyIsLike(propertyname='apiso:AnyText',literal=('*%s*' % val),
                    escapeChar='\\',wildCard='*',singleChar='?') for val in name_list])

ROMS model output often comes as both Averages and History files. The Averages files are usually averaged over a tidal cycle or more, while the History files are snapshots at a single time instant. We are not interested in averaged data for this test, so the cell below removes any dataset that has the term "Averages" in its metadata text. A better approach would be to look at the cell_methods attributes propagated through to some term in the ISO metadata, but as far as I know this is not implemented yet.



In [11]:

    
val = 'Averages'
not_filt = fes.Not([fes.PropertyIsLike(propertyname='apiso:AnyText',literal=('*%s*' % val),
                        escapeChar='\\',wildCard='*',singleChar='?')])



In [12]:

    
filter_list = [fes.And([ bbox, start, stop, or_filt, not_filt]) ]
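The text part of this combined filter (any of the names, but not "Averages") can be mimicked locally to see what it accepts. This is only a rough sketch in plain Python 3: it uses `fnmatch` wildcards on made-up metadata strings, ignores the bounding box and time filters, and matches case-sensitively, whereas the behavior of PropertyIsLike on a real CSW server may differ.

```python
from fnmatch import fnmatchcase

# A subset of name_list, enough for the illustration
names = ['sea_surface_height', 'water level']

def keep(text, names, exclude='Averages'):
    """Or over '*name*' patterns, And-ed with Not '*exclude*'."""
    wanted = any(fnmatchcase(text, '*%s*' % name) for name in names)
    unwanted = fnmatchcase(text, '*%s*' % exclude)
    return wanted and not unwanted

# Hypothetical metadata snippets
history = 'ROMS History file: sea_surface_height'
averages = 'ROMS Averages file: sea_surface_height'
unrelated = 'blended sea_surface_temperature'

for text in (history, averages, unrelated):
    print(text, '->', keep(text, names))
```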



In [13]:

    
# run the request with all filters combined in a single fes.And
csw.getrecords2(constraints=filter_list,maxrecords=1000,esn='full')
print len(csw.records.keys())



In [14]:

    
csw.request









    Out[14]:





'<csw:GetRecords xmlns:csw="http://www.opengis.net/cat/csw/2.0.2" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:ogc="http://www.opengis.net/ogc" xmlns:gml="http://www.opengis.net/gml" outputSchema="http://www.opengis.net/cat/csw/2.0.2" outputFormat="application/xml" version="2.0.2" service="CSW" resultType="results" maxRecords="1000" xsi:schemaLocation="http://www.opengis.net/cat/csw/2.0.2 http://schemas.opengis.net/csw/2.0.2/CSW-discovery.xsd"><csw:Query typeNames="csw:Record"><csw:ElementSetName>full</csw:ElementSetName><csw:Constraint version="1.1.0"><ogc:Filter><ogc:And><ogc:BBOX><ogc:PropertyName>ows:BoundingBox</ogc:PropertyName><gml:Envelope srsName="urn:ogc:def:crs:OGC:1.3:CRS84"><gml:lowerCorner>-72.0 41.0</gml:lowerCorner><gml:upperCorner>-69.0 43.0</gml:upperCorner></gml:Envelope></ogc:BBOX><ogc:PropertyIsLessThanOrEqualTo><ogc:PropertyName>apiso:TempExtent_begin</ogc:PropertyName><ogc:Literal>2015-02-26 21:00</ogc:Literal></ogc:PropertyIsLessThanOrEqualTo><ogc:PropertyIsGreaterThanOrEqualTo><ogc:PropertyName>apiso:TempExtent_end</ogc:PropertyName><ogc:Literal>2015-02-20 21:00</ogc:Literal></ogc:PropertyIsGreaterThanOrEqualTo><ogc:Or><ogc:PropertyIsLike wildCard="*" singleChar="?" escapeChar="\\"><ogc:PropertyName>apiso:AnyText</ogc:PropertyName><ogc:Literal>*water_surface_height_above_reference_datum*</ogc:Literal></ogc:PropertyIsLike><ogc:PropertyIsLike wildCard="*" singleChar="?" escapeChar="\\"><ogc:PropertyName>apiso:AnyText</ogc:PropertyName><ogc:Literal>*sea_surface_height_above_geoid*</ogc:Literal></ogc:PropertyIsLike><ogc:PropertyIsLike wildCard="*" singleChar="?" escapeChar="\\"><ogc:PropertyName>apiso:AnyText</ogc:PropertyName><ogc:Literal>*sea_surface_elevation*</ogc:Literal></ogc:PropertyIsLike><ogc:PropertyIsLike wildCard="*" singleChar="?" 
escapeChar="\\"><ogc:PropertyName>apiso:AnyText</ogc:PropertyName><ogc:Literal>*sea_surface_height_above_reference_ellipsoid*</ogc:Literal></ogc:PropertyIsLike><ogc:PropertyIsLike wildCard="*" singleChar="?" escapeChar="\\"><ogc:PropertyName>apiso:AnyText</ogc:PropertyName><ogc:Literal>*sea_surface_height_above_sea_level*</ogc:Literal></ogc:PropertyIsLike><ogc:PropertyIsLike wildCard="*" singleChar="?" escapeChar="\\"><ogc:PropertyName>apiso:AnyText</ogc:PropertyName><ogc:Literal>*sea_surface_height*</ogc:Literal></ogc:PropertyIsLike><ogc:PropertyIsLike wildCard="*" singleChar="?" escapeChar="\\"><ogc:PropertyName>apiso:AnyText</ogc:PropertyName><ogc:Literal>*water level*</ogc:Literal></ogc:PropertyIsLike></ogc:Or><ogc:Not><ogc:PropertyIsLike wildCard="*" singleChar="?" escapeChar="\\"><ogc:PropertyName>apiso:AnyText</ogc:PropertyName><ogc:Literal>*Averages*</ogc:Literal></ogc:PropertyIsLike></ogc:Not></ogc:And></ogc:Filter></csw:Constraint></csw:Query></csw:GetRecords>'

Now print out some titles



In [15]:

    
for rec,item in csw.records.iteritems():
    print item.title









    



NECOFS Massachusetts (FVCOM) - Massachusetts Coastal - Latest Forecast
NYHOPS Forecast Model Results
ROMS ESPRESSO Real-Time Operational IS4DVAR Forecast System Version 2 (NEW) 2013-present FMRC History (Best)
NECOFS GOM3 (FVCOM) - Northeast US - Latest Forecast
NECOFS GOM3 Wave - Northeast US - Latest Forecast
COAWST Forecast System : USGS : US East Coast and Gulf of Mexico (Experimental)
Barotropic Tide Model for the Pacific Basin
ESTOFS Storm Surge Model - Atlantic - v1.0.0 - NOAA - NCEP - ADCIRC
NDBC Standard Meteorological Buoy Data
HYbrid Coordinate Ocean Model (HYCOM): Global
G1SST, 1km blended SST
NERACOOS Gulf of Maine Ocean Array: Realtime Buoy Observations: A0133 Western Maine Shelf: A0133 CTD52m Massachusetts Bay
NERACOOS Gulf of Maine Ocean Array: Realtime Buoy Observations: A01 Massachusetts Bay: A01 OPTICS3m Massachusetts Bay

Define a function that will return the endpoint for a specified service type



In [16]:

    
def service_urls(records,service_string='urn:x-esri:specification:ServiceType:odp:url'):
    """
    extract service_urls of a specific type (DAP, SOS) from records
    """
    urls=[]
    for key,rec in records.iteritems():
        # create a generator object and iterate through it until a match is found;
        # if no match is found, next() returns the default value (here None)
        url = next((d['url'] for d in rec.references if d['scheme'] == service_string), None)
        if url is not None:
            urls.append(url)
    return urls
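The `next(generator, default)` idiom used in `service_urls` can be tried without a CSW connection. The sketch below is plain Python 3 and uses a hand-made list of reference dictionaries with a placeholder URL; only the `scheme` and `url` keys mirror what the function above reads from `rec.references`.

```python
DAP = 'urn:x-esri:specification:ServiceType:odp:url'
SOS = 'urn:x-esri:specification:ServiceType:sos:url'

def first_url(references, scheme):
    """Return the first URL whose scheme matches, or None if there is none."""
    return next((ref['url'] for ref in references if ref['scheme'] == scheme), None)

# Hypothetical references for one record (placeholder URL)
references = [
    {'scheme': DAP, 'url': 'http://example.com/thredds/dodsC/model.nc'},
]

print(first_url(references, DAP))
print(first_url(references, SOS))
```

Because the generator is lazy, the search stops at the first matching reference, and records that lack the requested service simply yield None and are skipped.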

Print out all the OPeNDAP Data URL endpoints



In [17]:

    
dap_urls = service_urls(csw.records,service_string='urn:x-esri:specification:ServiceType:odp:url')
print "\n".join(dap_urls)









    



http://www.smast.umassd.edu:8080/thredds/dodsC/FVCOM/NECOFS/Forecasts/NECOFS_FVCOM_OCEAN_MASSBAY_FORECAST.nc
http://colossus.dl.stevens-tech.edu/thredds/dodsC/latest/Complete_gcmplt.nc
http://tds.marine.rutgers.edu/thredds/dodsC/roms/espresso/2013_da/his_Best/ESPRESSO_Real-Time_v2_History_Best_Available_best.ncd
http://www.smast.umassd.edu:8080/thredds/dodsC/FVCOM/NECOFS/Forecasts/NECOFS_GOM3_FORECAST.nc
http://www.smast.umassd.edu:8080/thredds/dodsC/FVCOM/NECOFS/Forecasts/NECOFS_WAVE_FORECAST.nc
http://geoport.whoi.edu/thredds/dodsC/coawst_4/use/fmrc/coawst_4_use_best.ncd
http://oos.soest.hawaii.edu/thredds/dodsC/hioos/tide_pac
http://geoport-dev.whoi.edu/thredds/dodsC/estofs/atlantic
http://oos.soest.hawaii.edu/thredds/dodsC/pacioos/hycom/global
http://thredds.axiomalaska.com/thredds/dodsC/G1_SST_GLOBAL.nc
http://www.neracoos.org/thredds/dodsC/UMO/DSG/SOS/A01/CTD52m/HistoricRealtime/Agg.ncml
http://www.neracoos.org/thredds/dodsC/UMO/DSG/SOS/A01/OPTICS_S3m/HistoricRealtime/Agg.ncml

Print out all the SOS Data URL endpoints



In [18]:

    
sos_urls = service_urls(csw.records,service_string='urn:x-esri:specification:ServiceType:sos:url')
print "\n".join(sos_urls)









    



http://www.neracoos.org/thredds/sos/UMO/DSG/SOS/A01/CTD52m/HistoricRealtime/Agg.ncml?service=SOS&version=1.0.0&request=GetCapabilities
http://www.neracoos.org/thredds/sos/UMO/DSG/SOS/A01/OPTICS_S3m/HistoricRealtime/Agg.ncml?service=SOS&version=1.0.0&request=GetCapabilities



In [19]:

    
def nearxy(x,y,xi,yi):
    """
    find the indices x[i] of arrays (x,y) closest to the points (xi,yi)
    """
    ind=ones(len(xi),dtype=int)
    dd=ones(len(xi),dtype='float')
    for i in arange(len(xi)):
        dist=sqrt((x-xi[i])**2+(y-yi[i])**2)
        ind[i]=dist.argmin()
        dd[i]=dist[ind[i]]
    return ind,dd
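Note that `nearxy` measures distance in degrees, treating longitude and latitude as Cartesian coordinates. To get a feel for what a distance in degrees means on the ground, here is a hedged sketch in plain Python 3 using the haversine formula on a spherical Earth (the 6371 km radius and the helper name `haversine_km` are assumptions). The reference point is the position of CO-OPS station 8443970 as reported in the SOS response later in this notebook.

```python
import math

def haversine_km(lon1, lat1, lon2, lat2, radius_km=6371.0):
    """Great-circle distance in km between two lon/lat points (degrees)."""
    lon1, lat1, lon2, lat2 = map(math.radians, (lon1, lat1, lon2, lat2))
    a = (math.sin((lat2 - lat1) / 2) ** 2
         + math.cos(lat1) * math.cos(lat2) * math.sin((lon2 - lon1) / 2) ** 2)
    return 2 * radius_km * math.asin(math.sqrt(a))

lon0, lat0 = -71.0534, 42.3548

# 0.04 degrees to the north versus 0.04 degrees to the east
north = haversine_km(lon0, lat0, lon0, lat0 + 0.04)
east = haversine_km(lon0, lat0, lon0 + 0.04, lat0)

print('0.04 deg north: %.2f km' % north)
print('0.04 deg east:  %.2f km' % east)
```

At this latitude a fixed offset in degrees covers less ground east-west than north-south, so a threshold expressed in degrees is only an approximate distance limit.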



In [20]:

    
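def find_ij(x,y,d,xi,yi):
    """
    find the non-NaN cells d[j,i] that are closest to the points (xi,yi).
    returns (j,i,dd): row indices, column indices and distances
    """
    index = where(~isnan(d.flatten()))[0]
    ind,dd = nearxy(x.flatten()[index],y.flatten()[index],xi,yi)
    # ind2ij returns (i,j), i.e. column index first
    i,j=ind2ij(x,index[ind])
    return j,i,dd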



In [21]:

    
def find_timevar(cube):
    """
    return the time variable from Iris. This is a workaround for
    Iris having problems with FMRC aggregations, which produce two time coordinates
    """
    try:
        cube.coord(axis='T').rename('time')
    except:
        pass
    timevar = cube.coord('time')
    return timevar



In [22]:

    
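def ind2ij(a,index):
    """
    return the indices (i,j) such that a[j,i] equals a.ravel()[index]
    """
    # use the shape of the array that was passed in, not the global lon
    n,m = shape(a)
    # integer division gives the row, the remainder gives the column
    j = (index//m).astype(int)
    i = remainder(index,m)
    return i,j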
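The flat-index arithmetic behind `ind2ij` is easy to check by hand for a row-major array. A minimal sketch in plain Python 3, using `divmod` and a small nested list in place of a NumPy array (the helper name `flat_to_ji` is hypothetical):

```python
def flat_to_ji(index, shape):
    """Convert a row-major flat index into (j, i) = (row, column)."""
    nrows, ncols = shape
    j, i = divmod(index, ncols)
    return j, i

# A 3 x 4 "array" whose values equal their own flat index
rows = [[0, 1, 2, 3],
        [4, 5, 6, 7],
        [8, 9, 10, 11]]

j, i = flat_to_ji(7, (3, 4))
print(j, i, rows[j][i])
```

NumPy offers the same conversion as `numpy.unravel_index`, which would be an alternative to a hand-written helper.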

1. Get observations from SOS

Here we are using a custom class from pyoos to read the CO-OPS SOS. This is definitely unsavory, as the whole point of using a standard is to avoid the need for custom classes for each service. We need to examine the consequences of removing this class and using the plain SOS service through OWSLib instead.



In [23]:

    
collector = CoopsSos()
#collector.set_datum('NAVD')
collector.set_datum('MSL')



In [24]:

    
collector.server.identification.title









    Out[24]:





'NOAA.NOS.CO-OPS SOS'



In [25]:

    
collector.start_time = jd_start
collector.end_time = jd_stop
collector.variables = [sos_name]



In [26]:

    
ofrs = collector.server.offerings



In [27]:

    
print len(ofrs)
for p in ofrs[700:710]: print p









    



1036
Offering id: station-UNI1021, name: urn:ioos:station:NOAA.NOS.CO-OPS:UNI1021
Offering id: station-UNI1022, name: urn:ioos:station:NOAA.NOS.CO-OPS:UNI1022
Offering id: station-UNI1023, name: urn:ioos:station:NOAA.NOS.CO-OPS:UNI1023
Offering id: station-UNI1024, name: urn:ioos:station:NOAA.NOS.CO-OPS:UNI1024
Offering id: station-WAC1401, name: urn:ioos:station:NOAA.NOS.CO-OPS:WAC1401
Offering id: station-1611400, name: urn:ioos:station:NOAA.NOS.CO-OPS:1611400
Offering id: station-1612340, name: urn:ioos:station:NOAA.NOS.CO-OPS:1612340
Offering id: station-1612480, name: urn:ioos:station:NOAA.NOS.CO-OPS:1612480
Offering id: station-1615680, name: urn:ioos:station:NOAA.NOS.CO-OPS:1615680
Offering id: station-1617433, name: urn:ioos:station:NOAA.NOS.CO-OPS:1617433

Find the SOS stations within our bounding box and time extent

We would like to just apply a filter to a collection and get a new collection back, but pyoos doesn't do that yet. Instead, we issue a GetObservation request for a collection that includes the bounding box and asks for a single value at the start of the time period of interest. The SOS server applies the bounding box filter and returns one point for each station found, so for 3 stations we get back 3 records, in CSV format. We can strip the station ids from the CSV to get a list of stations we can use with pyoos. The template for the bounding-box-filtered GetObservation query was generated using the GUI at http://opendap.co-ops.nos.noaa.gov/ioos-dif-sos/



In [28]:

    
iso_start = jd_start.strftime('%Y-%m-%dT%H:%M:%SZ')
print iso_start
box_str=','.join(str(e) for e in box)
print box_str









    



2015-02-20T21:00:00Z
-72.0,41.0,-69.0,43.0



In [29]:

    
url=('http://opendap.co-ops.nos.noaa.gov/ioos-dif-sos/SOS?'
     'service=SOS&request=GetObservation&version=1.0.0&'
     'observedProperty=%s&offering=urn:ioos:network:NOAA.NOS.CO-OPS:WaterLevelActive&'
     'featureOfInterest=BBOX:%s&responseFormat=text/csv&eventTime=%s') % (sos_name,box_str,iso_start)
print url
obs_loc_df = pd.read_csv(url)









    



http://opendap.co-ops.nos.noaa.gov/ioos-dif-sos/SOS?service=SOS&request=GetObservation&version=1.0.0&observedProperty=water_surface_height_above_reference_datum&offering=urn:ioos:network:NOAA.NOS.CO-OPS:WaterLevelActive&featureOfInterest=BBOX:-72.0,41.0,-69.0,43.0&responseFormat=text/csv&eventTime=2015-02-20T21:00:00Z
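As an alternative to string formatting, the same query string can be assembled from key/value pairs. This is a sketch for Python 3 only (the notebook itself runs on Python 2), using `urllib.parse.urlencode` with `:`, `,` and `/` marked as safe so the URNs, the bounding box and the MIME type are left unescaped. The variable names are illustrative.

```python
from urllib.parse import urlencode

base = 'http://opendap.co-ops.nos.noaa.gov/ioos-dif-sos/SOS'

# Same parameters, in the same order, as the request printed above
params = [
    ('service', 'SOS'),
    ('request', 'GetObservation'),
    ('version', '1.0.0'),
    ('observedProperty', 'water_surface_height_above_reference_datum'),
    ('offering', 'urn:ioos:network:NOAA.NOS.CO-OPS:WaterLevelActive'),
    ('featureOfInterest', 'BBOX:-72.0,41.0,-69.0,43.0'),
    ('responseFormat', 'text/csv'),
    ('eventTime', '2015-02-20T21:00:00Z'),
]

sketch_url = base + '?' + urlencode(params, safe=':,/')
print(sketch_url)
```

Keeping the parameters in a list makes it easier to swap the offering or the response format without editing a long format string.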



In [30]:

    
obs_loc_df.head()









    Out[30]:






  
    
      
                                 station_id                                   sensor_id  latitude (degree)  longitude (degree)  \
0  urn:ioos:station:NOAA.NOS.CO-OPS:8443970  urn:ioos:sensor:NOAA.NOS.CO-OPS:8443970:B1            42.3548            -71.0534   
1  urn:ioos:station:NOAA.NOS.CO-OPS:8447386  urn:ioos:sensor:NOAA.NOS.CO-OPS:8447386:B1            41.7043            -71.1641   
2  urn:ioos:station:NOAA.NOS.CO-OPS:8447930  urn:ioos:sensor:NOAA.NOS.CO-OPS:8447930:B1            41.5233            -70.6717   
3  urn:ioos:station:NOAA.NOS.CO-OPS:8452660  urn:ioos:sensor:NOAA.NOS.CO-OPS:8452660:B1            41.5050            -71.3267   
4  urn:ioos:station:NOAA.NOS.CO-OPS:8454000  urn:ioos:sensor:NOAA.NOS.CO-OPS:8454000:B1            41.8071            -71.4012   

              date_time  water_surface_height_above_reference_datum (m)                       datum_id  vertical_position (m)
0  2015-02-20T21:00:00Z                                           0.468  urn:ioos:def:datum:noaa::MLLW                  1.074
1  2015-02-20T21:00:00Z                                          -0.381  urn:ioos:def:datum:noaa::MLLW                  6.356
2  2015-02-20T21:00:00Z                                          -0.492  urn:ioos:def:datum:noaa::MLLW                  0.797
3  2015-02-20T21:00:00Z                                          -0.540  urn:ioos:def:datum:noaa::MLLW                  0.577
4  2015-02-20T21:00:00Z                                          -0.405  urn:ioos:def:datum:noaa::MLLW                  1.064



In [31]:

    
stations = [sta.split(':')[-1] for sta in obs_loc_df['station_id']]
print stations
obs_lon = [sta for sta in obs_loc_df['longitude (degree)']]
obs_lat = [sta for sta in obs_loc_df['latitude (degree)']]









    



['8443970', '8447386', '8447930', '8452660', '8454000']



In [32]:

    
print stations









    



['8443970', '8447386', '8447930', '8452660', '8454000']

Get longName from SOS DescribeSensor (station) request



In [33]:

    
def get_Coops_longName(sta):
    """
    get longName for specific station from COOPS SOS using DescribeSensor request
    """
    url=('http://opendap.co-ops.nos.noaa.gov/ioos-dif-sos/SOS?service=SOS&'
        'request=DescribeSensor&version=1.0.0&outputFormat=text/xml;subtype="sensorML/1.0.1"&'
        'procedure=urn:ioos:station:NOAA.NOS.CO-OPS:%s') % sta
    tree = etree.parse(urllib2.urlopen(url))
    root = tree.getroot()
    longName=root.xpath("//sml:identifier[@name='longName']/sml:Term/sml:value/text()", namespaces={'sml':"http://www.opengis.net/sensorML/1.0.1"})
    return longName
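The XPath lookup can be exercised offline. The sketch below, written for Python 3, parses a hand-written and heavily reduced SensorML fragment with the standard library's `xml.etree.ElementTree`. The fragment is an assumption that only mirrors the element names used in the XPath above; a real DescribeSensor response contains much more.

```python
import xml.etree.ElementTree as ET

SML = 'http://www.opengis.net/sensorML/1.0.1'

# Hypothetical, minimal stand-in for a DescribeSensor response
sample = '''<sml:SensorML xmlns:sml="http://www.opengis.net/sensorML/1.0.1">
  <sml:member><sml:System>
    <sml:identification><sml:IdentifierList>
      <sml:identifier name="stationID">
        <sml:Term><sml:value>urn:ioos:station:NOAA.NOS.CO-OPS:8443970</sml:value></sml:Term>
      </sml:identifier>
      <sml:identifier name="longName">
        <sml:Term><sml:value>Boston, MA</sml:value></sml:Term>
      </sml:identifier>
    </sml:IdentifierList></sml:identification>
  </sml:System></sml:member>
</sml:SensorML>'''

root = ET.fromstring(sample)

# ElementTree supports this subset of XPath, including the attribute predicate
path = ".//sml:identifier[@name='longName']/sml:Term/sml:value"
long_names = [el.text for el in root.findall(path, namespaces={'sml': SML})]
print(long_names)
```

As in `get_Coops_longName`, the result is a list, so callers need to handle the empty case when a station has no longName identifier.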

Request CSV response from SOS and convert to Pandas DataFrames



In [34]:

    
def coops2df(collector,coops_id,sos_name):
    collector.features = [coops_id]
    collector.variables = [sos_name]
    response = collector.raw(responseFormat="text/csv")
    data_df = pd.read_csv(cStringIO.StringIO(str(response)), parse_dates=True, index_col='date_time')
#    data_df['Observed Data']=data_df['water_surface_height_above_reference_datum (m)']-data_df['vertical_position (m)']
    data_df['Observed Data']=data_df['water_surface_height_above_reference_datum (m)']

    a = get_Coops_longName(coops_id)
    if len(a)==0:
        long_name=coops_id
    else:
        long_name=a[0]
        
    data_df.name=long_name
    return data_df

Generate a uniform 6-min time base for model/data comparison:



In [35]:

    
ts_rng = pd.date_range(start=jd_start, end=jd_stop, freq='6Min')
ts = pd.DataFrame(index=ts_rng)
print jd_start,jd_stop
print len(ts)









    



2015-02-20 21:00:00 2015-02-26 21:00:00
1441
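The length of the time base follows from simple arithmetic: six days at a 6-minute step is 1440 intervals, plus one for the inclusive end point. A small sketch in plain Python 3 that builds the same series with `datetime` only (the helper name `time_base` is hypothetical):

```python
import datetime as dt

def time_base(start, stop, step_minutes=6):
    """Return evenly spaced datetimes from start to stop, inclusive."""
    step = dt.timedelta(minutes=step_minutes)
    n_steps = (stop - start) // step      # whole steps that fit in the period
    return [start + k * step for k in range(n_steps + 1)]

start = dt.datetime(2015, 2, 20, 21, 0)
stop = dt.datetime(2015, 2, 26, 21, 0)

base = time_base(start, stop)
print(len(base), base[0], base[-1])
```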

Create a list of obs dataframes, one for each station:



In [36]:

    
obs_df=[]
sta_names=[]
for sta in stations:
    b=coops2df(collector,sta,sos_name)
    sta_names.append(b.name)
    print b.name
    # limit interpolation to 10 points (10 @ 6min = 1 hour)
    obs_df.append(pd.DataFrame(pd.concat([b, ts],axis=1).interpolate(limit=10)['Observed Data']))
    obs_df[-1].name=b.name









    



Boston, MA
Fall River, MA
Woods Hole, MA
Newport, RI
Providence, RI

from matplotlib.transforms import offset_copy
geodetic = ccrs.Geodetic(globe=ccrs.Globe(datum='WGS84'))
figure(figsize=(8,8))
# Open Source Imagery from MapQuest (max zoom = 16?)
tiler = MapQuestOpenAerial()
# Open Street Map (max zoom = 18?)
#tiler = OSM()
ax = plt.axes(projection=tiler.crs)
extent=[box[0],box[2],box[1],box[3]]
ax.set_extent(extent, geodetic)
ax.add_image(tiler, 7)
plt.scatter(obs_lon,obs_lat,marker='o',s=30.0, color='cyan',transform=ccrs.PlateCarree())
geodetic_transform = ccrs.Geodetic()._as_mpl_transform(ax)
text_transform = offset_copy(geodetic_transform, units='dots', x=-7,y=+7)
for x,y,label in zip(obs_lon,obs_lat,sta_names):
    plt.text(x,y,label,horizontalalignment='left',transform=text_transform,color='white')
gl=ax.gridlines(draw_labels=True)
gl.xlabels_top = False
gl.ylabels_right = False
title('Water Level Gauge Locations')

Get model output from OPeNDAP URLs

Try to open all the OPeNDAP URLs using Iris from the British Met Office. If the cube is 1D, assume the dataset is data; if it is 2D, assume the dataset is an unstructured grid model; and if it is 3D, assume it is a structured grid model.
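The dimension-count rule above amounts to a lookup on the number of dimensions. A tiny sketch in plain Python 3, with a hypothetical helper name and made-up shapes:

```python
def classify(shape):
    """Guess the dataset type from the number of dimensions of the variable."""
    kinds = {
        1: 'data',                     # (time,)
        2: 'unstructured grid model',  # (time, node)
        3: 'structured grid model',    # (time, y, x)
    }
    return kinds.get(len(shape), 'unknown')

# Hypothetical shapes
for shape in [(1441,), (145, 98432), (157, 82, 130), (10, 5, 82, 130)]:
    print(shape, '->', classify(shape))
```

This is a heuristic rather than a guarantee: it relies on the water level variable having no vertical dimension.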

Construct an Iris constraint to load only cubes whose standard_name is in name_list:



In [37]:

    
print name_list
def name_in_list(cube):
    return cube.standard_name in name_list
constraint = iris.Constraint(cube_func=name_in_list)









    



['water_surface_height_above_reference_datum', 'sea_surface_height_above_geoid', 'sea_surface_elevation', 'sea_surface_height_above_reference_ellipsoid', 'sea_surface_height_above_sea_level', 'sea_surface_height', 'water level']



In [38]:

    
def mod_df(arr,timevar,istart,istop,mod_name,ts):
    """
    return time series (DataFrame) from model interpolated onto uniform time base
    """
    t=timevar.points[istart:istop]
    jd = timevar.units.num2date(t)

    # eliminate any data that is closer together than 10 seconds
    # this was required to handle issues with CO-OPS aggregations, I think because
    # they use floating point time in hours, which is not very accurate, so the FMRC
    # aggregation is aggregating points that actually occur at the same time
    delta = diff(jd)
    # total_seconds() includes whole days, which the .seconds attribute ignores
    s = array([ele.total_seconds() for ele in delta])
    ind=where(s>10)[0]
    arr=arr[ind+1]
    jd=jd[ind+1]
    
    b = pd.DataFrame(arr,index=jd,columns=[mod_name])
    # eliminate any data with NaN
    b = b[isfinite(b[mod_name])]
    # interpolate onto uniform time base, fill gaps up to: (10 values @ 6 min = 1 hour)
    c = pd.concat([b, ts],axis=1).interpolate(limit=10)
    return c
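The duplicate-time screening inside `mod_df` can be illustrated without NumPy or pandas. The sketch below, in plain Python 3 with made-up times and values, keeps sample k+1 only when it lies more than 10 seconds after sample k. Like the indexing in `mod_df`, this never keeps the very first sample.

```python
import datetime as dt

def drop_close_samples(times, values, min_gap_seconds=10):
    """Keep sample k+1 only if it is more than min_gap_seconds after sample k."""
    keep = [k + 1 for k in range(len(times) - 1)
            if (times[k + 1] - times[k]).total_seconds() > min_gap_seconds]
    return [times[k] for k in keep], [values[k] for k in keep]

t0 = dt.datetime(2015, 2, 20, 21, 0)
# Hypothetical model times: the third sample repeats the second, 5 s later
times = [t0,
         t0 + dt.timedelta(hours=1),
         t0 + dt.timedelta(hours=1, seconds=5),
         t0 + dt.timedelta(hours=2)]
values = [0.10, 0.20, 0.21, 0.30]

kept_times, kept_values = drop_close_samples(times, values)
print(kept_times, kept_values)
```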



In [39]:

    
# FIXME: Filtering bad URLs.
dap_urls = [link for link in dap_urls
            if 'hycom' not in link]  # Cartesian coords are not implemented.



In [40]:

    
# use only data within 0.04 degrees (about 4 km)
max_dist=0.04 

# use only data where the standard deviation of the time series exceeds 0.01 m (1 cm)
# this eliminates flat line model time series that come from land points that 
# should have had missing values.
min_var=0.01

for url in dap_urls:
    try:
        a = iris.load_cube(url,constraint)
        # convert to units of meters
 #       a.convert_units('m')     # this isn't working for unstructured data
        # take first 20 chars for model name
        mod_name = a.attributes['title'][0:20]
        r = shape(a)
        timevar = find_timevar(a)
        lat = a.coord(axis='Y').points
        lon = a.coord(axis='X').points
        jd = timevar.units.num2date(timevar.points)
        istart = timevar.nearest_neighbour_index(timevar.units.date2num(jd_start))
        istop = timevar.nearest_neighbour_index(timevar.units.date2num(jd_stop))
        
        # only proceed if we have data in the range requested
        if istart != istop:            
            nsta = len(obs_lon)
            if len(r)==3:
                print '[Structured grid model]:', url
                d = a[0,:,:].data
                # find the closest non-land point from a structured grid model
                if len(shape(lon))==1:
                    lon,lat= meshgrid(lon,lat)
                j,i,dd = find_ij(lon,lat,d,obs_lon,obs_lat)
                for n in range(nsta):
                    # only use if model cell is within max_dist degrees of requested location
                    if dd[n] <= max_dist:
                        arr = a[istart:istop,j[n],i[n]].data                    
                        if arr.std() >= min_var:
                            c = mod_df(arr,timevar,istart,istop,mod_name,ts)
                            name= obs_df[n].name
                            obs_df[n]=pd.concat([obs_df[n],c],axis=1)
                            obs_df[n].name = name
            elif len(r)==2:
                print '[Unstructured grid model]:', url
                # find the closest point from an unstructured grid model
                index,dd = nearxy(lon.flatten(),lat.flatten(),obs_lon,obs_lat)
                for n in range(nsta):
                    # only use if model cell is within max_dist degrees of requested location
                    if dd[n] <= max_dist:
                        arr = a[istart:istop,index[n]].data
                        if arr.std() >= min_var:
                            c = mod_df(arr,timevar,istart,istop,mod_name,ts)
                            name = obs_df[n].name
                            obs_df[n]=pd.concat([obs_df[n],c],axis=1)
                            obs_df[n].name = name 
            elif len(r)==1:
                print '[Data]:', url
                        
    except:
        pass









    



[Unstructured grid model]: http://www.smast.umassd.edu:8080/thredds/dodsC/FVCOM/NECOFS/Forecasts/NECOFS_FVCOM_OCEAN_MASSBAY_FORECAST.nc
[Structured grid model]:





    



/home/usgs/miniconda/envs/ioos/lib/python2.7/site-packages/iris/fileformats/cf.py:1004: UserWarning: Ignoring variable u'siglay' referenced by variable u'ww': Dimensions (u'siglay', u'node') do not span (u'time', u'siglay', u'nele')
  warnings.warn(msg)
/home/usgs/miniconda/envs/ioos/lib/python2.7/site-packages/iris/fileformats/cf.py:1004: UserWarning: Ignoring variable u'siglay' referenced by variable u'u': Dimensions (u'siglay', u'node') do not span (u'time', u'siglay', u'nele')
  warnings.warn(msg)
/home/usgs/miniconda/envs/ioos/lib/python2.7/site-packages/iris/fileformats/cf.py:1004: UserWarning: Ignoring variable u'siglay' referenced by variable u'v': Dimensions (u'siglay', u'node') do not span (u'time', u'siglay', u'nele')
  warnings.warn(msg)
/home/usgs/miniconda/envs/ioos/lib/python2.7/site-packages/iris/fileformats/cf.py:1038: UserWarning: Ignoring formula terms variable u'h' referenced by data variable u'v' via variable u's_rho': Dimensions (u'eta_rho', u'xi_rho') do not span (u'time', u's_rho', u'eta_v', u'xi_v')
  warnings.warn(msg)
/home/usgs/miniconda/envs/ioos/lib/python2.7/site-packages/iris/fileformats/cf.py:1038: UserWarning: Ignoring formula terms variable u'zeta' referenced by data variable u'v' via variable u's_rho': Dimensions (u'time', u'eta_rho', u'xi_rho') do not span (u'time', u's_rho', u'eta_v', u'xi_v')
  warnings.warn(msg)
/home/usgs/miniconda/envs/ioos/lib/python2.7/site-packages/iris/fileformats/cf.py:1038: UserWarning: Ignoring formula terms variable u'h' referenced by data variable u'u' via variable u's_rho': Dimensions (u'eta_rho', u'xi_rho') do not span (u'time', u's_rho', u'eta_u', u'xi_u')
  warnings.warn(msg)
/home/usgs/miniconda/envs/ioos/lib/python2.7/site-packages/iris/fileformats/cf.py:1038: UserWarning: Ignoring formula terms variable u'zeta' referenced by data variable u'u' via variable u's_rho': Dimensions (u'time', u'eta_rho', u'xi_rho') do not span (u'time', u's_rho', u'eta_u', u'xi_u')
  warnings.warn(msg)
/home/usgs/miniconda/envs/ioos/lib/python2.7/site-packages/iris/fileformats/_pyke_rules/compiled_krb/fc_rules_cf_fc.py:1291: UserWarning: Gracefully filling 'time' dimension coordinate masked points
  warnings.warn(msg.format(str(cf_coord_var.cf_name)))






    



 http://colossus.dl.stevens-tech.edu/thredds/dodsC/latest/Complete_gcmplt.nc
[Structured grid model]: http://tds.marine.rutgers.edu/thredds/dodsC/roms/espresso/2013_da/his_Best/ESPRESSO_Real-Time_v2_History_Best_Available_best.ncd
[Unstructured grid model]: http://www.smast.umassd.edu:8080/thredds/dodsC/FVCOM/NECOFS/Forecasts/NECOFS_GOM3_FORECAST.nc
[Structured grid model]: http://geoport.whoi.edu/thredds/dodsC/coawst_4/use/fmrc/coawst_4_use_best.ncd
[Structured grid model]: http://oos.soest.hawaii.edu/thredds/dodsC/hioos/tide_pac
[Unstructured grid model]:





    



/home/usgs/miniconda/envs/ioos/lib/python2.7/site-packages/iris/fileformats/_pyke_rules/compiled_krb/fc_rules_cf_fc.py:1196: UserWarning: Ignoring netCDF variable 'nbdv' invalid units 'nondimensional'
  warnings.warn(msg.format(msg_name, msg_units))
/home/usgs/miniconda/envs/ioos/lib/python2.7/site-packages/iris/fileformats/_pyke_rules/compiled_krb/fc_rules_cf_fc.py:1196: UserWarning: Ignoring netCDF variable 'neta' invalid units 'nondimensional'
  warnings.warn(msg.format(msg_name, msg_units))
/home/usgs/miniconda/envs/ioos/lib/python2.7/site-packages/iris/fileformats/_pyke_rules/compiled_krb/fc_rules_cf_fc.py:1196: UserWarning: Ignoring netCDF variable 'ibtypee' invalid units 'nondimensional'
  warnings.warn(msg.format(msg_name, msg_units))
/home/usgs/miniconda/envs/ioos/lib/python2.7/site-packages/iris/fileformats/_pyke_rules/compiled_krb/fc_rules_cf_fc.py:1196: UserWarning: Ignoring netCDF variable 'nvell' invalid units 'nondimensional'
  warnings.warn(msg.format(msg_name, msg_units))
/home/usgs/miniconda/envs/ioos/lib/python2.7/site-packages/iris/fileformats/_pyke_rules/compiled_krb/fc_rules_cf_fc.py:1196: UserWarning: Ignoring netCDF variable 'ibtype' invalid units 'nondimensional'
  warnings.warn(msg.format(msg_name, msg_units))
/home/usgs/miniconda/envs/ioos/lib/python2.7/site-packages/iris/fileformats/_pyke_rules/compiled_krb/fc_rules_cf_fc.py:1196: UserWarning: Ignoring netCDF variable 'nbvv' invalid units 'nondimensional'
  warnings.warn(msg.format(msg_name, msg_units))
/home/usgs/miniconda/envs/ioos/lib/python2.7/site-packages/iris/fileformats/_pyke_rules/compiled_krb/fc_rules_cf_fc.py:1196: UserWarning: Ignoring netCDF variable 'nvel' invalid units 'nondimensional'
  warnings.warn(msg.format(msg_name, msg_units))
/home/usgs/miniconda/envs/ioos/lib/python2.7/site-packages/iris/fileformats/_pyke_rules/compiled_krb/fc_rules_cf_fc.py:1196: UserWarning: Ignoring netCDF variable 'nvdll' invalid units 'nondimensional'
  warnings.warn(msg.format(msg_name, msg_units))
/home/usgs/miniconda/envs/ioos/lib/python2.7/site-packages/iris/fileformats/_pyke_rules/compiled_krb/fc_rules_cf_fc.py:1359: UserWarning: Failed to create 'time' dimension coordinate: The points array must be strictly monotonic.
Gracefully creating 'time' auxiliary coordinate instead.
  error=e_msg))






    



 http://geoport-dev.whoi.edu/thredds/dodsC/estofs/atlantic
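The loop above accepts a model time series only if the nearest model point is within `max_dist` of the gauge and the series varies by at least `min_var`. A minimal sketch of that screening in plain Python 3, using made-up series and a hypothetical helper name; `statistics.pstdev` is used because NumPy's `std()` also computes the population standard deviation by default.

```python
import math
import statistics

MAX_DIST = 0.04   # degrees, as in the cell above
MIN_STD = 0.01    # metres, the value the notebook calls min_var

def is_usable(series, distance_deg, max_dist=MAX_DIST, min_std=MIN_STD):
    """True if the model point is close enough and the series is not flat."""
    close_enough = distance_deg <= max_dist
    varies = statistics.pstdev(series) >= min_std
    return close_enough and varies

# Hypothetical hourly series: a 0.5 m tide-like signal and a flat land point
tide = [0.5 * math.sin(2 * math.pi * hour / 12.42) for hour in range(24)]
flat = [0.0] * 24

print(is_usable(tide, 0.02))
print(is_usable(flat, 0.02))
print(is_usable(tide, 0.10))
```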



In [41]:

    
for df in obs_df:
    print df.head()









    



                     Observed Data  NECOFS Massachusetts  NECOFS GOM3 (FVCOM) 
2015-02-20 21:00:00         -1.117                   NaN                   NaN
2015-02-20 21:06:00         -1.215                   NaN                   NaN
2015-02-20 21:12:00         -1.307                   NaN                   NaN
2015-02-20 21:18:00         -1.400                   NaN                   NaN
2015-02-20 21:24:00         -1.515                   NaN                   NaN
                     Observed Data  NECOFS GOM3 (FVCOM)   ESTOFS Storm Surge M
2015-02-20 21:00:00         -1.053                   NaN                   NaN
2015-02-20 21:06:00         -1.042                   NaN                   NaN
2015-02-20 21:12:00         -1.032                   NaN                   NaN
2015-02-20 21:18:00         -1.024                   NaN                   NaN
2015-02-20 21:24:00         -1.013                   NaN                   NaN
                     Observed Data  NECOFS Massachusetts  \
2015-02-20 21:00:00         -0.792                   NaN   
2015-02-20 21:06:00         -0.816                   NaN   
2015-02-20 21:12:00         -0.824                   NaN   
2015-02-20 21:18:00         -0.817                   NaN   
2015-02-20 21:24:00         -0.824                   NaN   

                     ROMS ESPRESSO Real-T  NECOFS GOM3 (FVCOM)   \
2015-02-20 21:00:00                   NaN                   NaN   
2015-02-20 21:06:00                   NaN                   NaN   
2015-02-20 21:12:00                   NaN                   NaN   
2015-02-20 21:18:00                   NaN                   NaN   
2015-02-20 21:24:00                   NaN                   NaN   

                     COAWST Forecast Syst  ESTOFS Storm Surge M  
2015-02-20 21:00:00                   NaN                   NaN  
2015-02-20 21:06:00                   NaN                   NaN  
2015-02-20 21:12:00                   NaN                   NaN  
2015-02-20 21:18:00                   NaN                   NaN  
2015-02-20 21:24:00                   NaN                   NaN  
                     Observed Data  NYHOPS Forecast Mode  \
2015-02-20 21:00:00         -1.069                   NaN   
2015-02-20 21:06:00         -1.033                   NaN   
2015-02-20 21:12:00         -1.013                   NaN   
2015-02-20 21:18:00         -0.981                   NaN   
2015-02-20 21:24:00         -0.972                   NaN   

                     ROMS ESPRESSO Real-T  NECOFS GOM3 (FVCOM)   \
2015-02-20 21:00:00                   NaN                   NaN   
2015-02-20 21:06:00                   NaN                   NaN   
2015-02-20 21:12:00                   NaN                   NaN   
2015-02-20 21:18:00                   NaN                   NaN   
2015-02-20 21:24:00                   NaN                   NaN   

                     COAWST Forecast Syst  ESTOFS Storm Surge M  
2015-02-20 21:00:00                   NaN                   NaN  
2015-02-20 21:06:00                   NaN                   NaN  
2015-02-20 21:12:00                   NaN                   NaN  
2015-02-20 21:18:00                   NaN                   NaN  
2015-02-20 21:24:00                   NaN                   NaN  
                     Observed Data  NYHOPS Forecast Mode  NECOFS GOM3 (FVCOM) 
2015-02-20 21:00:00         -1.090                   NaN                   NaN
2015-02-20 21:06:00         -1.090                   NaN                   NaN
2015-02-20 21:12:00         -1.089                   NaN                   NaN
2015-02-20 21:18:00         -1.092                   NaN                   NaN
2015-02-20 21:24:00         -1.097                   NaN                   NaN
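The first five rows printed above show only NaN in every model column, so `head()` alone does not tell us whether the models contributed any values. The following is a minimal sketch, on synthetic data rather than the notebook's `obs_df`, of one way to check: count the valid samples per column and list only the rows where at least one model has a value. The column name `Model A` and the hourly model spacing are assumptions made for illustration.

```python
import numpy as np
import pandas as pd

# Synthetic stand-in for one entry of obs_df: 6-minute observations and a
# hypothetical model column ("Model A") that only has values on the hour.
index = pd.date_range('2015-02-20 21:00', periods=21, freq='6min')
df = pd.DataFrame({'Observed Data': np.linspace(-1.1, -1.5, 21),
                   'Model A': np.nan}, index=index)
df.loc[index.minute == 0, 'Model A'] = [-1.0, -1.2, -1.4]

# Number of non-NaN samples in each column.
valid_counts = df.count()

# Keep only the rows where at least one model column holds a value.
model_cols = [c for c in df.columns if c != 'Observed Data']
rows_with_model = df.dropna(how='all', subset=model_cols)

print(valid_counts)
print(rows_with_model)
```

Applied to each `df` in `obs_df`, a count of zero for a model column would indicate that no model value landed in that station's frame at all.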



In [42]:

    
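for df in obs_df:
    # plot all columns of this DataFrame, using df.name as the title
    p=df.plot(figsize=(14,6),title=df.name,legend=False)
    # draw the first line (the observed data) thick and gray, behind the others
    setp(p.lines[0],linewidth=4.0,color=[0.7,0.7,0.7],zorder=1)
    legend()
    ylabel('m')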

# plot again, but now remove the mean offset (relative to data) from all plots
for df in obs_df:
    amean=df[jd_start:jd_now].mean()
    df = df - amean + amean.ix[0]
    # print amean.ix[0]-amean
    df.plot(figsize=(14,6))
    ylabel('m')
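The offset removal described above can be sketched on synthetic data as follows. This is an illustration under stated assumptions, not the notebook's own result: the model names, the offsets, and the averaging window are made up, and the observed column is selected by label instead of `amean.ix[0]` because `.ix` has been removed from recent pandas releases (`amean.iloc[0]` would be the positional equivalent).

```python
import numpy as np
import pandas as pd

# Synthetic hourly series: two hypothetical models that match the
# observations except for a constant vertical offset.
index = pd.date_range('2015-02-20 21:00', periods=48, freq='60min')
tide = np.sin(2 * np.pi * np.arange(48) / 12.42)
df = pd.DataFrame({'Observed Data': tide,
                   'Model A': tide + 0.30,
                   'Model B': tide - 0.15}, index=index)

# Averaging window (stands in for jd_start:jd_now in the notebook).
start, stop = index[0], index[23]

# Per-column mean over the window, then shift every column so that its
# mean over the window equals the mean of the observed data.
amean = df.loc[start:stop].mean()
adjusted = df - amean + amean['Observed Data']

print(amean - amean['Observed Data'])   # offsets that were removed
```

Because the synthetic models differ from the observations only by a constant, the adjusted model columns coincide with the observed column. With real model output only the mean bias over the window is removed, and differences in phase and amplitude remain.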



In [42]:

	station_id	sensor_id	latitude (degree)	longitude (degree)	date_time	water_surface_height_above_reference_datum (m)	datum_id	vertical_position (m)
0	urn:ioos:station:NOAA.NOS.CO-OPS:8443970	urn:ioos:sensor:NOAA.NOS.CO-OPS:8443970:B1	42.3548	-71.0534	2015-02-20T21:00:00Z	0.468	urn:ioos:def:datum:noaa::MLLW	1.074
1	urn:ioos:station:NOAA.NOS.CO-OPS:8447386	urn:ioos:sensor:NOAA.NOS.CO-OPS:8447386:B1	41.7043	-71.1641	2015-02-20T21:00:00Z	-0.381	urn:ioos:def:datum:noaa::MLLW	6.356
2	urn:ioos:station:NOAA.NOS.CO-OPS:8447930	urn:ioos:sensor:NOAA.NOS.CO-OPS:8447930:B1	41.5233	-70.6717	2015-02-20T21:00:00Z	-0.492	urn:ioos:def:datum:noaa::MLLW	0.797
3	urn:ioos:station:NOAA.NOS.CO-OPS:8452660	urn:ioos:sensor:NOAA.NOS.CO-OPS:8452660:B1	41.5050	-71.3267	2015-02-20T21:00:00Z	-0.540	urn:ioos:def:datum:noaa::MLLW	0.577
4	urn:ioos:station:NOAA.NOS.CO-OPS:8454000	urn:ioos:sensor:NOAA.NOS.CO-OPS:8454000:B1	41.8071	-71.4012	2015-02-20T21:00:00Z	-0.405	urn:ioos:def:datum:noaa::MLLW	1.064
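The `station_id` column in the table above holds full IOOS URNs. If only the trailing station number is needed, for example as a short label, a small helper along these lines would do. `station_number` is a hypothetical name introduced here, not part of pyoos or of the notebook.

```python
def station_number(urn):
    """Return the last colon-separated field of a station URN."""
    return urn.split(':')[-1]

# Two station_id values copied from the table above.
station_ids = ['urn:ioos:station:NOAA.NOS.CO-OPS:8443970',
               'urn:ioos:station:NOAA.NOS.CO-OPS:8447386']

numbers = [station_number(u) for u in station_ids]
print(numbers)
```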