The water network consists of a streamflow network, which connects river gauges, reservoirs, and junctions, and a canal network. The canal network is a bipartite network whose links connect nodes in the streamflow network to counties. The initwaternet.jl file loads both networks into the global environment. It uses cached Julia Data (.jld) files to speed up loading, creating them if they are not present in the data directory.

Scientific documentation

The river network is documented here: https://www.overleaf.com/read/gftdkjjkdrsn

The Canal Network

The canal network is first produced in R and then loaded into Julia. In Julia, it is used as a plain DataFrame, with the names of the river network nodes added.

The columns in the waterdraws.jld data are as follows:

The fips column is the FIPS code of the receiving county.
The source column is the row number of the gauge, as it appears in the network variable of the waternet.RData file.
The justif column provides a justification for why the gauge should be available for feeding the county. It contains short categorical entries, described in the R script that generates the countydraws.RData file (network3/demand/allcounties.R).
The downhill column compares the elevation of the county with the elevation of the gauge, if we know it. It is 1 if the county's average elevation is below the gauge's, such that the water can be tapped for free.
The exdist column is greater than 0 if the county had to be connected to a gauge arbitrarily to ensure that it had any source. In this case, the county is connected to the closest gauge, and this column is the geodesic distance in km.
The gaugeid is the only column added by Julia, and is in the same format as the keys in the wateridverts dictionary, which allows easy access to the nodes in the river network by name.
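
As a minimal sketch of how the downhill and exdist columns can be read together, the snippet below describes a single connection from its column values. The describe_link helper, its wording, and the use of missing for the unknown-elevation case (shown as NA in the data) are assumptions for illustration; they are not part of the repository.

```julia
# Hypothetical helper: summarize one canal link from its column values.
# `downhill` is `missing` when the gauge elevation is unknown (NA in the data).
function describe_link(downhill::Union{Missing,Int}, exdist::Float64)
    # exdist > 0 marks a county that was attached to its closest gauge
    origin = exdist > 0 ? "nearest-gauge fallback ($(exdist) km)" : "regular link"
    cost = ismissing(downhill) ? "elevation unknown" :
           downhill == 1 ? "downhill (free to tap)" : "not downhill"
    return "$origin, $cost"
end

println(describe_link(1, 0.0))
println(describe_link(missing, 0.0))
println(describe_link(0, 12.5))
```

The 12.5 km distance in the last call is a made-up value, used only to exercise the fallback branch.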



using Serialization  # provides deserialize on Julia 0.7 and later
using DataFrames
# Load the cached canal network (one row per link between a network node and a county)
draws = deserialize(open("../data/waterdraws.jld", "r"))

    fips  source  justif    downhill  exdist  gaugeid
 1  1001  2262    contains  0         0.0     usgs.02422500
 2  1003  2174    contains  0         0.0     usgs.02376500
 3  1003  2175    contains  0         0.0     usgs.02377570
 4  1003  2176    contains  0         0.0     usgs.02378170
 5  1003  2177    contains  0         0.0     usgs.02378300
 6  1003  2178    contains  0         0.0     usgs.02378500
 7  1003  12946   contains  NA        0.0     junction.1405-dn
 8  1003  12990   contains  NA        0.0     junction.1487-dn
 9  1003  13046   contains  NA        0.0     junction.1607-up
10  1005  2085    contains  1         0.0     usgs.02342933
11  1005  13389   contains  NA        0.0     junction.2322-up
12  1005  13404   contains  NA        0.0     junction.2350-up
13  1007  2275    contains  1         0.0     usgs.02424000
14  1009  2314    contains  0         0.0     usgs.02449882
15  1009  2315    contains  0         0.0     usgs.02450000
16  1009  2316    contains  0         0.0     usgs.02450180
17  1009  2321    contains  0         0.0     usgs.02455000
18  1009  2322    contains  1         0.0     usgs.02455185
19  1009  11417   contains  1         0.0     reservoir.2341
20  1009  11423   contains  0         0.0     reservoir.2347
21  1009  14095   contains  NA        0.0     junction.3862-dn
22  1009  14110   contains  NA        0.0     junction.3896-up
23  1009  14181   contains  NA        0.0     junction.4085-up
24  1011  11553   contains  0         0.0     reservoir.2482
25  1015  2233    contains  1         0.0     usgs.02403310
26  1015  2234    contains  1         0.0     usgs.02403500
27  1015  11425   contains  1         0.0     reservoir.2349
28  1019  2225    contains  0         0.0     usgs.02398300
29  1019  2227    contains  0         0.0     usgs.02399200
30  1019  2228    contains  0         0.0     usgs.02400100
 ⋮  ⋮     ⋮       ⋮         ⋮         ⋮       ⋮
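
Because the canal network is bipartite, a common first step is to collect the network nodes available to each county. The following is a rough sketch using a few rows copied from the output above. It uses plain named tuples rather than a DataFrame so that it runs without extra packages, and the names sources_by_county and node_type are illustrative, not part of the codebase.

```julia
# A few (fips, gaugeid) pairs copied from the sample output.
rows = [
    (fips = 1001, gaugeid = "usgs.02422500"),
    (fips = 1005, gaugeid = "usgs.02342933"),
    (fips = 1005, gaugeid = "junction.2322-up"),
    (fips = 1005, gaugeid = "junction.2350-up"),
    (fips = 1007, gaugeid = "usgs.02424000"),
]

# Map each county to the list of network nodes it can draw from.
sources_by_county = Dict{Int,Vector{String}}()
for row in rows
    push!(get!(sources_by_county, row.fips, String[]), row.gaugeid)
end

# In the sample, the text before the first dot names the node type
# (usgs, junction, or reservoir).
node_type(gaugeid) = first(split(gaugeid, "."; limit = 2))

println(sources_by_county[1005])
println(node_type("reservoir.2341"))
```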

Development and contributing

You can replace the data/countydraws.RData file with another R Data file that contains the variable draws. At a minimum, draws should be a data.frame with the columns fips and source.

Until the script for generating the countydraws.RData file is migrated into the repository, please do the following to extend the dataset:

Copy the countydraws.RData file into a new sources/waternet directory as countydraws.v1.RData.
Add your script for modifying the data to the same directory, and have it output a new countydraws.v2.RData file. Copy this into the data directory.
If there is already a countydraws.v(N).RData file (for $N \ge 2$), use the latest one as your input, and output a file named countydraws.v(N+1).RData.
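
The versioning convention above can be sketched as a small helper that picks the latest input file and names the next output. The next_version function is hypothetical and only illustrates the naming rule; it does not read or write any files.

```julia
# Hypothetical helper: given the file names found in sources/waternet, return
# the latest versioned input and the name the new output should take.
function next_version(filenames)
    pattern = r"^countydraws\.v(\d+)\.RData$"
    versions = Int[]
    for name in filenames
        m = match(pattern, name)
        m === nothing || push!(versions, parse(Int, m.captures[1]))
    end
    isempty(versions) && error("no countydraws.v(N).RData file found; copy v1 first")
    latest = maximum(versions)
    return ("countydraws.v$(latest).RData", "countydraws.v$(latest + 1).RData")
end

infile, outfile = next_version(["countydraws.v1.RData", "countydraws.v2.RData", "notes.txt"])
println("read $infile, write $outfile")
```

Comparing the version numbers as integers, rather than sorting the file names, keeps v10 ordered after v9.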

Future work:

Have each canal specify a flow limit.
Include a column for an optional price for using that canal.

Missing canals

By Laureline

Several utilities and facilities operate across multiple counties. For instance, the New York City water supply system sources its water from the Catskill Mountains and the Delaware River in Delaware County and distributes it to all of the boroughs.

Utilities of this type are not rare and occur at many locations across the contiguous United States. To let the water network link the point of source to the point of use, additional connections have been added to the countydraws file.

Cross-county utilities

The first step is to find the utilities and facilities that operate across multiple counties. This is done by looking up all of the water utilities operating within a given county on the Drinking Water Mapping Application to Protect Source Waters website (https://dwmapspublic.rti.org/). The website then redirects to the Safe Drinking Water Information System (SDWIS) Federal Reporting Services, which provides the list of counties served by a given water system (for example, https://ofmpub.epa.gov/apex/sfdw/f?p=SDWIS_FED_REPORTS_PUBLIC:PWS_SEARCH:::::PWSID:NY7003493). The result is a dataset, canals.txt, whose first column is the source county (identified by its FIPS code) and whose remaining columns list the counties served by the water facilities located in the source county.
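
To make the layout of canals.txt concrete, here is a minimal parsing sketch. It assumes one source county per line with whitespace-separated FIPS codes; the actual delimiter is not stated here and may differ. The sample lines are made up for illustration (the first mirrors the New York City example, with Delaware County as the source) and are not taken from the real file. The parse_canals name is likewise hypothetical.

```julia
# Parse canals.txt-style text into a map: source county => served counties.
# Assumption: one source county per line, FIPS codes separated by whitespace.
function parse_canals(text::AbstractString)
    canals = Dict{Int,Vector{Int}}()
    for line in split(text, '\n')
        fields = split(line)
        isempty(fields) && continue            # skip blank lines
        codes = parse.(Int, fields)
        # first code is the source county, the rest are the counties it serves
        append!(get!(canals, codes[1], Int[]), codes[2:end])
    end
    return canals
end

# Illustrative content only, not the real canals.txt.
sample = """
36025 36005 36047 36061 36081 36085
1003 1001
"""

canals = parse_canals(sample)
println(canals[36025])
```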

Add missing canals to the water network

The second step is to complete the countydraws dataset. This is done using the R script script_incorporation_missing_data.R, which simply adds a connection between each gauge within a source county and each point-of-use county.
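
The logic of this step can be illustrated in Julia as follows. This is a sketch of the idea, not a port of script_incorporation_missing_data.R. It assumes that rows whose justif is "contains" identify the gauges located within a county (an inference from the sample output), and it labels the new links with a hypothetical justif value, "canal". The canals entry is also made up.

```julia
# Existing links, using three rows from the sample waterdraws output.
draws = [
    (fips = 1001, source = 2262, justif = "contains"),
    (fips = 1003, source = 2174, justif = "contains"),
    (fips = 1003, source = 2175, justif = "contains"),
]

# Hypothetical canals entry: water systems in county 1003 also serve 1001.
canals = Dict(1003 => [1001])

function add_missing_canals(draws, canals)
    result = copy(draws)
    existing = Set((d.fips, d.source) for d in draws)
    for d in draws
        d.justif == "contains" || continue              # assumed marker: gauge lies in county
        for target in get(canals, d.fips, Int[])
            (target, d.source) in existing && continue  # avoid duplicate links
            push!(existing, (target, d.source))
            push!(result, (fips = target, source = d.source, justif = "canal"))
        end
    end
    return result
end

extended = add_missing_canals(draws, canals)
println(length(extended))
```

Tracking the existing (fips, source) pairs keeps the operation idempotent, so running it again on its own output adds no further rows.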

Current status

As the number of water utilities is large, this task has not yet been completed for all counties. As a starting point, we focused our research on problematic areas: counties with a suspicious ratio of public supply withdrawals to population, and counties with large populations. The following plot (fig 1.a) shows the USGS 2010 public supply fresh water withdrawals as a function of population. The dots in green are the counties that have been added to the missing canals set. Using this new set of connections, a public supply demand has been estimated by assuming that the withdrawals performed within a given county are distributed to all of the linked counties (including itself) proportionally to their populations. The new estimated demand is shown as a function of population in fig 1.b, and a comparison between demands and withdrawals for the treated counties can be found in fig 1.c.
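
The proportional allocation described above can be sketched as follows: for each source county, the withdrawals are split among the source and the counties it serves in proportion to population. The function name, the county identifiers, and all numbers are made up for illustration and are not USGS data.

```julia
# withdrawals[c]: public supply withdrawals recorded in county c
# population[c]:  population of county c
# served[c]:      other counties supplied from county c (missing-canals set)
function estimate_demand(withdrawals, population, served)
    demand = Dict(c => 0.0 for c in keys(population))
    for (src, w) in withdrawals
        # the source county itself is always part of the group it supplies
        group = unique(vcat(src, get(served, src, Int[])))
        total_pop = sum(population[c] for c in group)
        for c in group
            demand[c] += w * population[c] / total_pop
        end
    end
    return demand
end

# Illustrative values: county 1 supplies itself and county 2.
withdrawals = Dict(1 => 90.0, 2 => 10.0)
population = Dict(1 => 1000, 2 => 2000)
served = Dict(1 => [2])

demand = estimate_demand(withdrawals, population, served)
println(demand)
```

In this toy case, county 1 keeps a third of its own withdrawals, and county 2 receives the remaining two thirds in addition to its own withdrawals. The total across counties is unchanged, which is a useful sanity check on any implementation of this rule.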

The following figure shows the difference between withdrawals and the estimated demand for the entire contiguous United States.
