Gadfly

Carlos Lizama

Outline:

Introduction
The grammar of graphics
Plotting arrays and functions.
Plotting DataFrames
Pros and Cons.

Introduction

Leland Wilkinson (2005) created the Grammar of Graphics to describe deep features that underlie all statistical graphics.

The grammar tells us that a statistical graphic is a mapping from data to aesthetic attributes (color, shape, size) of geometric objects (points, lines, bars). The plot may also contain statistical transformations of the data and is drawn on a specific coordinate system. Faceting can be used to generate the same plot for different subsets of the dataset. It is the combination of these independent components that makes up a graphic.

Hadley Wickham (2009) built on Wilkinson's grammar and adapted it to R in the ggplot2 package.
Gadfly is a package that implements the Grammar of Graphics in Julia, based mainly on ggplot2.

The grammar of graphics

The main components of the grammar:

The data that we want to visualize, together with a set of aesthetic mappings describing how variables in the data are mapped to the aesthetic attributes that define how the data should be perceived.
Geometric objects, geoms for short, represent what we actually see on the plot: points, lines, polygons, etc.
Statistical transformations, stats for short, summarize data in many useful ways, for instance by binning observations to create a histogram. Stats are optional, but very useful.
Scales map values in the data space to values in an aesthetic space, whether it be color, size or shape. Scales also draw the legend or axes.
A coordinate system, coord for short, describes how data coordinates are mapped to the plane of the graphic. It also provides axes and gridlines to make it possible to read the graph. Examples are the Cartesian coordinate system and the polar coordinate system.
A faceting specification describes how to break up the data into subsets and how to display those subsets as small multiples.
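As a rough illustration of how these components show up in a single Gadfly call, the sketch below annotates each argument with the part of the grammar it plays. It assumes a recent Gadfly release on Julia 1.x; the vectors xs and ys are made-up data, not taken from these notes, and no faceting is used.

```julia
using Gadfly   # assumption: a recent Gadfly release on Julia 1.x

# Data: two plain vectors (illustrative values only)
xs = collect(1.0:10.0)
ys = xs .^ 2

p = plot(
    x = xs, y = ys,                                 # aesthetic mappings: data -> x and y
    Geom.point, Geom.line,                          # geoms: what is actually drawn
    Scale.y_log10,                                  # scale: y is placed on a log10 axis
    Coord.cartesian(xmin = 0, xmax = 11),           # coordinate system and its x limits
    Guide.xlabel("x"), Guide.ylabel("x squared"),   # guides: axis labels
)
```

No statistic is named explicitly here; geometries such as Geom.histogram bring their own default statistic, as the comments in the later cells point out.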

The Grammar of Graphics in Julia

In Julia, we can specify:

aesthetics, scales, coordinates, guides, geometries, and statistics.

Data

The data is usually supplied as a DataFrame, although the DataFrame is optional: plain arrays can be bound to aesthetics directly.
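The two ways of supplying data can be sketched side by side. This assumes recent Gadfly and DataFrames releases on Julia 1.x; the data frame df and its values are invented for illustration. Columns are referenced by symbol here, while the cells later in these notes use strings.

```julia
using Gadfly, DataFrames   # assumption: recent releases of both packages

# Illustrative data, not taken from the notebook
df = DataFrame(x = 1:5, y = [2.0, 4.1, 5.9, 8.2, 9.8])

# With a DataFrame: aesthetics are bound to columns by name
p_df = plot(df, x = :x, y = :y, Geom.point)

# Without a DataFrame: aesthetics are bound directly to arrays
p_arrays = plot(x = collect(1:5), y = [2.0, 4.1, 5.9, 8.2, 9.8], Geom.point)
```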

Statistics

Statistics are functions that take one or more aesthetics as input, operate on those values, and output one or more aesthetics. For example, a boxplot typically uses the boxplot statistic (Stat.boxplot), which takes the x and y aesthetics as input and outputs the middle, the upper and lower hinge, and the upper and lower fence aesthetics.
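To make the inputs and outputs of such a statistic concrete, here is a plain-Julia sketch of the five values a boxplot needs. The helper name boxplot_summary and the 1.5 × IQR fence rule (Tukey's convention) are assumptions for illustration; this is not Gadfly's internal code, and Stat.boxplot may compute its fences differently. The sample values are the horsebean weights from the chickwts table printed later in these notes.

```julia
using Statistics  # stdlib on Julia 1.x; provides quantile

# Hypothetical helper: the five summary values a boxplot draws.
function boxplot_summary(values)
    lower_hinge, middle, upper_hinge = quantile(values, [0.25, 0.5, 0.75])
    iqr = upper_hinge - lower_hinge
    # Assumed fence rule: most extreme observations within 1.5 IQR of the hinges
    lower_fence = minimum(v for v in values if v >= lower_hinge - 1.5 * iqr)
    upper_fence = maximum(v for v in values if v <= upper_hinge + 1.5 * iqr)
    return (lower_fence = lower_fence, lower_hinge = lower_hinge, middle = middle,
            upper_hinge = upper_hinge, upper_fence = upper_fence)
end

# Horsebean weights (rows 1 to 10 of the chickwts table)
weights = [179, 160, 136, 227, 217, 168, 108, 124, 143, 140]
hb = boxplot_summary(weights)
println(hb)
```

The y aesthetic (the weights) goes in, and five new aesthetics come out; a geometry then only has to draw them.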

Scales

Scales, like statistics, apply a transformation to the original data, typically mapping one aesthetic to the same aesthetic while retaining the original value; an example is Scale.x_log.
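The idea of "transforming while retaining the original value" can be sketched in plain Julia. This only illustrates the concept with made-up numbers; it is not how Gadfly implements its scales.

```julia
# Illustrative values only
data = [1.0, 10.0, 100.0, 1000.0]

# A log scale changes where each value is placed along the axis...
positions = log10.(data)

# ...but each position keeps its original value for the tick label.
ticks = [(position = p, label = string(v)) for (p, v) in zip(positions, data)]

for t in ticks
    println("position ", t.position, " is labelled ", t.label)
end
```

The points end up evenly spaced on the axis, yet the reader still sees the untransformed values.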

Geometries

Geometries are responsible for actually doing the drawing. A geometry takes as input one or more aesthetics and uses the data bound to these aesthetics to draw things.

Guides

Very similar to geometries are guides, which draw graphics supporting the actual visualization, such as axis ticks and labels and color keys. The major distinction is that geometries always draw within the rectangular plot frame, while guides have some special layout considerations.



In [1]:

    
using Gadfly

Gadfly examples

Plotting arrays and anonymous functions



In [2]:

    
# plot arrays

x = collect(linspace(-5,5,8))
y = 5*cos(x)+x
plot(x=x, y=y)









    Out[2]:



In [3]:

    
# geometry options: Geom.point, Geom.line, Geom.smooth, Geom.step, ...
x = collect(linspace(-5,5,8))
y = 5*cos(x)+x
plot(x=x, y=y, Geom.step())  # optional arguments: direction :vh :hv









    Out[3]:



In [4]:

    
# plot functions over an interval (named here, anonymous in the next cell)
plot([sin, cos], -5, 5)









    Out[4]:



In [5]:

    
plot([x->5*cos(x) + x, x->5*sin(x) + x], -5, 5)









    Out[5]:



In [6]:

    
f(x) = 5*cos(x) + x









    Out[6]:





f (generic function with 1 method)



In [7]:

    
# customize plots: title, axis, labels, ...
plot(f, -4, 4, Guide.xlabel("variable x"), Guide.ylabel("variable y=f(x)"), Guide.title("This is the title"))









    Out[7]:



In [8]:

    
# more on Guide, xrug, yrug. 
plot(x=x, y=y, Guide.xrug, Guide.yrug)









    Out[8]:



In [9]:

    
# more on guide: ticks
xt = [-3, -2, 3, 4]
yt = [-1, 0, 2]
plot(x=x, y=y, Geom.line, Guide.xticks(ticks=xt, orientation=:vertical), Guide.yticks(ticks=yt))
# optional argument: label=true or label=false.









    Out[9]:



In [10]:

    
# more than one plot at the same time: layers
y1 = y+1
plot(layer(x=x, y=y, Geom.line), layer(x=x, y=y1, Geom.smooth, Theme(default_color=colorant"red")))









    Out[10]:



In [11]:

    
# others: Geom: vline, hline.
plot(x=x, y=y, xintercept=[4], yintercept=[-2], Geom.line, Geom.hline(), Geom.vline(color=colorant"orange", size=1mm))









    Out[11]:



In [12]:

    
# histograms
x0 = randn(10000)
plot(x=x0, Geom.histogram)









    Out[12]:



In [13]:

    
# Scale. x_continuous, y_continuous, x_log, x_log10, etc. Same for y.
plot(x=x, y=y, Scale.x_continuous(format=:scientific), Geom.line)









    Out[13]:



In [14]:

    
# Coord.cartesian xmin, xmax
plot(x=x, y=y, Coord.cartesian(xmin=-2,xmax=4))









    Out[14]:



In [15]:

    
# Scale. x_continuous, y_continuous, x_log, x_log10, etc. Same for y.
x1 = collect(linspace(0,1,10))
y1 = exp(x1)
plot(x=x1, y=y1, Scale.y_log())









    Out[15]:



In [16]:

    
# Other features: Geom.path
n = 10
xjumps = randn(n)
yjumps = randn(n)
plot(x=cumsum(xjumps),y=cumsum(yjumps),Geom.path())









    Out[16]:



In [17]:

    
# Other features: Geom.ribbon
ymin = y - 1
ymax = y + 1
plot(x=x, y=y, ymax=ymax, ymin=ymin, Geom.line, Geom.ribbon)









    Out[17]:

Plotting Datasets



In [18]:

    
using RDatasets
using DataFrames



In [19]:

    
Data1 = dataset("car","Salaries")
# The 2008-09 nine-month academic salary for Assistant Professors, Associate Professors and 
# Professors in a college in the U.S.









    Out[19]:




Rank Discipline YrsSincePhD YrsService Sex Salary
1 Prof B 19 18 Male 139750
2 Prof B 20 16 Male 173200
3 AsstProf B 4 3 Male 79750
4 Prof B 45 39 Male 115000
5 Prof B 40 41 Male 141500
6 AssocProf B 6 6 Male 97000
7 Prof B 30 23 Male 175000
8 Prof B 45 45 Male 147765
9 Prof B 21 20 Male 119250
10 Prof B 18 18 Female 129000
11 AssocProf B 12 8 Male 119800
12 AsstProf B 7 2 Male 79800
13 AsstProf B 1 1 Male 77700
14 AsstProf B 2 0 Male 78000
15 Prof B 20 18 Male 104800
16 Prof B 12 3 Male 117150
17 Prof B 19 20 Male 101000
18 Prof A 38 34 Male 103450
19 Prof A 37 23 Male 124750
20 Prof A 39 36 Female 137000
21 Prof A 31 26 Male 89565
22 Prof A 36 31 Male 102580
23 Prof A 34 30 Male 93904
24 Prof A 24 19 Male 113068
25 AssocProf A 13 8 Female 74830
26 Prof A 21 8 Male 106294
27 Prof A 35 23 Male 134885
28 AsstProf B 5 3 Male 82379
29 AsstProf B 11 0 Male 77000
30 Prof B 12 8 Male 118223
⋮ ⋮ ⋮ ⋮ ⋮ ⋮ ⋮



In [20]:

    
# density
plot(Data1, x="Salary", Geom.density)   # Geom.density = Geom.line, Stat.density









    Out[20]:



In [21]:

    
# histogram
plot(Data1, x="Salary", Geom.histogram, color="Discipline") # Geom.histogram = Geom.bar, Stat.histogram









    Out[21]:



In [22]:

    
# 2D histogram
plot(Data1, x="Salary", y="YrsService", Geom.histogram2d(xbincount=40, ybincount=40))









    Out[22]:



In [23]:

    
# error bars
using Distributions

sds = [1, 1/2, 1/4, 1/8, 1/16, 1/32]
n = 10
ys = [mean(rand(Normal(0, sd), n)) for sd in sds]
ymins = ys .- (1.96 * sds / sqrt(n))
ymaxs = ys .+ (1.96 * sds / sqrt(n))

plot(x=1:length(sds), y=ys, ymin=ymins, ymax=ymaxs, Geom.point, Geom.errorbar)









    Out[23]:



In [24]:

    
Data2 = dataset("datasets","USArrests")
# This data set contains statistics, in arrests per 100,000 residents for assault, murder, and rape in 
# each of the 50 US states in 1973. Also given is the percent of the population living in urban areas.









    Out[24]:




State Murder Assault UrbanPop Rape
1 Alabama 13.2 236 58 21.2
2 Alaska 10.0 263 48 44.5
3 Arizona 8.1 294 80 31.0
4 Arkansas 8.8 190 50 19.5
5 California 9.0 276 91 40.6
6 Colorado 7.9 204 78 38.7
7 Connecticut 3.3 110 77 11.1
8 Delaware 5.9 238 72 15.8
9 Florida 15.4 335 80 31.9
10 Georgia 17.4 211 60 25.8
11 Hawaii 5.3 46 83 20.2
12 Idaho 2.6 120 54 14.2
13 Illinois 10.4 249 83 24.0
14 Indiana 7.2 113 65 21.0
15 Iowa 2.2 56 57 11.3
16 Kansas 6.0 115 66 18.0
17 Kentucky 9.7 109 52 16.3
18 Louisiana 15.4 249 66 22.2
19 Maine 2.1 83 51 7.8
20 Maryland 11.3 300 67 27.8
21 Massachusetts 4.4 149 85 16.3
22 Michigan 12.1 255 74 35.1
23 Minnesota 2.7 72 66 14.9
24 Mississippi 16.1 259 44 17.1
25 Missouri 9.0 178 70 28.2
26 Montana 6.0 109 53 16.4
27 Nebraska 4.3 102 62 16.5
28 Nevada 12.2 252 81 46.0
29 New Hampshire 2.1 57 56 9.5
30 New Jersey 7.4 159 89 18.8
⋮ ⋮ ⋮ ⋮ ⋮ ⋮



In [25]:

    
# labels
plot(Data2, x="UrbanPop", y="Murder", label="State" , Geom.label, Geom.point)









    Out[25]:



In [26]:

    
Data3 = dataset("datasets","chickwts")
# An experiment was conducted to measure and compare the effectiveness of various feed supplements 
# on the growth rate of chickens.









    Out[26]:




Weight Feed
1 179 horsebean
2 160 horsebean
3 136 horsebean
4 227 horsebean
5 217 horsebean
6 168 horsebean
7 108 horsebean
8 124 horsebean
9 143 horsebean
10 140 horsebean
11 309 linseed
12 229 linseed
13 181 linseed
14 141 linseed
15 260 linseed
16 203 linseed
17 148 linseed
18 169 linseed
19 213 linseed
20 257 linseed
21 244 linseed
22 271 linseed
23 243 soybean
24 230 soybean
25 248 soybean
26 327 soybean
27 329 soybean
28 250 soybean
29 193 soybean
30 271 soybean
⋮ ⋮ ⋮



In [27]:

    
# boxplot
plot(Data3, x="Feed", y="Weight", Geom.boxplot)









    Out[27]:



In [28]:

    
# Data by categories.
Data4 = dataset("Ecdat","Wages1")
# a panel of 595 observations from 1976 to 1982









    Out[28]:




Exper Sex School Wage
1 9 female 13 6.3152956461
2 12 female 12 5.4797699786
3 11 female 11 3.6421699174
4 9 female 14 4.5933365997
5 8 female 14 2.4181574607
6 9 female 14 2.0940581101
7 8 female 12 5.5120039196
8 10 female 12 3.5484271597
9 12 female 10 5.8182263596
10 7 female 12 3.8277804998
11 10 female 14 6.7368936796
12 10 female 13 12.861342479
13 10 female 13 7.6555609995
14 9 female 10 2.4497795198
15 10 female 13 6.1056621468
16 3 female 15 8.2680058795
17 9 female 14 1.8624054829
18 13 female 11 3.9808917197
19 10 female 12 4.7634601775
20 11 female 12 6.3805955861
21 8 female 14 5.6311238472
22 7 female 11 2.4880573248
23 7 female 14 1.2237382772
24 5 female 12 1.3456535391
25 5 female 14 0.7511116452
26 10 female 14 11.330230279
27 11 female 12 3.8947576562
28 6 female 13 2.9985152513
29 11 female 12 3.2678961879
30 9 female 12 1.3551971812
⋮ ⋮ ⋮ ⋮ ⋮



In [29]:

    
plot(Data4, x="Exper", y="Wage", color="Sex")









    Out[29]:



In [30]:

    
Data5 = dataset("Zelig", "approval")
# Monthly approval ratings for the President of the United States; the rows shown start in February 2001.









    Out[30]:




Month Year Approve Disapprove Unsure SeptOct2001 IraqWar AvgPrice
1 2 2001 58.67 23.67 17.67 0 0 144.975
2 3 2001 58.0 26.67 15.33 0 0 140.925
3 4 2001 60.5 29.5 10.0 0 0 155.16
4 5 2001 55.0 33.33 11.67 0 0 170.175
5 6 2001 54.0 34.0 12.0 0 0 161.625
6 7 2001 56.5 34.0 9.5 0 0 142.06
7 8 2001 56.0 35.0 9.0 0 0 142.075
8 9 2001 75.67 18.33 6.0 1 0 152.15
9 10 2001 88.0 9.0 3.0 1 0 131.54
10 11 2001 87.0 8.67 4.33 0 0 117.05
11 12 2001 86.0 10.5 3.5 0 0 108.6
12 1 2002 83.67 12.67 3.67 0 0 110.725
13 2 2002 82.0 14.0 4.0 0 0 111.375
14 3 2002 79.25 15.75 5.0 0 0 124.925
15 4 2002 76.25 19.0 4.75 0 0 139.7
16 5 2002 76.33 17.67 6.0 0 0 139.175
17 6 2002 73.4 20.2 6.4 0 0 138.225
18 7 2002 70.5 23.5 6.0 0 0 139.7
19 8 2002 66.5 27.0 6.5 0 0 139.575
20 9 2002 67.2 28.2 4.6 0 0 139.96
21 10 2002 64.75 29.0 6.25 0 0 144.525
22 11 2002 66.33 27.0 6.67 0 0 141.9
23 12 2002 62.75 31.5 5.75 0 0 138.58
24 1 2003 60.17 35.0 4.83 0 0 145.75
25 2 2003 58.75 35.75 5.5 0 0 161.3
26 3 2003 65.2 30.6 4.2 0 1 169.3
27 4 2003 70.0 25.75 4.25 0 1 158.9
28 5 2003 66.33 30.0 3.67 0 1 149.725
29 6 2003 62.0 34.33 3.67 0 0 149.28
30 7 2003 59.67 36.67 3.67 0 0 151.25
⋮ ⋮ ⋮ ⋮ ⋮ ⋮ ⋮ ⋮ ⋮



In [31]:

    
plot(Data5, x="Month",  y="Approve", color="Year", Geom.line)









    Out[31]:



In [32]:

    
Data6 = dataset("Zelig","macro")
# Selected macroeconomic indicators for many countries.









    Out[32]:




Country Year GDP Unem CapMob Trade
1 United States 1966 5.1111407 3.8 0 9.622906
2 United States 1967 2.2772829 3.8 0 9.983546
3 United States 1968 4.7 3.6 0 10.08912
4 United States 1969 2.8 3.5 0 10.43593
5 United States 1970 -0.2 4.9 0 10.49535
6 United States 1971 3.1 5.9 0 11.27827
7 United States 1972 5.4 5.6 0 11.21771
8 United States 1973 5.7 4.9 0 11.76705
9 United States 1974 -0.9 5.6 0 13.77255
10 United States 1975 -0.8 8.5 0 17.42326
11 United States 1976 4.7 7.7 0 16.52211
12 United States 1977 5.5 7.1 0 17.23492
13 United States 1978 4.7 6.1 0 17.54099
14 United States 1979 2.6 5.8 0 18.17591
15 United States 1980 -0.4 7.1 0 19.73285
16 United States 1981 3.4 7.5 0 21.51057
17 United States 1982 -3.0 9.5 0 20.53895
18 United States 1983 2.9 9.5 0 18.56972
19 United States 1984 7.2 7.5 0 17.81588
20 United States 1985 3.8 7.1 0 18.02899
21 United States 1986 2.8 7.0 0 17.20371
22 United States 1987 3.7 6.2 0 17.23095
23 United States 1988 4.6 5.5 0 18.29418
24 United States 1989 2.8 5.2700837 0 19.413526
25 United States 1990 0.9 5.4145596 0 20.638364
26 Canada 1966 6.8021676 3.6 0 38.45467
27 Canada 1967 2.9236458 4.1 0 40.16167
28 Canada 1968 5.6 4.8 0 41.06574
29 Canada 1969 5.2 4.7 0 42.76849
30 Canada 1970 2.6 5.9 0 44.16533
⋮ ⋮ ⋮ ⋮ ⋮ ⋮ ⋮



In [33]:

    
plot(Data6, x = "Year", y="GDP", color="Country", Geom.line)









    Out[33]:



In [34]:

    
Data7 = dataset("vcd","Suicide")
# Data from Heuer (1979) on suicide rates in West Germany classified by age, sex, and method of suicide.









    Out[34]:




Freq Sex Method Age AgeGroup Method2
1 4 male poison 10 10-20 poison
2 0 male cookgas 10 10-20 gas
3 0 male toxicgas 10 10-20 gas
4 247 male hang 10 10-20 hang
5 1 male drown 10 10-20 drown
6 17 male gun 10 10-20 gun
7 1 male knife 10 10-20 knife
8 6 male jump 10 10-20 jump
9 0 male other 10 10-20 other
10 348 male poison 15 10-20 poison
11 7 male cookgas 15 10-20 gas
12 67 male toxicgas 15 10-20 gas
13 578 male hang 15 10-20 hang
14 22 male drown 15 10-20 drown
15 179 male gun 15 10-20 gun
16 11 male knife 15 10-20 knife
17 74 male jump 15 10-20 jump
18 175 male other 15 10-20 other
19 808 male poison 20 10-20 poison
20 32 male cookgas 20 10-20 gas
21 229 male toxicgas 20 10-20 gas
22 699 male hang 20 10-20 hang
23 44 male drown 20 10-20 drown
24 316 male gun 20 10-20 gun
25 35 male knife 20 10-20 knife
26 109 male jump 20 10-20 jump
27 289 male other 20 10-20 other
28 789 male poison 25 25-35 poison
29 26 male cookgas 25 25-35 gas
30 243 male toxicgas 25 25-35 gas
⋮ ⋮ ⋮ ⋮ ⋮ ⋮ ⋮



In [35]:

    
# grouped data.
p = plot(Data7, xgroup="Sex", ygroup="Method", x="Age", y="Freq", Geom.subplot_grid(Geom.bar))









    Out[35]:



In [36]:

    
draw(SVG("myplot.svg", 14cm, 25cm), p)  # to save in other formats (e.g. PNG or PDF), load the Cairo and Fontconfig packages.
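As a sketch of saving to formats other than SVG: the example below assumes a recent Gadfly release on Julia 1.x with the Cairo and Fontconfig packages installed. The plot q, the file names, and the dimensions are placeholders chosen for illustration.

```julia
using Gadfly
import Cairo, Fontconfig   # assumption: both installed; they enable the PNG and PDF backends

# A small stand-in plot (illustrative data)
q = plot(x = 1:10, y = (1:10) .^ 2, Geom.line)

# Same draw call as for SVG, with a different backend
draw(PNG("myplot.png", 14cm, 10cm), q)
draw(PDF("myplot.pdf", 14cm, 10cm), q)
```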



In [37]:

    
# contour
volcano = convert(Array,(dataset("datasets","volcano")))









    Out[37]:





87x61 Array{Int64,2}:
 100  100  101  101  101  101  101  100  …  106  106  105  105  104  104  103
 101  101  102  102  102  102  102  101     107  106  106  105  105  104  104
 102  102  103  103  103  103  103  102     107  107  106  106  105  105  104
 103  103  104  104  104  104  104  103     108  107  107  106  106  105  105
 104  104  105  105  105  105  105  104     108  107  107  107  106  106  105
 105  105  105  106  106  106  106  105  …  108  108  108  107  107  106  106
 105  106  106  107  107  107  107  106     109  109  108  108  107  107  106
 106  107  107  108  108  108  108  107     110  109  109  108  108  107  106
 107  108  108  109  109  109  109  108     110  110  109  109  108  107  107
 108  109  109  110  110  110  110  109     111  110  110  109  108  107  107
 109  110  110  111  111  111  111  110  …  112  111  110  109  108  107  106
 110  110  111  113  112  111  113  112     113  111  110  109  108  107  106
 110  111  113  115  114  113  114  114     114  112  110  109  108  107  105
   ⋮                        ⋮            ⋱         ⋮                        ⋮
 102  103  103  104  104  105  106  106  …   96   96   96   96   96   96   96
 101  102  103  103  104  105  105  106      96   96   96   96   96   96   96
 100  101  102  102  103  103  104  104      96   96   96   96   96   96   96
 100  101  101  102  102  103  103  104      96   96   96   96   96   96   95
  99  100  101  102  102  103  103  103      96   96   96   96   96   95   95
  99  100  100  101  101  102  102  102  …   95   95   95   95   95   95   95
  99  100  100  100  101  101  101  102      95   95   95   95   95   95   94
  99   99   99   99  100  100  101  101      95   94   94   94   94   94   94
  98   99   99   99   99  100  100  101      94   94   94   94   94   94   94
  98   98   98   99   99   99  100  100      94   94   94   94   94   94   94
  97   98   98   98   99   99   99  100  …   94   94   94   94   94   94   94
  97   97   97   98   98   99   99   99      94   94   94   94   94   94   94



In [38]:

    
plot(z=volcano, Geom.contour(levels=[110, 130, 150, 170, 190]))
# optional argument levels: either an array of contour levels (as above) or the number of levels to plot, e.g.
# plot(z=volcano, Geom.contour(levels=5))









    Out[38]:



In [39]:

    
# contour also works for functions!!
plot(z=(x,y) -> x*exp(-(x-round(Int, x))^2-y^2), x=linspace(-8,8,150), y=linspace(-2,2,150), Geom.contour)









    Out[39]:

Advantages and Disadvantages

Advantages

Nice plots.
Great to display data.

Disadvantages

A bit slow.
No 3D graphs.
Not everything in ggplot2 is implemented; for example, there are no polar plots in Gadfly.

References

[1] Gadfly GitHub page.
[2] https://en.wikibooks.org/wiki/Introducing_Julia/Plotting
[3] The Grammar of Graphics (2005), Leland Wilkinson.
[4] ggplot2: Elegant Graphics for Data Analysis (2009), Hadley Wickham.
[5] ggplot2 Essentials (2015), Donato Teutonico.
[6] R Graphics Cookbook (2013), Winston Chang.


