In [12]:
# helloMachine.r
# MWL, Lecture 1
# Author(s): [Phil Snyder]
In [13]:
data(iris) # load data (dataset "iris" comes with your R installation)
In [14]:
"
R doesn't support multi-line comments, but we can get away with
benignly passing it a string instead.
"
Out[14]:
In [15]:
iris
Out[15]:
In [16]:
help(topic='iris', package='datasets')
Out[16]:
In [17]:
names(iris)
Out[17]:
In [18]:
class(iris)
Out[18]:
In [19]:
str(iris) # str := structure
In [20]:
levels(iris$Species)
Out[20]:
In [21]:
?levels
Out[21]:
In [22]:
nrow(iris)
Out[22]:
In [23]:
ncol(iris)
Out[23]:
In [24]:
iris[1:3,] # pay attention to where the comma is!
Out[24]:
In [25]:
iris[,1:3]
Out[25]:
In [26]:
iris$Sepal.Length
Out[26]:
In [27]:
iris[,'Sepal.Length']
Out[27]:
In [60]:
setosa <- iris[iris$Species == "setosa",] # get rows from iris where Species == "setosa"
linearModel <- lm(formula = Sepal.Length ~ Sepal.Width, data = setosa) # ~ := "as explained by"
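For a quick look at the fit (not shown in the original notebook), summary() reports the estimated coefficients along with their standard errors and the R-squared:
summary(linearModel) # coefficients, residual summary, R-squared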
In [61]:
linearModel
Out[61]:
In [62]:
names(linearModel)
Out[62]:
In [63]:
linearModel$call
Out[63]:
In [64]:
plot(formula = Sepal.Length ~ Sepal.Width, data = setosa)
abline(linearModel)
In [65]:
# sapply is like a map function
squaredResiduals <- sapply(linearModel$residuals, function(x) return(x^2))
squaredError <- sum(squaredResiduals) / length(squaredResiduals) # i.e., the mean squared error
squaredError
Out[65]:
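As an aside (an equivalent one-liner, not from the original notebook), this quantity is just the mean of the squared residuals, so it can also be computed directly:
mean(resid(linearModel)^2) # same value as squaredError above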
In [66]:
# What if we remove the potential outlier at (2.3, 4.5)?
setosa[42,]
Out[66]:
In [81]:
setosaMinusOutlier <- setosa[-42,]
fixedLinearModel <- lm(Sepal.Length ~ Sepal.Width, setosaMinusOutlier)
plot(Sepal.Length ~ Sepal.Width, setosaMinusOutlier)
abline(fixedLinearModel)
In [82]:
fixedSquaredResiduals <- sapply(fixedLinearModel$residuals, function(x) return(x^2))
fixedSquaredError <- sum(fixedSquaredResiduals) / length(fixedSquaredResiduals)
fixedSquaredError
Out[82]:
Not much improvement in this case.
When is removing outliers justified? What if we had seen a significant decrease in the squared error? Is Sepal.Length ~ Sepal.Width a smart thing to model, or are there hidden variables (e.g., mean amount of daily sunshine, precipitation levels, exposure to wind, ...)?
In [83]:
# now for some classification
flowers <- subset(iris, Species == "setosa" | Species == "virginica")
# equivalently flowers <- iris[iris$Species == "setosa" | iris$Species == "virginica",]
# R has different "OR" operators: | is vectorized (element-wise over logical vectors),
# while || works only on single, length-one logicals. Be sure to use a single bar | here.
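A quick illustration of the difference (toy values, not part of the lecture):
c(TRUE, FALSE) | c(FALSE, FALSE) # element-wise: TRUE FALSE
TRUE || FALSE                    # single logical, as in an if() condition: TRUE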
In [84]:
# Here our data has two classes, hence we are performing "binary classification".
plot(flowers[,1:4], pch=sapply(flowers$Species, substr, 1, 1))
In [85]:
flowers$Species <- as.factor(flowers$Species)
In this case the conversion is unnecessary because class(flowers$Species) is already "factor", but in general we need to make our categorical response variables factors for classification.
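For example (a toy character vector, not from the lecture data), a character label column has to be converted before it can serve as a classification response:
labels <- c("cat", "dog", "dog", "cat")
class(labels)            # "character"
class(as.factor(labels)) # "factor"
levels(as.factor(labels)) # "cat" "dog"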
In [86]:
flowers <- droplevels(flowers) # need to drop "versicolor" from Species levels since it's empty
dotchart(flowers$Petal.Length, pch=sapply(flowers$Species, substr, 1, 1))
Notice how the data is linearly separable. (In a dot chart the y-axis is just the row index, so only the horizontal position matters.)
In [87]:
# 1-d case
library(MASS)
oneLinearClass <- lda(Species ~ Petal.Length, flowers)
lda stands for Linear Discriminant Analysis. How it works is not important for now; just know that it attempts to draw a point/line/plane/hyperplane separating our two classes.
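As a rough illustration of where that separating point lands in the 1-d case (a sketch that assumes equal class priors, which holds here since both classes have 50 rows, and the shared variance LDA assumes), the boundary sits at the midpoint of the two class means:
classMeans <- tapply(flowers$Petal.Length, flowers$Species, mean) # one mean per class
boundary <- mean(classMeans) # midpoint of the two class means
boundary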
In [88]:
onePredictions <- predict(oneLinearClass, flowers)
onePredictionResults <- table(onePredictions$class == flowers$Species) / length(flowers$Species)
onePredictionResults
Out[88]:
Wow! Zero error! Who would've thunk it?
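Another way to view the same result (not in the original notebook) is a confusion matrix, which cross-tabulates predicted against actual classes:
table(predicted = onePredictions$class, actual = flowers$Species)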
In [89]:
# 2-d case
plot(Sepal.Length ~ Sepal.Width, flowers, pch=sapply(flowers$Species, substr, 1, 1))
In [90]:
twoLinearClass <- lda(Species ~ Sepal.Length + Sepal.Width, flowers)
twoPredictions <- predict(twoLinearClass, flowers)
twoPredictionResults <- table(twoPredictions$class == flowers$Species) / length(flowers$Species)
twoPredictionResults
Out[90]:
Here LDA is not perfect, even though our data is (barely) linearly separable. This is because we violated the LDA assumption that the two classes have the same covariance matrix. (More on this in later lectures.)
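One way to see the violated assumption (an illustrative check, not part of the lecture): compute each class's covariance matrix for these two features and note that they differ.
by(flowers[, c("Sepal.Length", "Sepal.Width")], flowers$Species, cov) # per-class covariance matrices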
Now let's try the entire iris dataset, including Species == "versicolor", in n dimensions. Here n = 4, since we have 4 features (we don't include our "Species" label).
In [91]:
labelMapper <- function(s) { # this is just to help us plot the data in the next line
    if (s == "setosa") return(1)
    if (s == "versicolor") return(2)
    if (s == "virginica") return(3)
}
plot(iris[,1:4], pch=sapply(iris$Species, labelMapper))
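Since Species is already a factor, the same 1/2/3 codes can be obtained without a helper (an equivalent shortcut, not in the original notebook):
plot(iris[,1:4], pch=as.integer(iris$Species)) # factor levels map to 1, 2, 3 in level order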
In [92]:
nLinearClass <- lda(Species ~ ., iris) # Species ~ . := Species as explained by "everything"
nPredictions <- predict(nLinearClass, iris)
nPredictionResults <- table(nPredictions$class == iris$Species) / length(iris$Species)
nPredictionResults
Out[92]:
This is actually pretty good, especially since all we're doing is drawing a hyperplane through our 4-dimensional data!
BUT, as we'll see next lecture, we have done something egregious that has given us a false sense of confidence about the accuracy of our predictor...
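As a preview (a minimal sketch of one possible fix, assuming the issue is that we predicted on the same rows we trained on; the variable names here are hypothetical):
set.seed(1) # hypothetical seed, for reproducibility only
trainIdx <- sample(nrow(iris), floor(0.7 * nrow(iris))) # hold out ~30% of the rows
heldOutModel <- lda(Species ~ ., iris[trainIdx,])        # fit only on the training rows
heldOutPreds <- predict(heldOutModel, iris[-trainIdx,])  # predict only on the unseen rows
mean(heldOutPreds$class == iris$Species[-trainIdx])      # accuracy on held-out data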