Character recognition

This demo builds on the Kaggle character recognition competition. We train on images of characters taken from Google Street View and classify test images of various sizes. We first use k-Nearest Neighbours (kNN) to identify the characters, then try several other methods and compare them.

Some key takeaways:

  • Ease of prototyping deployable models.
  • Parallelisation that is easy to implement.
  • For loops that are fast, unlike in most high-level languages.
  • Gains from vectorisation using ArrayFire.

Download the following files from here and place them in the data directory:

  • testResized/
  • trainResized/
  • sampleSubmission.csv
  • trainLabels.csv

In [2]:
using Images, Colors, DataFrames, TestImages, Gadfly, ArrayFire;

In [3]:
include("$(Pkg.dir())/MLDemos/src/characters/knndemo.jl");

In [4]:
# Some configurations
path = "$(Pkg.dir())/MLDemos/";
imageSize = 400;
# true = for loop, false = vectorised
flag = true
# Read the training labels
labelsInfoTrain = readtable("$(path)data/characters/trainLabels.csv");
# Read the test labels
labelsInfoTest = readtable("$(path)data/characters/testLabels.csv");

In [5]:
#chars=unique(labelsInfoTrain[:Class])
counts=by(labelsInfoTrain, :Class, nrow);
p1=Gadfly.plot(x = counts[:Class], y=counts[:x1], Guide.xlabel("Characters"), Guide.ylabel("# of samples"), Geom.bar, Guide.title("Distribution of training data"))


Out[5]:
[Bar chart "Distribution of training data": number of samples (y axis: # of samples) for each character class (x axis: Characters).]

The training images range from 1.Bmp to 6283.Bmp. Let's see what the characters look like:


In [64]:
# 1 <= n <= 6283
n=6250
@show labelsInfoTrain[n,:Class]
showimtrain(n)


labelsInfoTrain[n,:Class] = "h"
Out[64]:

Below is the test dataframe, in which every test image is initially labelled 'A'. The goal is to predict the labels for all 6220 test images, indexed from 6284.Bmp to 12503.Bmp.


In [48]:
labelsInfoTest


Out[48]:
     ID    Class
1    6284  A
2    6285  A
3    6286  A
4    6287  A
5    6288  A
6    6289  A
7    6290  A
8    6291  A
9    6292  A
10   6293  A
11   6294  A
12   6295  A
13   6296  A
14   6297  A
15   6298  A
16   6299  A
17   6300  A
18   6301  A
19   6302  A
20   6303  A
21   6304  A
22   6305  A
23   6306  A
24   6307  A
25   6308  A
26   6309  A
27   6310  A
28   6311  A
29   6312  A
30   6313  A
⋮    ⋮     ⋮

In [13]:
# Read the training images
xTrain = read_data_sv("train", labelsInfoTrain, imageSize, "$(path)data/characters");

# Read the test images
xTest = read_data_sv("test", labelsInfoTest, imageSize, "$(path)data/characters");

In [15]:
# Map the training characters to their integer (ASCII) codes
yTrain = map(x -> x[1], labelsInfoTrain[:Class]);
yTrain = map(Int, yTrain);
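
Each class label is a one-character string, so taking its first character and converting it to an integer yields the character's code point. For example:

map(Int, ['A', 'a', 'z'])  # [65, 97, 122]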

In [16]:
# Transpose so that each column holds one image.
xTrain = xTrain'
xTest = xTest'


Out[16]:
400x6220 Array{Float64,2}:
 0.45098   0.282353  0.113725  0.592157  …  0.329412  0.635294  0.403922
 0.447059  0.286275  0.152941  0.545098     0.341176  0.611765  0.423529
 0.443137  0.309804  0.156863  0.564706     0.345098  0.662745  0.396078
 0.443137  0.301961  0.156863  0.529412     0.345098  0.635294  0.388235
 0.435294  0.309804  0.156863  0.568627     0.341176  0.627451  0.368627
 0.431373  0.341176  0.156863  0.607843  …  0.341176  0.615686  0.368627
 0.458824  0.34902   0.160784  0.568627     0.345098  0.611765  0.411765
 0.462745  0.333333  0.160784  0.576471     0.337255  0.619608  0.501961
 0.447059  0.341176  0.160784  0.537255     0.337255  0.643137  0.690196
 0.462745  0.364706  0.156863  0.560784     0.341176  0.654902  0.819608
 0.458824  0.364706  0.156863  0.580392  …  0.345098  0.627451  0.858824
 0.458824  0.376471  0.156863  0.592157     0.337255  0.65098   0.862745
 0.458824  0.388235  0.156863  0.592157     0.345098  0.631373  0.796078
 ⋮                                       ⋱                              
 0.466667  0.45098   0.160784  0.494118     0.329412  0.643137  0.407843
 0.47451   0.458824  0.156863  0.380392     0.329412  0.643137  0.423529
 0.494118  0.443137  0.156863  0.258824  …  0.32549   0.658824  0.415686
 0.486275  0.443137  0.152941  0.290196     0.317647  0.643137  0.439216
 0.482353  0.443137  0.14902   0.239216     0.317647  0.662745  0.431373
 0.47451   0.447059  0.145098  0.235294     0.32549   0.615686  0.435294
 0.47451   0.447059  0.145098  0.258824     0.321569  0.639216  0.423529
 0.486275  0.439216  0.145098  0.247059  …  0.313725  0.631373  0.411765
 0.490196  0.45098   0.145098  0.388235     0.317647  0.607843  0.392157
 0.486275  0.431373  0.145098  0.556863     0.32549   0.658824  0.505882
 0.482353  0.282353  0.145098  0.588235     0.32549   0.576471  0.611765
 0.470588  0.294118  0.101961  0.592157     0.313725  0.584314  0.392157

Parallelisation:

  1. Call addprocs() before running the code.
  2. Prefix each function definition with @everywhere.
  3. Prefix each parallel for loop with @parallel.

A self-contained toy example of this recipe appears after the next cell.

In [30]:
#=
addprocs(1)
@everywhere using DataFrames
include("$(Pkg.dir())/MLDemos/src/characters/knndemo.jl")
procs()
=#
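
As a toy illustration of the three-step recipe above (independent of the demo code), summing squares across workers looks like this:

addprocs(3)                    # 1. add worker processes

@everywhere square(x) = x * x  # 2. define the function on every worker

# 3. @parallel splits the loop across the workers and reduces with (+)
total = @parallel (+) for i in 1:1000
 square(i)
end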

Training the model


In [18]:
# Assign the labels to the training images and find the ratio
# of correctly classified labels to the total number of labels

k = 3
@time sumValues = @parallel (+) for i in 1:size(xTrain, 2)
 assign_label(xTrain, yTrain, k, i) == yTrain[i, 1]
end
loofCvAccuracy = sumValues / size(xTrain, 2)


 36.474307 seconds (112.12 M allocations: 2.126 GB, 0.73% gc time)
Out[18]:
0.04918032786885246
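
assign_label itself lives in knndemo.jl and is not shown here. As a rough, hypothetical sketch (names and details are assumptions, not the demo's actual implementation), a kNN labeller of this shape could look like:

# Hypothetical sketch, not the knndemo.jl implementation.
# Label a query image by majority vote among its k nearest training columns.
function assign_label_sketch(xTrain, yTrain, k, imageI::Vector)
 nCols = size(xTrain, 2)
 dists = zeros(nCols)
 for j in 1:nCols
  d = xTrain[:, j] - imageI
  dists[j] = dot(d, d)            # squared Euclidean distance
 end
 nearest = sortperm(dists)[1:k]   # indices of the k closest columns
 votes = Dict{Int,Int}()          # label => number of votes
 for j in nearest
  votes[yTrain[j]] = get(votes, yTrain[j], 0) + 1
 end
 bestLabel, bestCount = 0, 0
 for (label, count) in votes
  if count > bestCount
   bestLabel, bestCount = label, count
  end
 end
 return bestLabel
end

# Leave-one-out variant: classify training column i against all other columns.
function assign_label_sketch(xTrain, yTrain, k, i::Int)
 others = setdiff(1:size(xTrain, 2), [i])
 return assign_label_sketch(xTrain[:, others], yTrain[others], k, xTrain[:, i])
end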

Predicting on the test images:


In [19]:
# Running the kNN on test data set
tic()
k = 3 # The CV accuracy shows this value to be the best.                             
yPredictions = @parallel (vcat) for i in 1:size(xTest, 2)
 nRows = size(xTrain, 1)
 # Copy the i-th test image into a contiguous vector
 imageI = Array(Float32, nRows)
 for index in 1:nRows
  imageI[index] = xTest[index, i]
 end
 assign_label(xTrain, yTrain, k, imageI)
end
toc()


elapsed time: 65.206551157 seconds
Out[19]:
65.206551157

Assign the predicted values to labelsInfoTest


In [20]:
# Convert integer predictions to characters
labelsInfoTest[:Class] = map(Char, yPredictions);

Let's see what the predictions look like:

Test images range from 6284.Bmp to 12503.Bmp.


In [68]:
m=12003

show(labelsInfoTest[labelsInfoTest[:ID].==m,:Class])
showimtest(m)


['W']
Out[68]:

In [49]:
labelsInfoTest


Out[49]:
     ID    Class
1    6284  H
2    6285  E
3    6286  R
4    6287  d
5    6288  E
6    6289  C
7    6290  0
8    6291  a
9    6292  8
10   6293  H
11   6294  N
12   6295  R
13   6296  k
14   6297  H
15   6298  r
16   6299  5
17   6300  A
18   6301  R
19   6302  X
20   6303  Y
21   6304  M
22   6305  r
23   6306  A
24   6307  a
25   6308  R
26   6309  S
27   6310  D
28   6311  D
29   6312  n
30   6313  F
⋮    ⋮     ⋮

Performance figures:

The Julia parallel implementation:

As we add worker processes, performance improves up to nprocs = 6 and degrades beyond that; the data set is small, and 6 processes turn out to be optimal on the test system.


In [23]:
t=[34.749010, 34.001, 23.3853, 16.9243, 15.7945, 15.5830, 17.09088];
N = [1,2,3,4,5,6,10]
Gadfly.plot(x=N, y=t, Geom.point, Geom.line, Guide.xlabel("no. of procs"), Guide.ylabel("t in sec"), Guide.title("Parallel Performance"))


Out[23]:
[Line plot "Parallel Performance": wall-clock time (y axis: t in sec) versus the number of worker processes (x axis: no. of procs).]

Advantages of for loops:

Distance measure:

As the distance we use the squared Euclidean distance; since the square root is monotonic, it ranks neighbours exactly as the true Euclidean distance does. It can be computed either as a vectorised operation or with an explicit for loop.

In most high-level languages, vectorised code is the only fast path. Thanks to Julia's type inference and the compiler decisions that happen behind the scenes, the opposite is often true: plain for loops can be as fast as, or faster than, vectorised operations!


In [24]:
# Vectorised (squared) Euclidean distance
function euclidean_distance_vectorise(a, b)
   return dot(a-b, a-b)
end


Out[24]:
euclidean_distance_vectorise (generic function with 1 method)

In [25]:
# For-loop (squared) Euclidean distance
function euclidean_distance_for(a, b)
 distance = 0.0 
 for index in 1:size(a, 1) 
  distance += (a[index]-b[index]) * (a[index]-b[index])
 end
 return distance
end


Out[25]:
euclidean_distance_for (generic function with 1 method)
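
Both functions return the same squared distance, so either can back the kNN. For example:

a = [1.0, 2.0]; b = [3.0, 4.0]
euclidean_distance_vectorise(a, b)  # 8.0
euclidean_distance_for(a, b)        # 8.0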

ArrayFire

ArrayFire is a library for GPU and accelerated computing. ArrayFire.jl wraps it for Julia and provides a Julian interface.


In [29]:
#=
a = rand(100000)
b = rand(100000)
@time euclidean_distance_for(a,b)
@time euclidean_distance_vectorise(a,b)
af = AFArray(a)
bf = AFArray(b)
@time euclidean_distance_vectorise(af,bf)
=#


  0.000178 seconds (5 allocations: 176 bytes)
  0.000516 seconds (13 allocations: 1.526 MB)
  0.000295 seconds (27 allocations: 944 bytes)
Out[29]:
1-element ArrayFire.AFArray{Float64,1}:
 16669.3

Plot showing the time taken to train the model: for loop vs vectorised vs ArrayFire vectorised.


In [37]:
df=DataFrame(names = ["For", "Vectorised", "ArrayFire"], t2 = [34,70,20])

p1=Gadfly.plot( x=df[:names], y=df[:t2],  Guide.ylabel("Time in sec"), Geom.bar, Guide.title("For loop Vs Vectorised Vs ArrayFire."))


Out[37]:
[Bar chart "For loop Vs Vectorised Vs ArrayFire.": training time in seconds (For 34 s, Vectorised 70 s, ArrayFire 20 s).]