This ipython notebook will teach you the basics of how multilayer perceptrons work, and show you how to use multilayer perceptrons in pylearn2.
To do this, we will go over several concepts:
Part 1: What pylearn2 is doing for you in this example
Review of softmax regression, and how MLPs are similar
The multilayer perceptron model
Some beneficial properties of MLPs
Some detrimental properties of MLPs
Part 2: How to use pylearn2 to train an MLP
Part 3: A deeper MLP, and pylearn2 polymorphism
Part 4: Regularization, and pylearn2 costs
Note that this won't explain in detail how the individual classes are implemented. The classes follow pretty good naming conventions and have pretty good docstrings, but if you have trouble understanding them, write to me and I might add a part 3 explaining how some of the parts work under the hood.
Please write to pylearn-dev@googlegroups.com if you encounter any problem with this tutorial.
Before running this notebook, you must have installed pylearn2. Follow the download and installation instructions if you have not yet done so.
This tutorial also assumes you already know about softmax regression, and know how to train and evaluate a softmax regression model in pylearn2. If not, work through softmax_regression.ipynb before starting this tutorial.
It's also strongly recommend that you run this notebook with THEANO_FLAGS="device=gpu". This is a processing intensive example and the GPU will make it run a lot faster, if you have one available. Execute the next cell to verify that you are using the GPU.
In [1]:
import theano
print theano.config.device
gpu
Using gpu device 0: GeForce GTX 285
In this part, we won't get into any specifics of pylearn2 yet. We'll just discuss how to train a multilayer perceptron (MLP). If you already know about MLPs, feel free to skip straight to part 2, where we show how to do all of this in pylearn2.
In softmax_regression.ipynb, we saw how softmax regression is a classification model that learns to map an input vector $x$ to a probability distribution $p(y\mid x)$ where $y$ is a categorical value with $k$ different values. We then described how a dataset $\mathcal{D}$ of $(x, y)$ tuples could be used to train a softmax regression model by maximizing the log likelihood,
$$ \sum_{x,y \in \mathcal{D} } \log P(y \mid x). $$A multilayer perceptron is a very general machine learning model. In many cases, we can think of it as mapping $x$ to $P(y\mid x)$, and train it by maximizing the log likelihood. We'll start with that basic perspective, because of its similarity to softmax regression. (It is, however, possible to interpret the output of a multiplayer perceptron non-probabilistically, to use it for regression rather than classification, and to train it by optimizing functions other than the log likelihood)
Everything we described above is still relevant to the MLP. However, there is one more fact about softmax regression that does not apply to the MLP. Specifically, softmax regression assumes that
$$ p(y \mid x) = \frac { \exp( x^T W + b ) } { \sum_i \exp(x^T W + b)_i } = \text{softmax}( x^T W + b). $$The MLP makes a different assumption about the functional form of $p(y \mid x)$.
The multilayer perceptron model assumption is very weak. Essentially, the assumption is that the relationship between inputs and outputs can be represented by the composition of several simpler functions. Each function being composed can be thought of as another "layer" or stage of processing. The number of compositions determines the "depth" of the model.
Suppose we have a sequence of functions implementing the layers, $g_1, g_2, \dots, g_L$. Then the output of our MLP is
$$f(x) = g_L(g_{L-1}( \dots g_2( g_1 ( x )) \dots )).$$In the first example for this tutorial, we will use just two layers. The final layer will be
$ g_2(g_1) = \text{softmax}( g_1^T W^{(2)} + b^{(2)}),$
so we can think of this model as using $g_1$ to transform $x$ into a different space, then doing softmax regression in that space.
For the first layer, we will use an affine transform followed by elementwise-application of the logistic sigmoid function, $\sigma(z) = \frac {1 } { 1 + \exp(-z) }.$ This is a very commonly used type of layer in multilayer perceptrons. Putting it all together, we get
$ g_1(x) = \sigma ( x^T W^{(1)} + b^{(1)} ).$
The full model is thus
$$ f(x) = \text{softmax}( \sigma ( x^T W^{(1)} + b^{(1)} )^T W^{(2)} + b^{(2)}). $$If we interpret $f(x)$ as defining $p(y \mid x)$, it makes sense to train the parameters $W^{(1)}$, $W^{(2)}$, $b^{(1)}$, and $b^{(2)}$ by maximizing the log likelihood of the training data.
An obvious problem with softmax regression and other linear classifiers is that linear functions are very simple. They prevent solutions to even very simple classification problems, such as the class of 2 bit patterns whose XOR is true. XOR is true when $x=[1,0]$ or $x=[0,1]$ but not when $x=[0,0]$ or $x=[1,1]$. Suppose we draw a line that separates $[0,0]$ from $[0,1]$. Then it must pass through some point $[0,p]$. We require that this line also pass through $[q,1]$ in order to separate $[0,1]$ from $[1,1]$. But this means it slope must be negative and its $x$-intercept must be negative. Since a line only has one $x$ intercept, it does not pass between $[0,0]$ and $[1,0]$. Those two points belong to different classes, so any linear classifier must fail.
An MLP solves this problem by introducing extra stages of processing. In our two layer example, suppose the dimensionality of the first layer is 2. We call the outputs of this layer "hidden units" because they are neither inputs nor outputs of the system; they are unobserved variables that the network must decide what to do with. The MLP can set one of these hidden units to be active when the sum of the two input variables is less than 1. It can set the other to be active when the sum of the two input variables is greater than 1. It can then set the output unit to be active by default, and to deactivate when either of the two hidden variables is active.
More generally, an MLP with one sufficient large hidden layer can represent any function. This result is known as the "universal approximator theorem."
Another advantage of MLPs is that they can be made deeper and deeper, rather than just wider and wider. Many functions can be represented more efficiently (using fewer parameters) with a deep architecture than with a wide one. Using fewer parameters is beneficial both because the MLP takes less memory to represent, but also because the parameters may be estimated more accurately from a smaller amount of data.
Unfortunately, just because an MLP can represent any function does not mean that it will learn to represent the right function. The problem of overfitting can still make the MLP perform badly on the test set even if it classifies the training set perfectly. While larger MLPs are capable of fitting more complicated training sets, they are also likely to overfit worse than smaller MLPs.
A related issue with MLPs is that they have many configuration options. The model itself imposes design decisions such as what type of function to use for each layer, the dimensionality of each layer. Also, the log likelihood is no longer generally concave, so the choice of optimization procedure matters more than it did with softmax regression. These configuration options are known as "hyperparameters." Choosing the right hyperparameters is an open and exciting research problem.
Most of the hyperparameters in this tutorial were not chosen particularly carefully. Feel free to play with all of the settings in this notebook. If you find better ones, write to me and I'll put your settings and your name in the tutorial!
Now that we've described the theory of what we're going to do, it's time to do it! This part describes how to use pylearn2 to run the algorithms described above.
As in the softmax regression tutorial, we will use the MLP to do optical character recognition on the MNIST dataset. The yaml string we construct is similar ot the one we use before. The main difference is that the MLP model class takes a "layers" argument describing the various layers of the model.
Note that for each layer, we need to specify what class to load. The identity of this class determines what type of layer appears at each position in the network. Here, we use a sigmoid hidden layer followed by a softmax output layer.
Every layer of the MLP needs a unique name. Here we name the first hidden layer 'h0' and the output label representing the prediction of the class $y$ 'y'. These layer names are used to generate monitor channel names later so that we can track properties of each layer separately.
The hidden layer needs some configuration that is pretty similar to the configuration for the output layer. Much as we need to tell the output layer its size (10 classes) we also need to tell the hidden layer its dimension, or the number of hidden units to go in that layer. In this case we use 500. We also need to tell it how to initialize its weights. The Sigmoid class supports the irange argument that we demonstrated for Softmax in the softmax regression tutorial, and we could use that here. Instead, we demonstrate a different argument, sparse_init. When sparse_init is specified, each unit gets exactly sparse_init non-zero weights initially. These weights are drawn from $N(0,1)$, so they are quite large compared to how weights are usually initialized.
In [2]:
import os
import pylearn2
path = os.path.join(pylearn2.__path__[0], 'scripts', 'tutorials', 'multilayer_perceptron', 'mlp_tutorial_part_2.yaml')
with open(path, 'r') as f:
train = f.read()
hyper_params = {'train_stop' : 50000,
'valid_stop' : 60000,
'dim_h0' : 500,
'max_epochs' : 10000,
'save_path' : '.'}
train = train % (hyper_params)
print train
!obj:pylearn2.train.Train {
dataset: &train !obj:pylearn2.datasets.mnist.MNIST {
which_set: 'train',
start: 0,
stop: 50000
},
model: !obj:pylearn2.models.mlp.MLP {
layers: [
!obj:pylearn2.models.mlp.Sigmoid {
layer_name: 'h0',
dim: 500,
sparse_init: 15,
}, !obj:pylearn2.models.mlp.Softmax {
layer_name: 'y',
n_classes: 10,
irange: 0.
}
],
nvis: 784,
},
algorithm: !obj:pylearn2.training_algorithms.bgd.BGD {
batch_size: 10000,
line_search_mode: 'exhaustive',
conjugate: 1,
updates_per_batch: 10,
monitoring_dataset:
{
'train' : *train,
'valid' : !obj:pylearn2.datasets.mnist.MNIST {
which_set: 'train',
start: 50000,
stop: 60000
},
'test' : !obj:pylearn2.datasets.mnist.MNIST {
which_set: 'test',
}
},
termination_criterion: !obj:pylearn2.termination_criteria.And {
criteria: [
!obj:pylearn2.termination_criteria.MonitorBased {
channel_name: "valid_y_misclass"
},
!obj:pylearn2.termination_criteria.EpochCounter {
max_epochs: 10000
}
]
}
},
extensions: [
!obj:pylearn2.train_extensions.best_params.MonitorBasedSaveBest {
channel_name: 'valid_y_misclass',
save_path: "mlp_best.pkl"
},
]
}
Note that we still do not specify a cost to be minimized. In the case of LogisticRegression, the model requested the negative log likelihood by default. In the case of the MLP, it is up to the final layer of the MLP to specify the default cost if the user does not provide one. In this case, since the final layer is a Softmax layer, we still have the same objective function as in the SoftmaxRegression tutorial.
Now, we use pylearn2's yaml_parse.load to construct the Train object, and run its main loop. The same thing could be accomplished by running pylearn2's train.py script on a file containing the yaml string.
Execute the next cell to train the model. This will take several minutes and possible as much as a few hours depending on how fast your computer is.
In [3]:
from pylearn2.config import yaml_parse
train = yaml_parse.load(train)
train.main_loop()
compiling begin_record_entry...
/u/goodfeli/pylearn2/models/mlp.py:36: UserWarning: MLP changing the recursion limit.
warnings.warn("MLP changing the recursion limit.")
compiling begin_record_entry done. Time elapsed: 0.479222 seconds
Monitored channels:
ave_grad_mult
ave_grad_size
ave_step_size
test_h0_col_norms_max
test_h0_col_norms_mean
test_h0_col_norms_min
test_h0_max_x_max_u
test_h0_max_x_mean_u
test_h0_max_x_min_u
test_h0_mean_x_max_u
test_h0_mean_x_mean_u
test_h0_mean_x_min_u
test_h0_min_x_max_u
test_h0_min_x_mean_u
test_h0_min_x_min_u
test_h0_row_norms_max
test_h0_row_norms_mean
test_h0_row_norms_min
test_objective
test_y_col_norms_max
test_y_col_norms_mean
test_y_col_norms_min
test_y_max_max_class
test_y_mean_max_class
test_y_min_max_class
test_y_misclass
test_y_nll
test_y_row_norms_max
test_y_row_norms_mean
test_y_row_norms_min
train_h0_col_norms_max
train_h0_col_norms_mean
train_h0_col_norms_min
train_h0_max_x_max_u
train_h0_max_x_mean_u
train_h0_max_x_min_u
train_h0_mean_x_max_u
train_h0_mean_x_mean_u
train_h0_mean_x_min_u
train_h0_min_x_max_u
train_h0_min_x_mean_u
train_h0_min_x_min_u
train_h0_row_norms_max
train_h0_row_norms_mean
train_h0_row_norms_min
train_objective
train_y_col_norms_max
train_y_col_norms_mean
train_y_col_norms_min
train_y_max_max_class
train_y_mean_max_class
train_y_min_max_class
train_y_misclass
train_y_nll
train_y_row_norms_max
train_y_row_norms_mean
train_y_row_norms_min
valid_h0_col_norms_max
valid_h0_col_norms_mean
valid_h0_col_norms_min
valid_h0_max_x_max_u
valid_h0_max_x_mean_u
valid_h0_max_x_min_u
valid_h0_mean_x_max_u
valid_h0_mean_x_mean_u
valid_h0_mean_x_min_u
valid_h0_min_x_max_u
valid_h0_min_x_mean_u
valid_h0_min_x_min_u
valid_h0_row_norms_max
valid_h0_row_norms_mean
valid_h0_row_norms_min
valid_objective
valid_y_col_norms_max
valid_y_col_norms_mean
valid_y_col_norms_min
valid_y_max_max_class
valid_y_mean_max_class
valid_y_min_max_class
valid_y_misclass
valid_y_nll
valid_y_row_norms_max
valid_y_row_norms_mean
valid_y_row_norms_min
Compiling accum...
graph size: 160
graph size: 157
graph size: 157
Compiling accum done. Time elapsed: 11.082528 seconds
Monitoring step:
Epochs seen: 0
Batches seen: 0
Examples seen: 0
ave_grad_mult: 0.0
ave_grad_size: 0.0
ave_step_size: 0.0
test_h0_col_norms_max: 6.23503398895
test_h0_col_norms_mean: 3.82355618477
test_h0_col_norms_min: 2.06193995476
test_h0_max_x_max_u: 0.999900639057
test_h0_max_x_mean_u: 0.909942150116
test_h0_max_x_min_u: 0.508436858654
test_h0_mean_x_max_u: 0.901069939137
test_h0_mean_x_mean_u: 0.476713299751
test_h0_mean_x_min_u: 0.152832776308
test_h0_min_x_max_u: 0.480607658625
test_h0_min_x_mean_u: 0.0718067958951
test_h0_min_x_min_u: 0.000174344575498
test_h0_row_norms_max: 5.89326095581
test_h0_row_norms_mean: 2.98549151421
test_h0_row_norms_min: 0.0
test_objective: 2.30258440971
test_y_col_norms_max: 0.0
test_y_col_norms_mean: 0.0
test_y_col_norms_min: 0.0
test_y_max_max_class: 0.0999999940395
test_y_mean_max_class: 0.099990285933
test_y_min_max_class: 0.0999999940395
test_y_misclass: 0.901999950409
test_y_nll: 2.30258440971
test_y_row_norms_max: 0.0
test_y_row_norms_mean: 0.0
test_y_row_norms_min: 0.0
train_h0_col_norms_max: 6.23503303528
train_h0_col_norms_mean: 3.82355594635
train_h0_col_norms_min: 2.06193971634
train_h0_max_x_max_u: 0.999884188175
train_h0_max_x_mean_u: 0.910601377487
train_h0_max_x_min_u: 0.542480230331
train_h0_mean_x_max_u: 0.899177610874
train_h0_mean_x_mean_u: 0.477026820183
train_h0_mean_x_min_u: 0.158626437187
train_h0_min_x_max_u: 0.458495438099
train_h0_min_x_mean_u: 0.0697233080864
train_h0_min_x_min_u: 0.000107248379209
train_h0_row_norms_max: 5.89326000214
train_h0_row_norms_mean: 2.98549151421
train_h0_row_norms_min: 0.0
train_objective: 2.30258440971
train_y_col_norms_max: 0.0
train_y_col_norms_mean: 0.0
train_y_col_norms_min: 0.0
train_y_max_max_class: 0.0999999940395
train_y_mean_max_class: 0.0999902933836
train_y_min_max_class: 0.0999999940395
train_y_misclass: 0.901359915733
train_y_nll: 2.30258440971
train_y_row_norms_max: 0.0
train_y_row_norms_mean: 0.0
train_y_row_norms_min: 0.0
valid_h0_col_norms_max: 6.23503398895
valid_h0_col_norms_mean: 3.82355618477
valid_h0_col_norms_min: 2.06193995476
valid_h0_max_x_max_u: 0.999902307987
valid_h0_max_x_mean_u: 0.910734891891
valid_h0_max_x_min_u: 0.505713641644
valid_h0_mean_x_max_u: 0.897212743759
valid_h0_mean_x_mean_u: 0.477113306522
valid_h0_mean_x_min_u: 0.159442692995
valid_h0_min_x_max_u: 0.474104195833
valid_h0_min_x_mean_u: 0.0706818476319
valid_h0_min_x_min_u: 0.000110276472697
valid_h0_row_norms_max: 5.89326095581
valid_h0_row_norms_mean: 2.98549151421
valid_h0_row_norms_min: 0.0
valid_objective: 2.30258440971
valid_y_col_norms_max: 0.0
valid_y_col_norms_mean: 0.0
valid_y_col_norms_min: 0.0
valid_y_max_max_class: 0.0999999940395
valid_y_mean_max_class: 0.099990285933
valid_y_min_max_class: 0.0999999940395
valid_y_misclass: 0.900900006294
valid_y_nll: 2.30258440971
valid_y_row_norms_max: 0.0
valid_y_row_norms_mean: 0.0
valid_y_row_norms_min: 0.0
Time this epoch: 35.338505 seconds
Monitoring step:
Epochs seen: 1
Batches seen: 5
Examples seen: 50000
ave_grad_mult: 0.566698908806
ave_grad_size: 0.567735552788
ave_step_size: 0.291175425053
test_h0_col_norms_max: 6.24065446854
test_h0_col_norms_mean: 3.83268666267
test_h0_col_norms_min: 2.0723836422
test_h0_max_x_max_u: 0.999798893929
test_h0_max_x_mean_u: 0.930105090141
test_h0_max_x_min_u: 0.600322246552
test_h0_mean_x_max_u: 0.863031387329
test_h0_mean_x_mean_u: 0.476889610291
test_h0_mean_x_min_u: 0.171247333288
test_h0_min_x_max_u: 0.412737071514
test_h0_min_x_mean_u: 0.0536084063351
test_h0_min_x_min_u: 0.000199288566364
test_h0_row_norms_max: 5.89763784409
test_h0_row_norms_mean: 2.99287319183
test_h0_row_norms_min: 0.0068221190013
test_objective: 0.350786328316
test_y_col_norms_max: 2.74948716164
test_y_col_norms_mean: 2.56346487999
test_y_col_norms_min: 2.34412789345
test_y_max_max_class: 0.999794960022
test_y_mean_max_class: 0.840726792812
test_y_min_max_class: 0.207839608192
test_y_misclass: 0.0983999967575
test_y_nll: 0.350786328316
test_y_row_norms_max: 0.701220929623
test_y_row_norms_mean: 0.34330791235
test_y_row_norms_min: 0.0764839723706
train_h0_col_norms_max: 6.24065446854
train_h0_col_norms_mean: 3.83268642426
train_h0_col_norms_min: 2.07238340378
train_h0_max_x_max_u: 0.999829530716
train_h0_max_x_mean_u: 0.930867910385
train_h0_max_x_min_u: 0.617025732994
train_h0_mean_x_max_u: 0.860394179821
train_h0_mean_x_mean_u: 0.477169722319
train_h0_mean_x_min_u: 0.177841931581
train_h0_min_x_max_u: 0.386521846056
train_h0_min_x_mean_u: 0.0524694435298
train_h0_min_x_min_u: 0.000151637359522
train_h0_row_norms_max: 5.89763736725
train_h0_row_norms_mean: 2.99287295341
train_h0_row_norms_min: 0.0068221190013
train_objective: 0.372914284468
train_y_col_norms_max: 2.74948716164
train_y_col_norms_mean: 2.56346464157
train_y_col_norms_min: 2.34412789345
train_y_max_max_class: 0.999826908112
train_y_mean_max_class: 0.833846986294
train_y_min_max_class: 0.198893502355
train_y_misclass: 0.106319993734
train_y_nll: 0.372914284468
train_y_row_norms_max: 0.701220929623
train_y_row_norms_mean: 0.343307882547
train_y_row_norms_min: 0.0764839798212
valid_h0_col_norms_max: 6.24065446854
valid_h0_col_norms_mean: 3.83268666267
valid_h0_col_norms_min: 2.0723836422
valid_h0_max_x_max_u: 0.999864041805
valid_h0_max_x_mean_u: 0.930580854416
valid_h0_max_x_min_u: 0.638543665409
valid_h0_mean_x_max_u: 0.858349621296
valid_h0_mean_x_mean_u: 0.477255016565
valid_h0_mean_x_min_u: 0.177810654044
valid_h0_min_x_max_u: 0.361713379622
valid_h0_min_x_mean_u: 0.0531250722706
valid_h0_min_x_min_u: 0.000215846084757
valid_h0_row_norms_max: 5.89763784409
valid_h0_row_norms_mean: 2.99287319183
valid_h0_row_norms_min: 0.0068221190013
valid_objective: 0.339448153973
valid_y_col_norms_max: 2.74948716164
valid_y_col_norms_mean: 2.56346487999
valid_y_col_norms_min: 2.34412789345
valid_y_max_max_class: 0.999945104122
valid_y_mean_max_class: 0.845010101795
valid_y_min_max_class: 0.196165680885
valid_y_misclass: 0.0965999960899
valid_y_nll: 0.339448153973
valid_y_row_norms_max: 0.701220929623
valid_y_row_norms_mean: 0.34330791235
valid_y_row_norms_min: 0.0764839723706
Time this epoch: 35.029214 seconds
Monitoring step:
Epochs seen: 2
Batches seen: 10
Examples seen: 100000
ave_grad_mult: 0.648920476437
ave_grad_size: 0.385089039803
ave_step_size: 0.205155700445
test_h0_col_norms_max: 6.2453122139
test_h0_col_norms_mean: 3.8378276825
test_h0_col_norms_min: 2.07804393768
test_h0_max_x_max_u: 0.999864637852
test_h0_max_x_mean_u: 0.93498313427
test_h0_max_x_min_u: 0.613258361816
test_h0_mean_x_max_u: 0.847131431103
test_h0_mean_x_mean_u: 0.476234823465
test_h0_mean_x_min_u: 0.172577545047
test_h0_min_x_max_u: 0.381593316793
test_h0_min_x_mean_u: 0.0493729114532
test_h0_min_x_min_u: 0.000119279786304
test_h0_row_norms_max: 5.90795898438
test_h0_row_norms_mean: 2.99731445312
test_h0_row_norms_min: 0.0140750305727
test_objective: 0.296338170767
test_y_col_norms_max: 3.20915484428
test_y_col_norms_mean: 3.00029850006
test_y_col_norms_min: 2.73683047295
test_y_max_max_class: 0.9999589324
test_y_mean_max_class: 0.878535091877
test_y_min_max_class: 0.236884206533
test_y_misclass: 0.0850000008941
test_y_nll: 0.296338170767
test_y_row_norms_max: 0.839111089706
test_y_row_norms_mean: 0.403169810772
test_y_row_norms_min: 0.0928392037749
train_h0_col_norms_max: 6.24531269073
train_h0_col_norms_mean: 3.83782744408
train_h0_col_norms_min: 2.07804393768
train_h0_max_x_max_u: 0.999843478203
train_h0_max_x_mean_u: 0.935774207115
train_h0_max_x_min_u: 0.630811154842
train_h0_mean_x_max_u: 0.843988478184
train_h0_mean_x_mean_u: 0.476507484913
train_h0_mean_x_min_u: 0.179330348969
train_h0_min_x_max_u: 0.372446238995
train_h0_min_x_mean_u: 0.048459071666
train_h0_min_x_min_u: 0.000123051402625
train_h0_row_norms_max: 5.90795898438
train_h0_row_norms_mean: 2.99731445312
train_h0_row_norms_min: 0.014075031504
train_objective: 0.310930907726
train_y_col_norms_max: 3.20915460587
train_y_col_norms_mean: 3.00029873848
train_y_col_norms_min: 2.73683071136
train_y_max_max_class: 0.999969184399
train_y_mean_max_class: 0.872422754765
train_y_min_max_class: 0.206743046641
train_y_misclass: 0.0889399945736
train_y_nll: 0.310930907726
train_y_row_norms_max: 0.839111089706
train_y_row_norms_mean: 0.403169810772
train_y_row_norms_min: 0.0928391963243
valid_h0_col_norms_max: 6.2453122139
valid_h0_col_norms_mean: 3.8378276825
valid_h0_col_norms_min: 2.07804393768
valid_h0_max_x_max_u: 0.999864220619
valid_h0_max_x_mean_u: 0.935237765312
valid_h0_max_x_min_u: 0.672344446182
valid_h0_mean_x_max_u: 0.842247903347
valid_h0_mean_x_mean_u: 0.476582586765
valid_h0_mean_x_min_u: 0.178887397051
valid_h0_min_x_max_u: 0.358671993017
valid_h0_min_x_mean_u: 0.0488182529807
valid_h0_min_x_min_u: 0.000185967219295
valid_h0_row_norms_max: 5.90795898438
valid_h0_row_norms_mean: 2.99731445312
valid_h0_row_norms_min: 0.0140750305727
valid_objective: 0.286341637373
valid_y_col_norms_max: 3.20915484428
valid_y_col_norms_mean: 3.00029850006
valid_y_col_norms_min: 2.73683047295
valid_y_max_max_class: 0.999980926514
valid_y_mean_max_class: 0.880788624287
valid_y_min_max_class: 0.193636313081
valid_y_misclass: 0.0813999921083
valid_y_nll: 0.286341637373
valid_y_row_norms_max: 0.839111089706
valid_y_row_norms_mean: 0.403169810772
valid_y_row_norms_min: 0.0928392037749
Time this epoch: 35.009148 seconds
Monitoring step:
Epochs seen: 3
Batches seen: 15
Examples seen: 150000
ave_grad_mult: 0.747792065144
ave_grad_size: 0.265085607767
ave_step_size: 0.150685995817
test_h0_col_norms_max: 6.24948835373
test_h0_col_norms_mean: 3.84261131287
test_h0_col_norms_min: 2.08266615868
test_h0_max_x_max_u: 0.99994790554
test_h0_max_x_mean_u: 0.937485575676
test_h0_max_x_min_u: 0.633630394936
test_h0_mean_x_max_u: 0.859075248241
test_h0_mean_x_mean_u: 0.475113451481
test_h0_mean_x_min_u: 0.166715249419
test_h0_min_x_max_u: 0.368945479393
test_h0_min_x_mean_u: 0.0472293719649
test_h0_min_x_min_u: 5.30257530045e-05
test_h0_row_norms_max: 5.91970491409
test_h0_row_norms_mean: 3.00150084496
test_h0_row_norms_min: 0.0220027510077
test_objective: 0.269680500031
test_y_col_norms_max: 3.56634759903
test_y_col_norms_mean: 3.29666876793
test_y_col_norms_min: 3.00721621513
test_y_max_max_class: 0.999979376793
test_y_mean_max_class: 0.893490552902
test_y_min_max_class: 0.250094264746
test_y_misclass: 0.0763000026345
test_y_nll: 0.269680500031
test_y_row_norms_max: 0.959613263607
test_y_row_norms_mean: 0.443394243717
test_y_row_norms_min: 0.103941932321
train_h0_col_norms_max: 6.24948787689
train_h0_col_norms_mean: 3.84261083603
train_h0_col_norms_min: 2.08266615868
train_h0_max_x_max_u: 0.99988758564
train_h0_max_x_mean_u: 0.938323676586
train_h0_max_x_min_u: 0.649454653263
train_h0_mean_x_max_u: 0.846590101719
train_h0_mean_x_mean_u: 0.475384742022
train_h0_mean_x_min_u: 0.171920359135
train_h0_min_x_max_u: 0.365952074528
train_h0_min_x_mean_u: 0.0464779213071
train_h0_min_x_min_u: 6.07749607298e-05
train_h0_row_norms_max: 5.91970491409
train_h0_row_norms_mean: 3.00150060654
train_h0_row_norms_min: 0.022002749145
train_objective: 0.278353452682
train_y_col_norms_max: 3.56634736061
train_y_col_norms_mean: 3.29666852951
train_y_col_norms_min: 3.00721621513
train_y_max_max_class: 0.999987363815
train_y_mean_max_class: 0.889036417007
train_y_min_max_class: 0.227912455797
train_y_misclass: 0.0788599997759
train_y_nll: 0.278353452682
train_y_row_norms_max: 0.959613204002
train_y_row_norms_mean: 0.443394213915
train_y_row_norms_min: 0.103941932321
valid_h0_col_norms_max: 6.24948835373
valid_h0_col_norms_mean: 3.84261131287
valid_h0_col_norms_min: 2.08266615868
valid_h0_max_x_max_u: 0.999919652939
valid_h0_max_x_mean_u: 0.937573850155
valid_h0_max_x_min_u: 0.684871912003
valid_h0_mean_x_max_u: 0.850003778934
valid_h0_mean_x_mean_u: 0.475453108549
valid_h0_mean_x_min_u: 0.170857235789
valid_h0_min_x_max_u: 0.353432744741
valid_h0_min_x_mean_u: 0.0467779003084
valid_h0_min_x_min_u: 6.80360026308e-05
valid_h0_row_norms_max: 5.91970491409
valid_h0_row_norms_mean: 3.00150084496
valid_h0_row_norms_min: 0.0220027510077
valid_objective: 0.26020783186
valid_y_col_norms_max: 3.56634759903
valid_y_col_norms_mean: 3.29666876793
valid_y_col_norms_min: 3.00721621513
valid_y_max_max_class: 0.999977052212
valid_y_mean_max_class: 0.896274268627
valid_y_min_max_class: 0.17623616755
valid_y_misclass: 0.0750000029802
valid_y_nll: 0.26020783186
valid_y_row_norms_max: 0.959613263607
valid_y_row_norms_mean: 0.443394243717
valid_y_row_norms_min: 0.103941932321
Time this epoch: 35.058853 seconds
Monitoring step:
Epochs seen: 4
Batches seen: 20
Examples seen: 200000
ave_grad_mult: 0.788351774216
ave_grad_size: 0.187993511558
ave_step_size: 0.113317854702
test_h0_col_norms_max: 6.25235366821
test_h0_col_norms_mean: 3.84656834602
test_h0_col_norms_min: 2.08510184288
test_h0_max_x_max_u: 0.999974727631
test_h0_max_x_mean_u: 0.938515424728
test_h0_max_x_min_u: 0.650707960129
test_h0_mean_x_max_u: 0.87255191803
test_h0_mean_x_mean_u: 0.474163293839
test_h0_mean_x_min_u: 0.160470247269
test_h0_min_x_max_u: 0.364907234907
test_h0_min_x_mean_u: 0.0464833118021
test_h0_min_x_min_u: 2.23769111471e-05
test_h0_row_norms_max: 5.93058395386
test_h0_row_norms_mean: 3.0049738884
test_h0_row_norms_min: 0.0284670460969
test_objective: 0.252513170242
test_y_col_norms_max: 3.77643465996
test_y_col_norms_mean: 3.49576759338
test_y_col_norms_min: 3.21715569496
test_y_max_max_class: 0.999990880489
test_y_mean_max_class: 0.902969479561
test_y_min_max_class: 0.223742827773
test_y_misclass: 0.0724000036716
test_y_nll: 0.252513170242
test_y_row_norms_max: 1.04190921783
test_y_row_norms_mean: 0.47004455328
test_y_row_norms_min: 0.109351947904
train_h0_col_norms_max: 6.25235366821
train_h0_col_norms_mean: 3.84656858444
train_h0_col_norms_min: 2.08510160446
train_h0_max_x_max_u: 0.999940037727
train_h0_max_x_mean_u: 0.939188420773
train_h0_max_x_min_u: 0.661542713642
train_h0_mean_x_max_u: 0.85992783308
train_h0_mean_x_mean_u: 0.474434643984
train_h0_mean_x_min_u: 0.163209468126
train_h0_min_x_max_u: 0.358978569508
train_h0_min_x_mean_u: 0.0456797704101
train_h0_min_x_min_u: 3.15167126246e-05
train_h0_row_norms_max: 5.93058395386
train_h0_row_norms_mean: 3.00497412682
train_h0_row_norms_min: 0.0284670442343
train_objective: 0.257761448622
train_y_col_norms_max: 3.77643465996
train_y_col_norms_mean: 3.49576735497
train_y_col_norms_min: 3.21715545654
train_y_max_max_class: 0.999995172024
train_y_mean_max_class: 0.898737490177
train_y_min_max_class: 0.233332633972
train_y_misclass: 0.0732599943876
train_y_nll: 0.257761448622
train_y_row_norms_max: 1.04190921783
train_y_row_norms_mean: 0.470044583082
train_y_row_norms_min: 0.109351947904
valid_h0_col_norms_max: 6.25235366821
valid_h0_col_norms_mean: 3.84656834602
valid_h0_col_norms_min: 2.08510184288
valid_h0_max_x_max_u: 0.999963521957
valid_h0_max_x_mean_u: 0.938330054283
valid_h0_max_x_min_u: 0.685399234295
valid_h0_mean_x_max_u: 0.864110708237
valid_h0_mean_x_mean_u: 0.474497437477
valid_h0_mean_x_min_u: 0.161501988769
valid_h0_min_x_max_u: 0.347681999207
valid_h0_min_x_mean_u: 0.0459976904094
valid_h0_min_x_min_u: 2.87672100967e-05
valid_h0_row_norms_max: 5.93058395386
valid_h0_row_norms_mean: 3.0049738884
valid_h0_row_norms_min: 0.0284670460969
valid_objective: 0.242218419909
valid_y_col_norms_max: 3.77643465996
valid_y_col_norms_mean: 3.49576759338
valid_y_col_norms_min: 3.21715569496
valid_y_max_max_class: 0.999983727932
valid_y_mean_max_class: 0.90525239706
valid_y_min_max_class: 0.237812787294
valid_y_misclass: 0.070799998939
valid_y_nll: 0.242218419909
valid_y_row_norms_max: 1.04190921783
valid_y_row_norms_mean: 0.47004455328
valid_y_row_norms_min: 0.109351947904
Time this epoch: 34.824181 seconds
Monitoring step:
Epochs seen: 5
Batches seen: 25
Examples seen: 250000
ave_grad_mult: 0.822910606861
ave_grad_size: 0.140246614814
ave_step_size: 0.0910708159208
test_h0_col_norms_max: 6.2554602623
test_h0_col_norms_mean: 3.85085010529
test_h0_col_norms_min: 2.08709287643
test_h0_max_x_max_u: 0.999985814095
test_h0_max_x_mean_u: 0.939129829407
test_h0_max_x_min_u: 0.667058110237
test_h0_mean_x_max_u: 0.881521999836
test_h0_mean_x_mean_u: 0.473096251488
test_h0_mean_x_min_u: 0.148683413863
test_h0_min_x_max_u: 0.366505622864
test_h0_min_x_mean_u: 0.0459363907576
test_h0_min_x_min_u: 9.1133879323e-06
test_h0_row_norms_max: 5.94399118423
test_h0_row_norms_mean: 3.00873041153
test_h0_row_norms_min: 0.0347110852599
test_objective: 0.236052155495
test_y_col_norms_max: 3.98437142372
test_y_col_norms_mean: 3.68210268021
test_y_col_norms_min: 3.41360712051
test_y_max_max_class: 0.99999153614
test_y_mean_max_class: 0.909221351147
test_y_min_max_class: 0.227106332779
test_y_misclass: 0.0672999992967
test_y_nll: 0.236052155495
test_y_row_norms_max: 1.12676775455
test_y_row_norms_mean: 0.494562119246
test_y_row_norms_min: 0.114525236189
train_h0_col_norms_max: 6.2554602623
train_h0_col_norms_mean: 3.85085010529
train_h0_col_norms_min: 2.08709263802
train_h0_max_x_max_u: 0.999965369701
train_h0_max_x_mean_u: 0.939886808395
train_h0_max_x_min_u: 0.672379374504
train_h0_mean_x_max_u: 0.869372367859
train_h0_mean_x_mean_u: 0.473366141319
train_h0_mean_x_min_u: 0.151700764894
train_h0_min_x_max_u: 0.357233524323
train_h0_min_x_mean_u: 0.0450618416071
train_h0_min_x_min_u: 1.45595986396e-05
train_h0_row_norms_max: 5.9439907074
train_h0_row_norms_mean: 3.00873041153
train_h0_row_norms_min: 0.0347110852599
train_objective: 0.239308148623
train_y_col_norms_max: 3.9843711853
train_y_col_norms_mean: 3.68210220337
train_y_col_norms_min: 3.41360712051
train_y_max_max_class: 0.999996185303
train_y_mean_max_class: 0.905649185181
train_y_min_max_class: 0.236008346081
train_y_misclass: 0.0679599940777
train_y_nll: 0.239308148623
train_y_row_norms_max: 1.12676763535
train_y_row_norms_mean: 0.494562089443
train_y_row_norms_min: 0.114525228739
valid_h0_col_norms_max: 6.2554602623
valid_h0_col_norms_mean: 3.85085010529
valid_h0_col_norms_min: 2.08709287643
valid_h0_max_x_max_u: 0.999980926514
valid_h0_max_x_mean_u: 0.939110815525
valid_h0_max_x_min_u: 0.683836042881
valid_h0_mean_x_max_u: 0.873598277569
valid_h0_mean_x_mean_u: 0.473425507545
valid_h0_mean_x_min_u: 0.149841591716
valid_h0_min_x_max_u: 0.346154510975
valid_h0_min_x_mean_u: 0.0454438403249
valid_h0_min_x_min_u: 1.18227362691e-05
valid_h0_row_norms_max: 5.94399118423
valid_h0_row_norms_mean: 3.00873041153
valid_h0_row_norms_min: 0.0347110852599
valid_objective: 0.22658072412
valid_y_col_norms_max: 3.98437142372
valid_y_col_norms_mean: 3.68210268021
valid_y_col_norms_min: 3.41360712051
valid_y_max_max_class: 0.999987483025
valid_y_mean_max_class: 0.911411643028
valid_y_min_max_class: 0.217763110995
valid_y_misclass: 0.0644000023603
valid_y_nll: 0.22658072412
valid_y_row_norms_max: 1.12676775455
valid_y_row_norms_mean: 0.494562119246
valid_y_row_norms_min: 0.114525236189
Time this epoch: 35.012249 seconds
Monitoring step:
Epochs seen: 6
Batches seen: 30
Examples seen: 300000
ave_grad_mult: 0.849331319332
ave_grad_size: 0.110973127186
ave_step_size: 0.0771789103746
test_h0_col_norms_max: 6.25832700729
test_h0_col_norms_mean: 3.85529947281
test_h0_col_norms_min: 2.08869576454
test_h0_max_x_max_u: 0.999991595745
test_h0_max_x_mean_u: 0.93943220377
test_h0_max_x_min_u: 0.680398881435
test_h0_mean_x_max_u: 0.887371778488
test_h0_mean_x_mean_u: 0.472293674946
test_h0_mean_x_min_u: 0.139431104064
test_h0_min_x_max_u: 0.367107391357
test_h0_min_x_mean_u: 0.0457468703389
test_h0_min_x_min_u: 3.69549866264e-06
test_h0_row_norms_max: 5.9600777626
test_h0_row_norms_mean: 3.01261997223
test_h0_row_norms_min: 0.0412151031196
test_objective: 0.222071394324
test_y_col_norms_max: 4.16519927979
test_y_col_norms_mean: 3.85762476921
test_y_col_norms_min: 3.61017894745
test_y_max_max_class: 0.999991238117
test_y_mean_max_class: 0.913735508919
test_y_min_max_class: 0.246407344937
test_y_misclass: 0.0631999969482
test_y_nll: 0.222071394324
test_y_row_norms_max: 1.19918644428
test_y_row_norms_mean: 0.517221450806
test_y_row_norms_min: 0.117476500571
train_h0_col_norms_max: 6.25832748413
train_h0_col_norms_mean: 3.85529899597
train_h0_col_norms_min: 2.08869576454
train_h0_max_x_max_u: 0.999979615211
train_h0_max_x_mean_u: 0.94024169445
train_h0_max_x_min_u: 0.675026059151
train_h0_mean_x_max_u: 0.87550008297
train_h0_mean_x_mean_u: 0.472564071417
train_h0_mean_x_min_u: 0.142730906606
train_h0_min_x_max_u: 0.356041908264
train_h0_min_x_mean_u: 0.044754832983
train_h0_min_x_min_u: 6.11660334471e-06
train_h0_row_norms_max: 5.96007823944
train_h0_row_norms_mean: 3.01261997223
train_h0_row_norms_min: 0.0412151031196
train_objective: 0.222275063396
train_y_col_norms_max: 4.16519880295
train_y_col_norms_mean: 3.85762453079
train_y_col_norms_min: 3.61017894745
train_y_max_max_class: 0.999996602535
train_y_mean_max_class: 0.910623729229
train_y_min_max_class: 0.235357835889
train_y_misclass: 0.062839999795
train_y_nll: 0.222275063396
train_y_row_norms_max: 1.19918644428
train_y_row_norms_mean: 0.517221450806
train_y_row_norms_min: 0.11747649312
valid_h0_col_norms_max: 6.25832700729
valid_h0_col_norms_mean: 3.85529947281
valid_h0_col_norms_min: 2.08869576454
valid_h0_max_x_max_u: 0.999989330769
valid_h0_max_x_mean_u: 0.939590632915
valid_h0_max_x_min_u: 0.678366243839
valid_h0_mean_x_max_u: 0.879810392857
valid_h0_mean_x_mean_u: 0.472620040178
valid_h0_mean_x_min_u: 0.140709280968
valid_h0_min_x_max_u: 0.344533830881
valid_h0_min_x_mean_u: 0.0452971383929
valid_h0_min_x_min_u: 4.94029472975e-06
valid_h0_row_norms_max: 5.9600777626
valid_h0_row_norms_mean: 3.01261997223
valid_h0_row_norms_min: 0.0412151031196
valid_objective: 0.213480621576
valid_y_col_norms_max: 4.16519927979
valid_y_col_norms_mean: 3.85762476921
valid_y_col_norms_min: 3.61017894745
valid_y_max_max_class: 0.999992728233
valid_y_mean_max_class: 0.915528953075
valid_y_min_max_class: 0.230840429664
valid_y_misclass: 0.0590999983251
valid_y_nll: 0.213480621576
valid_y_row_norms_max: 1.19918644428
valid_y_row_norms_mean: 0.517221450806
valid_y_row_norms_min: 0.117476500571
Time this epoch: 34.796789 seconds
Monitoring step:
Epochs seen: 7
Batches seen: 35
Examples seen: 350000
ave_grad_mult: 0.921035170555
ave_grad_size: 0.0949304848909
ave_step_size: 0.0732585340738
test_h0_col_norms_max: 6.26188564301
test_h0_col_norms_mean: 3.86070275307
test_h0_col_norms_min: 2.09020781517
test_h0_max_x_max_u: 0.999995708466
test_h0_max_x_mean_u: 0.940146625042
test_h0_max_x_min_u: 0.672576725483
test_h0_mean_x_max_u: 0.892456889153
test_h0_mean_x_mean_u: 0.47117972374
test_h0_mean_x_min_u: 0.127655550838
test_h0_min_x_max_u: 0.367071986198
test_h0_min_x_mean_u: 0.0451025255024
test_h0_min_x_min_u: 1.38111693104e-06
test_h0_row_norms_max: 5.97794675827
test_h0_row_norms_mean: 3.01733326912
test_h0_row_norms_min: 0.0475185476243
test_objective: 0.2069362849
test_y_col_norms_max: 4.37119436264
test_y_col_norms_mean: 4.05648756027
test_y_col_norms_min: 3.72235488892
test_y_max_max_class: 0.999992549419
test_y_mean_max_class: 0.920760273933
test_y_min_max_class: 0.212535321712
test_y_misclass: 0.0597999989986
test_y_nll: 0.2069362849
test_y_row_norms_max: 1.28081488609
test_y_row_norms_mean: 0.54237049818
test_y_row_norms_min: 0.120768107474
train_h0_col_norms_max: 6.26188564301
train_h0_col_norms_mean: 3.86070251465
train_h0_col_norms_min: 2.09020781517
train_h0_max_x_max_u: 0.999989151955
train_h0_max_x_mean_u: 0.941006839275
train_h0_max_x_min_u: 0.670265555382
train_h0_mean_x_max_u: 0.880909919739
train_h0_mean_x_mean_u: 0.471454769373
train_h0_mean_x_min_u: 0.130571871996
train_h0_min_x_max_u: 0.354819297791
train_h0_min_x_mean_u: 0.0440064184368
train_h0_min_x_min_u: 2.32596198657e-06
train_h0_row_norms_max: 5.97794628143
train_h0_row_norms_mean: 3.0173330307
train_h0_row_norms_min: 0.0475185438991
train_objective: 0.205675914884
train_y_col_norms_max: 4.3711938858
train_y_col_norms_mean: 4.05648708344
train_y_col_norms_min: 3.72235488892
train_y_max_max_class: 0.999997496605
train_y_mean_max_class: 0.917994856834
train_y_min_max_class: 0.242114007473
train_y_misclass: 0.0586799941957
train_y_nll: 0.205675914884
train_y_row_norms_max: 1.2808150053
train_y_row_norms_mean: 0.542370438576
train_y_row_norms_min: 0.120768100023
valid_h0_col_norms_max: 6.26188564301
valid_h0_col_norms_mean: 3.86070275307
valid_h0_col_norms_min: 2.09020781517
valid_h0_max_x_max_u: 0.999994754791
valid_h0_max_x_mean_u: 0.940389454365
valid_h0_max_x_min_u: 0.653915822506
valid_h0_mean_x_max_u: 0.885270357132
valid_h0_mean_x_mean_u: 0.471503049135
valid_h0_mean_x_min_u: 0.129038855433
valid_h0_min_x_max_u: 0.343496620655
valid_h0_min_x_mean_u: 0.0445692464709
valid_h0_min_x_min_u: 1.89789943761e-06
valid_h0_row_norms_max: 5.97794675827
valid_h0_row_norms_mean: 3.01733326912
valid_h0_row_norms_min: 0.0475185476243
valid_objective: 0.199690312147
valid_y_col_norms_max: 4.37119436264
valid_y_col_norms_mean: 4.05648756027
valid_y_col_norms_min: 3.72235488892
valid_y_max_max_class: 0.999996244907
valid_y_mean_max_class: 0.922058641911
valid_y_min_max_class: 0.22336602211
valid_y_misclass: 0.055799998343
valid_y_nll: 0.199690312147
valid_y_row_norms_max: 1.28081488609
valid_y_row_norms_mean: 0.54237049818
valid_y_row_norms_min: 0.120768107474
Time this epoch: 34.805092 seconds
Monitoring step:
Epochs seen: 8
Batches seen: 40
Examples seen: 400000
ave_grad_mult: 0.991648554802
ave_grad_size: 0.0825677365065
ave_step_size: 0.0698289051652
test_h0_col_norms_max: 6.26615095139
test_h0_col_norms_mean: 3.86642217636
test_h0_col_norms_min: 2.0920112133
test_h0_max_x_max_u: 0.999997377396
test_h0_max_x_mean_u: 0.940795004368
test_h0_max_x_min_u: 0.66545778513
test_h0_mean_x_max_u: 0.901528179646
test_h0_mean_x_mean_u: 0.470299869776
test_h0_mean_x_min_u: 0.121718779206
test_h0_min_x_max_u: 0.370387345552
test_h0_min_x_mean_u: 0.0449309423566
test_h0_min_x_min_u: 5.55576320949e-07
test_h0_row_norms_max: 5.99863862991
test_h0_row_norms_mean: 3.02230143547
test_h0_row_norms_min: 0.0541109740734
test_objective: 0.1924007833
test_y_col_norms_max: 4.68016433716
test_y_col_norms_mean: 4.25164651871
test_y_col_norms_min: 3.82015967369
test_y_max_max_class: 0.999988377094
test_y_mean_max_class: 0.924113929272
test_y_min_max_class: 0.210057422519
test_y_misclass: 0.0555000007153
test_y_nll: 0.1924007833
test_y_row_norms_max: 1.36218941212
test_y_row_norms_mean: 0.566706836224
test_y_row_norms_min: 0.123096778989
train_h0_col_norms_max: 6.26615047455
train_h0_col_norms_mean: 3.86642193794
train_h0_col_norms_min: 2.0920112133
train_h0_max_x_max_u: 0.999993860722
train_h0_max_x_mean_u: 0.941651582718
train_h0_max_x_min_u: 0.657282650471
train_h0_mean_x_max_u: 0.89084905386
train_h0_mean_x_mean_u: 0.470575273037
train_h0_mean_x_min_u: 0.124335050583
train_h0_min_x_max_u: 0.357388138771
train_h0_min_x_mean_u: 0.0438850969076
train_h0_min_x_min_u: 9.09530456283e-07
train_h0_row_norms_max: 5.99863910675
train_h0_row_norms_mean: 3.02230119705
train_h0_row_norms_min: 0.0541109666228
train_objective: 0.187867701054
train_y_col_norms_max: 4.680164814
train_y_col_norms_mean: 4.25164651871
train_y_col_norms_min: 3.82015943527
train_y_max_max_class: 0.999996304512
train_y_mean_max_class: 0.922073721886
train_y_min_max_class: 0.237471118569
train_y_misclass: 0.0530999973416
train_y_nll: 0.187867701054
train_y_row_norms_max: 1.36218929291
train_y_row_norms_mean: 0.566706776619
train_y_row_norms_min: 0.123096778989
valid_h0_col_norms_max: 6.26615095139
valid_h0_col_norms_mean: 3.86642217636
valid_h0_col_norms_min: 2.0920112133
valid_h0_max_x_max_u: 0.999996244907
valid_h0_max_x_mean_u: 0.940959215164
valid_h0_max_x_min_u: 0.634269952774
valid_h0_mean_x_max_u: 0.894827961922
valid_h0_mean_x_mean_u: 0.470626890659
valid_h0_mean_x_min_u: 0.123129568994
valid_h0_min_x_max_u: 0.344170331955
valid_h0_min_x_mean_u: 0.0444831475616
valid_h0_min_x_min_u: 7.30816509531e-07
valid_h0_row_norms_max: 5.99863862991
valid_h0_row_norms_mean: 3.02230143547
valid_h0_row_norms_min: 0.0541109740734
valid_objective: 0.184409946203
valid_y_col_norms_max: 4.68016433716
valid_y_col_norms_mean: 4.25164651871
valid_y_col_norms_min: 3.82015967369
valid_y_max_max_class: 0.999994754791
valid_y_mean_max_class: 0.926723182201
valid_y_min_max_class: 0.219980046153
valid_y_misclass: 0.047499999404
valid_y_nll: 0.184409946203
valid_y_row_norms_max: 1.36218941212
valid_y_row_norms_mean: 0.566706836224
valid_y_row_norms_min: 0.123096778989
Time this epoch: 35.663056 seconds
Monitoring step:
Epochs seen: 9
Batches seen: 45
Examples seen: 450000
ave_grad_mult: 1.00632071495
ave_grad_size: 0.0730155408382
ave_step_size: 0.0651284307241
test_h0_col_norms_max: 6.27027750015
test_h0_col_norms_mean: 3.87175488472
test_h0_col_norms_min: 2.09271168709
test_h0_max_x_max_u: 0.99999833107
test_h0_max_x_mean_u: 0.941553533077
test_h0_max_x_min_u: 0.65441852808
test_h0_mean_x_max_u: 0.903928875923
test_h0_mean_x_mean_u: 0.469605773687
test_h0_mean_x_min_u: 0.114903002977
test_h0_min_x_max_u: 0.373793333769
test_h0_min_x_mean_u: 0.044343251735
test_h0_min_x_min_u: 2.48894650667e-07
test_h0_row_norms_max: 6.01675319672
test_h0_row_norms_mean: 3.0269382
test_h0_row_norms_min: 0.0595724433661
test_objective: 0.178400695324
test_y_col_norms_max: 4.93448925018
test_y_col_norms_mean: 4.4312376976
test_y_col_norms_min: 3.912296772
test_y_max_max_class: 0.99998986721
test_y_mean_max_class: 0.929982662201
test_y_min_max_class: 0.206445708871
test_y_misclass: 0.0520999990404
test_y_nll: 0.178400695324
test_y_row_norms_max: 1.42163467407
test_y_row_norms_mean: 0.588779568672
test_y_row_norms_min: 0.124702431262
train_h0_col_norms_max: 6.27027750015
train_h0_col_norms_mean: 3.87175512314
train_h0_col_norms_min: 2.09271168709
train_h0_max_x_max_u: 0.999995946884
train_h0_max_x_mean_u: 0.942493140697
train_h0_max_x_min_u: 0.638945221901
train_h0_mean_x_max_u: 0.893475353718
train_h0_mean_x_mean_u: 0.469883978367
train_h0_mean_x_min_u: 0.117275975645
train_h0_min_x_max_u: 0.360578835011
train_h0_min_x_mean_u: 0.0432931296527
train_h0_min_x_min_u: 3.94163265582e-07
train_h0_row_norms_max: 6.01675271988
train_h0_row_norms_mean: 3.0269382
train_h0_row_norms_min: 0.0595724396408
train_objective: 0.173733517528
train_y_col_norms_max: 4.93448877335
train_y_col_norms_mean: 4.4312376976
train_y_col_norms_min: 3.91229653358
train_y_max_max_class: 0.999996066093
train_y_mean_max_class: 0.92810434103
train_y_min_max_class: 0.229242756963
train_y_misclass: 0.0490399971604
train_y_nll: 0.173733517528
train_y_row_norms_max: 1.42163455486
train_y_row_norms_mean: 0.588779509068
train_y_row_norms_min: 0.124702423811
valid_h0_col_norms_max: 6.27027750015
valid_h0_col_norms_mean: 3.87175488472
valid_h0_col_norms_min: 2.09271168709
valid_h0_max_x_max_u: 0.999997377396
valid_h0_max_x_mean_u: 0.941749632359
valid_h0_max_x_min_u: 0.622578442097
valid_h0_mean_x_max_u: 0.897465348244
valid_h0_mean_x_mean_u: 0.469932496548
valid_h0_mean_x_min_u: 0.116939790547
valid_h0_min_x_max_u: 0.347404718399
valid_h0_min_x_mean_u: 0.0439214892685
valid_h0_min_x_min_u: 3.13890211601e-07
valid_h0_row_norms_max: 6.01675319672
valid_h0_row_norms_mean: 3.0269382
valid_h0_row_norms_min: 0.0595724433661
valid_objective: 0.172197133303
valid_y_col_norms_max: 4.93448925018
valid_y_col_norms_mean: 4.4312376976
valid_y_col_norms_min: 3.912296772
valid_y_max_max_class: 0.999996781349
valid_y_mean_max_class: 0.932501792908
valid_y_min_max_class: 0.216077208519
valid_y_misclass: 0.0454999953508
valid_y_nll: 0.172197133303
valid_y_row_norms_max: 1.42163467407
valid_y_row_norms_mean: 0.588779568672
valid_y_row_norms_min: 0.124702431262
Time this epoch: 35.404834 seconds
Monitoring step:
Epochs seen: 10
Batches seen: 50
Examples seen: 500000
ave_grad_mult: 1.06833612919
ave_grad_size: 0.0678643658757
ave_step_size: 0.0653440654278
test_h0_col_norms_max: 6.27522420883
test_h0_col_norms_mean: 3.87793588638
test_h0_col_norms_min: 2.09417295456
test_h0_max_x_max_u: 0.999998867512
test_h0_max_x_mean_u: 0.942130804062
test_h0_max_x_min_u: 0.645175695419
test_h0_mean_x_max_u: 0.909636974335
test_h0_mean_x_mean_u: 0.468845933676
test_h0_mean_x_min_u: 0.104815065861
test_h0_min_x_max_u: 0.378569096327
test_h0_min_x_mean_u: 0.0440588444471
test_h0_min_x_min_u: 1.15133666156e-07
test_h0_row_norms_max: 6.03866481781
test_h0_row_norms_mean: 3.03230404854
test_h0_row_norms_min: 0.065353885293
test_objective: 0.167283341289
test_y_col_norms_max: 5.2253780365
test_y_col_norms_mean: 4.62542486191
test_y_col_norms_min: 4.01688957214
test_y_max_max_class: 0.999992907047
test_y_mean_max_class: 0.933511257172
test_y_min_max_class: 0.242168530822
test_y_misclass: 0.0492999963462
test_y_nll: 0.167283341289
test_y_row_norms_max: 1.50107598305
test_y_row_norms_mean: 0.612406551838
test_y_row_norms_min: 0.125712171197
train_h0_col_norms_max: 6.27522373199
train_h0_col_norms_mean: 3.87793540955
train_h0_col_norms_min: 2.09417295456
train_h0_max_x_max_u: 0.999997496605
train_h0_max_x_mean_u: 0.943212330341
train_h0_max_x_min_u: 0.628583967686
train_h0_mean_x_max_u: 0.899803757668
train_h0_mean_x_mean_u: 0.469121694565
train_h0_mean_x_min_u: 0.107625767589
train_h0_min_x_max_u: 0.36565092206
train_h0_min_x_mean_u: 0.0430302321911
train_h0_min_x_min_u: 1.75549445203e-07
train_h0_row_norms_max: 6.03866481781
train_h0_row_norms_mean: 3.03230404854
train_h0_row_norms_min: 0.065353885293
train_objective: 0.159167990088
train_y_col_norms_max: 5.22537755966
train_y_col_norms_mean: 4.62542486191
train_y_col_norms_min: 4.01688957214
train_y_max_max_class: 0.999997138977
train_y_mean_max_class: 0.931973934174
train_y_min_max_class: 0.241810530424
train_y_misclass: 0.0449799969792
train_y_nll: 0.159167990088
train_y_row_norms_max: 1.50107610226
train_y_row_norms_mean: 0.612406492233
train_y_row_norms_min: 0.125712156296
valid_h0_col_norms_max: 6.27522420883
valid_h0_col_norms_mean: 3.87793588638
valid_h0_col_norms_min: 2.09417295456
valid_h0_max_x_max_u: 0.999998152256
valid_h0_max_x_mean_u: 0.942497193813
valid_h0_max_x_min_u: 0.619423508644
valid_h0_mean_x_max_u: 0.903488636017
valid_h0_mean_x_mean_u: 0.469177812338
valid_h0_mean_x_min_u: 0.108095638454
valid_h0_min_x_max_u: 0.349716216326
valid_h0_min_x_mean_u: 0.04355686903
valid_h0_min_x_min_u: 1.34484281489e-07
valid_h0_row_norms_max: 6.03866481781
valid_h0_row_norms_mean: 3.03230404854
valid_h0_row_norms_min: 0.065353885293
valid_objective: 0.160998404026
valid_y_col_norms_max: 5.2253780365
valid_y_col_norms_mean: 4.62542486191
valid_y_col_norms_min: 4.01688957214
valid_y_max_max_class: 0.999998152256
valid_y_mean_max_class: 0.936175227165
valid_y_min_max_class: 0.220791786909
valid_y_misclass: 0.0441000014544
valid_y_nll: 0.160998404026
valid_y_row_norms_max: 1.50107598305
valid_y_row_norms_mean: 0.612406551838
valid_y_row_norms_min: 0.125712171197
Time this epoch: 35.425083 seconds
Monitoring step:
Epochs seen: 11
Batches seen: 55
Examples seen: 550000
ave_grad_mult: 1.14648592472
ave_grad_size: 0.0634888410568
ave_step_size: 0.0666681230068
test_h0_col_norms_max: 6.28032588959
test_h0_col_norms_mean: 3.88434314728
test_h0_col_norms_min: 2.09576916695
test_h0_max_x_max_u: 0.999999403954
test_h0_max_x_mean_u: 0.942800343037
test_h0_max_x_min_u: 0.63667178154
test_h0_mean_x_max_u: 0.915984809399
test_h0_mean_x_mean_u: 0.468008965254
test_h0_mean_x_min_u: 0.101051539183
test_h0_min_x_max_u: 0.390204340219
test_h0_min_x_mean_u: 0.0434938073158
test_h0_min_x_min_u: 5.66224080956e-08
test_h0_row_norms_max: 6.06034469604
test_h0_row_norms_mean: 3.03785538673
test_h0_row_norms_min: 0.0704936757684
test_objective: 0.15456405282
test_y_col_norms_max: 5.50095510483
test_y_col_norms_mean: 4.82304191589
test_y_col_norms_min: 4.1173620224
test_y_max_max_class: 0.999992728233
test_y_mean_max_class: 0.936915397644
test_y_min_max_class: 0.252786010504
test_y_misclass: 0.0443000011146
test_y_nll: 0.15456405282
test_y_row_norms_max: 1.60092997551
test_y_row_norms_mean: 0.636273026466
test_y_row_norms_min: 0.124862372875
train_h0_col_norms_max: 6.28032636642
train_h0_col_norms_mean: 3.88434290886
train_h0_col_norms_min: 2.09576892853
train_h0_max_x_max_u: 0.999998629093
train_h0_max_x_mean_u: 0.944033026695
train_h0_max_x_min_u: 0.631079792976
train_h0_mean_x_max_u: 0.90686249733
train_h0_mean_x_mean_u: 0.468293100595
train_h0_mean_x_min_u: 0.103928506374
train_h0_min_x_max_u: 0.373679548502
train_h0_min_x_mean_u: 0.0424839258194
train_h0_min_x_min_u: 8.3652395233e-08
train_h0_row_norms_max: 6.06034517288
train_h0_row_norms_mean: 3.03785514832
train_h0_row_norms_min: 0.0704936683178
train_objective: 0.146077007055
train_y_col_norms_max: 5.50095510483
train_y_col_norms_mean: 4.82304239273
train_y_col_norms_min: 4.11736249924
train_y_max_max_class: 0.999997377396
train_y_mean_max_class: 0.935088992119
train_y_min_max_class: 0.235717624426
train_y_misclass: 0.0411599949002
train_y_nll: 0.146077007055
train_y_row_norms_max: 1.60092973709
train_y_row_norms_mean: 0.636273086071
train_y_row_norms_min: 0.124862357974
valid_h0_col_norms_max: 6.28032588959
valid_h0_col_norms_mean: 3.88434314728
valid_h0_col_norms_min: 2.09576916695
valid_h0_max_x_max_u: 0.999998867512
valid_h0_max_x_mean_u: 0.943427741528
valid_h0_max_x_min_u: 0.627752363682
valid_h0_mean_x_max_u: 0.910161554813
valid_h0_mean_x_mean_u: 0.468341171741
valid_h0_mean_x_min_u: 0.104510381818
valid_h0_min_x_max_u: 0.357529014349
valid_h0_min_x_mean_u: 0.0429090820253
valid_h0_min_x_min_u: 6.19904838572e-08
valid_h0_row_norms_max: 6.06034469604
valid_h0_row_norms_mean: 3.03785538673
valid_h0_row_norms_min: 0.0704936757684
valid_objective: 0.149976089597
valid_y_col_norms_max: 5.50095510483
valid_y_col_norms_mean: 4.82304191589
valid_y_col_norms_min: 4.1173620224
valid_y_max_max_class: 0.999998509884
valid_y_mean_max_class: 0.939062952995
valid_y_min_max_class: 0.239928662777
valid_y_misclass: 0.0416000001132
valid_y_nll: 0.149976089597
valid_y_row_norms_max: 1.60092997551
valid_y_row_norms_mean: 0.636273026466
valid_y_row_norms_min: 0.124862372875
Time this epoch: 35.174293 seconds
Monitoring step:
Epochs seen: 12
Batches seen: 60
Examples seen: 600000
ave_grad_mult: 1.16790962219
ave_grad_size: 0.0593062080443
ave_step_size: 0.0650760680437
test_h0_col_norms_max: 6.28521823883
test_h0_col_norms_mean: 3.89019036293
test_h0_col_norms_min: 2.09752202034
test_h0_max_x_max_u: 0.999999582767
test_h0_max_x_mean_u: 0.94396853447
test_h0_max_x_min_u: 0.629440486431
test_h0_mean_x_max_u: 0.920006334782
test_h0_mean_x_mean_u: 0.467411011457
test_h0_mean_x_min_u: 0.0957048162818
test_h0_min_x_max_u: 0.389512062073
test_h0_min_x_mean_u: 0.0425479598343
test_h0_min_x_min_u: 3.20807167498e-08
test_h0_row_norms_max: 6.08012914658
test_h0_row_norms_mean: 3.04289364815
test_h0_row_norms_min: 0.0743318274617
test_objective: 0.144802451134
test_y_col_norms_max: 5.74849033356
test_y_col_norms_mean: 5.00328540802
test_y_col_norms_min: 4.21305179596
test_y_max_max_class: 0.999994158745
test_y_mean_max_class: 0.941429018974
test_y_min_max_class: 0.231030538678
test_y_misclass: 0.0408000014722
test_y_nll: 0.144802451134
test_y_row_norms_max: 1.7184125185
test_y_row_norms_mean: 0.658156752586
test_y_row_norms_min: 0.125041946769
train_h0_col_norms_max: 6.28521871567
train_h0_col_norms_mean: 3.89019012451
train_h0_col_norms_min: 2.09752202034
train_h0_max_x_max_u: 0.999999046326
train_h0_max_x_mean_u: 0.945232570171
train_h0_max_x_min_u: 0.634238958359
train_h0_mean_x_max_u: 0.911378622055
train_h0_mean_x_mean_u: 0.467698544264
train_h0_mean_x_min_u: 0.0993719547987
train_h0_min_x_max_u: 0.373709738255
train_h0_min_x_mean_u: 0.0415380932391
train_h0_min_x_min_u: 4.67483047828e-08
train_h0_row_norms_max: 6.08012914658
train_h0_row_norms_mean: 3.04289340973
train_h0_row_norms_min: 0.0743318200111
train_objective: 0.135217413306
train_y_col_norms_max: 5.74849033356
train_y_col_norms_mean: 5.00328493118
train_y_col_norms_min: 4.21305131912
train_y_max_max_class: 0.999997973442
train_y_mean_max_class: 0.939604878426
train_y_min_max_class: 0.252161383629
train_y_misclass: 0.0378999970853
train_y_nll: 0.135217413306
train_y_row_norms_max: 1.71841263771
train_y_row_norms_mean: 0.658156752586
train_y_row_norms_min: 0.125041931868
valid_h0_col_norms_max: 6.28521823883
valid_h0_col_norms_mean: 3.89019036293
valid_h0_col_norms_min: 2.09752202034
valid_h0_max_x_max_u: 0.99999922514
valid_h0_max_x_mean_u: 0.944651842117
valid_h0_max_x_min_u: 0.645568966866
valid_h0_mean_x_max_u: 0.914412498474
valid_h0_mean_x_mean_u: 0.467748105526
valid_h0_mean_x_min_u: 0.0972835198045
valid_h0_min_x_max_u: 0.359061449766
valid_h0_min_x_mean_u: 0.0419331230223
valid_h0_min_x_min_u: 3.37856427279e-08
valid_h0_row_norms_max: 6.08012914658
valid_h0_row_norms_mean: 3.04289364815
valid_h0_row_norms_min: 0.0743318274617
valid_objective: 0.141469165683
valid_y_col_norms_max: 5.74849033356
valid_y_col_norms_mean: 5.00328540802
valid_y_col_norms_min: 4.21305179596
valid_y_max_max_class: 0.999998867512
valid_y_mean_max_class: 0.943582773209
valid_y_min_max_class: 0.241308540106
valid_y_misclass: 0.0379000008106
valid_y_nll: 0.141469165683
valid_y_row_norms_max: 1.7184125185
valid_y_row_norms_mean: 0.658156752586
valid_y_row_norms_min: 0.125041946769
Time this epoch: 35.417259 seconds
Monitoring step:
Epochs seen: 13
Batches seen: 65
Examples seen: 650000
ave_grad_mult: 1.26017534733
ave_grad_size: 0.0564817748964
ave_step_size: 0.066411331296
test_h0_col_norms_max: 6.29147386551
test_h0_col_norms_mean: 3.89687585831
test_h0_col_norms_min: 2.09867763519
test_h0_max_x_max_u: 1.0
test_h0_max_x_mean_u: 0.944756031036
test_h0_max_x_min_u: 0.627687215805
test_h0_mean_x_max_u: 0.920987069607
test_h0_mean_x_mean_u: 0.46668151021
test_h0_mean_x_min_u: 0.0932114943862
test_h0_min_x_max_u: 0.397424399853
test_h0_min_x_mean_u: 0.041878964752
test_h0_min_x_min_u: 1.39939251298e-08
test_h0_row_norms_max: 6.10229206085
test_h0_row_norms_mean: 3.04865932465
test_h0_row_norms_min: 0.0787999555469
test_objective: 0.135829210281
test_y_col_norms_max: 6.01105213165
test_y_col_norms_mean: 5.20304250717
test_y_col_norms_min: 4.33085250854
test_y_max_max_class: 0.999994754791
test_y_mean_max_class: 0.945015072823
test_y_min_max_class: 0.230172082782
test_y_misclass: 0.0381999947131
test_y_nll: 0.135829210281
test_y_row_norms_max: 1.85168874264
test_y_row_norms_mean: 0.682141900063
test_y_row_norms_min: 0.125363498926
train_h0_col_norms_max: 6.29147338867
train_h0_col_norms_mean: 3.89687561989
train_h0_col_norms_min: 2.09867739677
train_h0_max_x_max_u: 0.999999403954
train_h0_max_x_mean_u: 0.946107804775
train_h0_max_x_min_u: 0.63179987669
train_h0_mean_x_max_u: 0.912519574165
train_h0_mean_x_mean_u: 0.466963618994
train_h0_mean_x_min_u: 0.0961530357599
train_h0_min_x_max_u: 0.379027783871
train_h0_min_x_mean_u: 0.0408683530986
train_h0_min_x_min_u: 1.94427727251e-08
train_h0_row_norms_max: 6.10229253769
train_h0_row_norms_mean: 3.04865932465
train_h0_row_norms_min: 0.0787999555469
train_objective: 0.12386597693
train_y_col_norms_max: 6.01105213165
train_y_col_norms_mean: 5.20304203033
train_y_col_norms_min: 4.33085203171
train_y_max_max_class: 0.999997973442
train_y_mean_max_class: 0.943518102169
train_y_min_max_class: 0.246507614851
train_y_misclass: 0.034559994936
train_y_nll: 0.12386597693
train_y_row_norms_max: 1.85168862343
train_y_row_norms_mean: 0.682141840458
train_y_row_norms_min: 0.125363498926
valid_h0_col_norms_max: 6.29147386551
valid_h0_col_norms_mean: 3.89687585831
valid_h0_col_norms_min: 2.09867763519
valid_h0_max_x_max_u: 0.999999403954
valid_h0_max_x_mean_u: 0.945632517338
valid_h0_max_x_min_u: 0.651219964027
valid_h0_mean_x_max_u: 0.915507853031
valid_h0_mean_x_mean_u: 0.467023015022
valid_h0_mean_x_min_u: 0.0969914197922
valid_h0_min_x_max_u: 0.364903271198
valid_h0_min_x_mean_u: 0.0411523580551
valid_h0_min_x_min_u: 1.40916096569e-08
valid_h0_row_norms_max: 6.10229206085
valid_h0_row_norms_mean: 3.04865932465
valid_h0_row_norms_min: 0.0787999555469
valid_objective: 0.133389517665
valid_y_col_norms_max: 6.01105213165
valid_y_col_norms_mean: 5.20304250717
valid_y_col_norms_min: 4.33085250854
valid_y_max_max_class: 0.999999165535
valid_y_mean_max_class: 0.946852385998
valid_y_min_max_class: 0.214304342866
valid_y_misclass: 0.03579999879
valid_y_nll: 0.133389517665
valid_y_row_norms_max: 1.85168874264
valid_y_row_norms_mean: 0.682141900063
valid_y_row_norms_min: 0.125363498926
Time this epoch: 35.366187 seconds
Monitoring step:
Epochs seen: 14
Batches seen: 70
Examples seen: 700000
ave_grad_mult: 1.40761697292
ave_grad_size: 0.0550340935588
ave_step_size: 0.0714166760445
test_h0_col_norms_max: 6.29854393005
test_h0_col_norms_mean: 3.90459442139
test_h0_col_norms_min: 2.1004254818
test_h0_max_x_max_u: 1.0
test_h0_max_x_mean_u: 0.94591987133
test_h0_max_x_min_u: 0.614766418934
test_h0_mean_x_max_u: 0.921078026295
test_h0_mean_x_mean_u: 0.466091096401
test_h0_mean_x_min_u: 0.0916782915592
test_h0_min_x_max_u: 0.396448850632
test_h0_min_x_mean_u: 0.0410171151161
test_h0_min_x_min_u: 7.75897479599e-09
test_h0_row_norms_max: 6.12455701828
test_h0_row_norms_mean: 3.05531406403
test_h0_row_norms_min: 0.0834630578756
test_objective: 0.125504016876
test_y_col_norms_max: 6.29601860046
test_y_col_norms_mean: 5.42890501022
test_y_col_norms_min: 4.46609354019
test_y_max_max_class: 0.999994575977
test_y_mean_max_class: 0.949410498142
test_y_min_max_class: 0.201501131058
test_y_misclass: 0.0355000011623
test_y_nll: 0.125504016876
test_y_row_norms_max: 2.01360034943
test_y_row_norms_mean: 0.709331393242
test_y_row_norms_min: 0.125074863434
train_h0_col_norms_max: 6.29854393005
train_h0_col_norms_mean: 3.90459418297
train_h0_col_norms_min: 2.10042524338
train_h0_max_x_max_u: 0.999999523163
train_h0_max_x_mean_u: 0.947336554527
train_h0_max_x_min_u: 0.624508261681
train_h0_mean_x_max_u: 0.912684559822
train_h0_mean_x_mean_u: 0.466372013092
train_h0_mean_x_min_u: 0.0946839675307
train_h0_min_x_max_u: 0.383265286684
train_h0_min_x_mean_u: 0.0400523841381
train_h0_min_x_min_u: 1.04573256721e-08
train_h0_row_norms_max: 6.12455654144
train_h0_row_norms_mean: 3.05531382561
train_h0_row_norms_min: 0.0834630504251
train_objective: 0.112524747849
train_y_col_norms_max: 6.29601955414
train_y_col_norms_mean: 5.42890501022
train_y_col_norms_min: 4.46609306335
train_y_max_max_class: 0.999997973442
train_y_mean_max_class: 0.948245584965
train_y_min_max_class: 0.237888276577
train_y_misclass: 0.031159998849
train_y_nll: 0.112524747849
train_y_row_norms_max: 2.01360034943
train_y_row_norms_mean: 0.709331333637
train_y_row_norms_min: 0.125074848533
valid_h0_col_norms_max: 6.29854393005
valid_h0_col_norms_mean: 3.90459442139
valid_h0_col_norms_min: 2.1004254818
valid_h0_max_x_max_u: 0.999999582767
valid_h0_max_x_mean_u: 0.946813523769
valid_h0_max_x_min_u: 0.649647653103
valid_h0_mean_x_max_u: 0.915705919266
valid_h0_mean_x_mean_u: 0.466423898935
valid_h0_mean_x_min_u: 0.0953802764416
valid_h0_min_x_max_u: 0.369607925415
valid_h0_min_x_mean_u: 0.0402967631817
valid_h0_min_x_min_u: 7.3467920636e-09
valid_h0_row_norms_max: 6.12455701828
valid_h0_row_norms_mean: 3.05531406403
valid_h0_row_norms_min: 0.0834630578756
valid_objective: 0.124651312828
valid_y_col_norms_max: 6.29601860046
valid_y_col_norms_mean: 5.42890501022
valid_y_col_norms_min: 4.46609354019
valid_y_max_max_class: 0.999999046326
valid_y_mean_max_class: 0.950519561768
valid_y_min_max_class: 0.27420938015
valid_y_misclass: 0.0340000018477
valid_y_nll: 0.124651312828
valid_y_row_norms_max: 2.01360034943
valid_y_row_norms_mean: 0.709331393242
valid_y_row_norms_min: 0.125074863434
Time this epoch: 35.379965 seconds
Monitoring step:
Epochs seen: 15
Batches seen: 75
Examples seen: 750000
ave_grad_mult: 1.47251427174
ave_grad_size: 0.0522134304047
ave_step_size: 0.071938700974
test_h0_col_norms_max: 6.30543804169
test_h0_col_norms_mean: 3.91161727905
test_h0_col_norms_min: 2.10170149803
test_h0_max_x_max_u: 1.0
test_h0_max_x_mean_u: 0.947175860405
test_h0_max_x_min_u: 0.611504435539
test_h0_mean_x_max_u: 0.926122069359
test_h0_mean_x_mean_u: 0.466326773167
test_h0_mean_x_min_u: 0.0923119410872
test_h0_min_x_max_u: 0.401742935181
test_h0_min_x_mean_u: 0.040495429188
test_h0_min_x_min_u: 3.92714794017e-09
test_h0_row_norms_max: 6.1450252533
test_h0_row_norms_mean: 3.0613322258
test_h0_row_norms_min: 0.0881084352732
test_objective: 0.118968196213
test_y_col_norms_max: 6.55995035172
test_y_col_norms_mean: 5.62976980209
test_y_col_norms_min: 4.59543800354
test_y_max_max_class: 0.999994218349
test_y_mean_max_class: 0.951547503471
test_y_min_max_class: 0.228803291917
test_y_misclass: 0.0349999964237
test_y_nll: 0.118968196213
test_y_row_norms_max: 2.14034724236
test_y_row_norms_mean: 0.733418226242
test_y_row_norms_min: 0.12729588151
train_h0_col_norms_max: 6.30543756485
train_h0_col_norms_mean: 3.91161704063
train_h0_col_norms_min: 2.10170149803
train_h0_max_x_max_u: 0.999999880791
train_h0_max_x_mean_u: 0.948555886745
train_h0_max_x_min_u: 0.618095517159
train_h0_mean_x_max_u: 0.918341517448
train_h0_mean_x_mean_u: 0.466599404812
train_h0_mean_x_min_u: 0.0956889539957
train_h0_min_x_max_u: 0.392006248236
train_h0_min_x_mean_u: 0.0395230464637
train_h0_min_x_min_u: 5.12299225264e-09
train_h0_row_norms_max: 6.14502477646
train_h0_row_norms_mean: 3.06133174896
train_h0_row_norms_min: 0.088108420372
train_objective: 0.103299617767
train_y_col_norms_max: 6.55995082855
train_y_col_norms_mean: 5.62976932526
train_y_col_norms_min: 4.5954375267
train_y_max_max_class: 0.999997377396
train_y_mean_max_class: 0.951093494892
train_y_min_max_class: 0.249234974384
train_y_misclass: 0.0284799989313
train_y_nll: 0.103299617767
train_y_row_norms_max: 2.14034700394
train_y_row_norms_mean: 0.733418226242
train_y_row_norms_min: 0.12729588151
valid_h0_col_norms_max: 6.30543804169
valid_h0_col_norms_mean: 3.91161727905
valid_h0_col_norms_min: 2.10170149803
valid_h0_max_x_max_u: 1.0
valid_h0_max_x_mean_u: 0.947986066341
valid_h0_max_x_min_u: 0.642793953419
valid_h0_mean_x_max_u: 0.920964062214
valid_h0_mean_x_mean_u: 0.466656267643
valid_h0_mean_x_min_u: 0.0940045118332
valid_h0_min_x_max_u: 0.377251118422
valid_h0_min_x_mean_u: 0.03970798105
valid_h0_min_x_min_u: 3.59755203405e-09
valid_h0_row_norms_max: 6.1450252533
valid_h0_row_norms_mean: 3.0613322258
valid_h0_row_norms_min: 0.0881084352732
valid_objective: 0.119057364762
valid_y_col_norms_max: 6.55995035172
valid_y_col_norms_mean: 5.62976980209
valid_y_col_norms_min: 4.59543800354
valid_y_max_max_class: 0.999998807907
valid_y_mean_max_class: 0.953496754169
valid_y_min_max_class: 0.279151201248
valid_y_misclass: 0.0322999954224
valid_y_nll: 0.119057364762
valid_y_row_norms_max: 2.14034724236
valid_y_row_norms_mean: 0.733418226242
valid_y_row_norms_min: 0.12729588151
Time this epoch: 35.163641 seconds
Monitoring step:
Epochs seen: 16
Batches seen: 80
Examples seen: 800000
ave_grad_mult: 1.55044400692
ave_grad_size: 0.0495749413967
ave_step_size: 0.071437291801
test_h0_col_norms_max: 6.31254959106
test_h0_col_norms_mean: 3.91860723495
test_h0_col_norms_min: 2.10440206528
test_h0_max_x_max_u: 1.0
test_h0_max_x_mean_u: 0.948059678078
test_h0_max_x_min_u: 0.594348907471
test_h0_mean_x_max_u: 0.927690863609
test_h0_mean_x_mean_u: 0.465816915035
test_h0_mean_x_min_u: 0.0875093266368
test_h0_min_x_max_u: 0.399484992027
test_h0_min_x_mean_u: 0.0397286936641
test_h0_min_x_min_u: 2.14466333581e-09
test_h0_row_norms_max: 6.16418838501
test_h0_row_norms_mean: 3.06732678413
test_h0_row_norms_min: 0.091041892767
test_objective: 0.111787736416
test_y_col_norms_max: 6.79865264893
test_y_col_norms_mean: 5.8274474144
test_y_col_norms_min: 4.71656274796
test_y_max_max_class: 0.999996483326
test_y_mean_max_class: 0.954726696014
test_y_min_max_class: 0.287018150091
test_y_misclass: 0.0328000001609
test_y_nll: 0.111787736416
test_y_row_norms_max: 2.27131104469
test_y_row_norms_mean: 0.757337749004
test_y_row_norms_min: 0.12875507772
train_h0_col_norms_max: 6.31254959106
train_h0_col_norms_mean: 3.91860699654
train_h0_col_norms_min: 2.10440182686
train_h0_max_x_max_u: 0.999999880791
train_h0_max_x_mean_u: 0.949532628059
train_h0_max_x_min_u: 0.603366672993
train_h0_mean_x_max_u: 0.920095324516
train_h0_mean_x_mean_u: 0.466088950634
train_h0_mean_x_min_u: 0.0905980989337
train_h0_min_x_max_u: 0.391806066036
train_h0_min_x_mean_u: 0.0387711115181
train_h0_min_x_min_u: 2.82344658764e-09
train_h0_row_norms_max: 6.16418838501
train_h0_row_norms_mean: 3.06732654572
train_h0_row_norms_min: 0.0910418853164
train_objective: 0.0944318547845
train_y_col_norms_max: 6.79865264893
train_y_col_norms_mean: 5.82744646072
train_y_col_norms_min: 4.71656322479
train_y_max_max_class: 0.999998569489
train_y_mean_max_class: 0.954577803612
train_y_min_max_class: 0.255649060011
train_y_misclass: 0.0261199977249
train_y_nll: 0.0944318547845
train_y_row_norms_max: 2.27131080627
train_y_row_norms_mean: 0.757337749004
train_y_row_norms_min: 0.128755062819
valid_h0_col_norms_max: 6.31254959106
valid_h0_col_norms_mean: 3.91860723495
valid_h0_col_norms_min: 2.10440206528
valid_h0_max_x_max_u: 1.0
valid_h0_max_x_mean_u: 0.948857724667
valid_h0_max_x_min_u: 0.6251745224
valid_h0_mean_x_max_u: 0.922635018826
valid_h0_mean_x_mean_u: 0.46614703536
valid_h0_mean_x_min_u: 0.0918302312493
valid_h0_min_x_max_u: 0.379304587841
valid_h0_min_x_mean_u: 0.0389546044171
valid_h0_min_x_min_u: 1.96204941183e-09
valid_h0_row_norms_max: 6.16418838501
valid_h0_row_norms_mean: 3.06732678413
valid_h0_row_norms_min: 0.091041892767
valid_objective: 0.110771089792
valid_y_col_norms_max: 6.79865264893
valid_y_col_norms_mean: 5.8274474144
valid_y_col_norms_min: 4.71656274796
valid_y_max_max_class: 0.999999165535
valid_y_mean_max_class: 0.95663100481
valid_y_min_max_class: 0.264041811228
valid_y_misclass: 0.0305000003427
valid_y_nll: 0.110771089792
valid_y_row_norms_max: 2.27131104469
valid_y_row_norms_mean: 0.757337749004
valid_y_row_norms_min: 0.12875507772
Time this epoch: 35.246666 seconds
Monitoring step:
Epochs seen: 17
Batches seen: 85
Examples seen: 850000
ave_grad_mult: 1.59982562065
ave_grad_size: 0.0473937280476
ave_step_size: 0.0712730288506
test_h0_col_norms_max: 6.31961965561
test_h0_col_norms_mean: 3.92528343201
test_h0_col_norms_min: 2.10622811317
test_h0_max_x_max_u: 1.0
test_h0_max_x_mean_u: 0.949356853962
test_h0_max_x_min_u: 0.59099650383
test_h0_mean_x_max_u: 0.927693426609
test_h0_mean_x_mean_u: 0.465647280216
test_h0_mean_x_min_u: 0.0867232903838
test_h0_min_x_max_u: 0.39404541254
test_h0_min_x_mean_u: 0.0387796163559
test_h0_min_x_min_u: 1.4791411429e-09
test_h0_row_norms_max: 6.18163251877
test_h0_row_norms_mean: 3.07301926613
test_h0_row_norms_min: 0.0938726961613
test_objective: 0.106328338385
test_y_col_norms_max: 7.01830482483
test_y_col_norms_mean: 6.0149974823
test_y_col_norms_min: 4.83683490753
test_y_max_max_class: 0.999997198582
test_y_mean_max_class: 0.95773011446
test_y_min_max_class: 0.291382759809
test_y_misclass: 0.0320000015199
test_y_nll: 0.106328338385
test_y_row_norms_max: 2.38739275932
test_y_row_norms_mean: 0.780075967312
test_y_row_norms_min: 0.130353063345
train_h0_col_norms_max: 6.31961917877
train_h0_col_norms_mean: 3.92528319359
train_h0_col_norms_min: 2.10622787476
train_h0_max_x_max_u: 0.999999940395
train_h0_max_x_mean_u: 0.950781822205
train_h0_max_x_min_u: 0.600085794926
train_h0_mean_x_max_u: 0.920143485069
train_h0_mean_x_mean_u: 0.465922415257
train_h0_mean_x_min_u: 0.0898449495435
train_h0_min_x_max_u: 0.391092181206
train_h0_min_x_mean_u: 0.0378985367715
train_h0_min_x_min_u: 1.99124361444e-09
train_h0_row_norms_max: 6.18163204193
train_h0_row_norms_mean: 3.07301878929
train_h0_row_norms_min: 0.0938726961613
train_objective: 0.088271394372
train_y_col_norms_max: 7.01830387115
train_y_col_norms_mean: 6.0149974823
train_y_col_norms_min: 4.83683490753
train_y_max_max_class: 0.999998629093
train_y_mean_max_class: 0.957574307919
train_y_min_max_class: 0.276376664639
train_y_misclass: 0.023999998346
train_y_nll: 0.088271394372
train_y_row_norms_max: 2.3873925209
train_y_row_norms_mean: 0.780075907707
train_y_row_norms_min: 0.130353048444
valid_h0_col_norms_max: 6.31961965561
valid_h0_col_norms_mean: 3.92528343201
valid_h0_col_norms_min: 2.10622811317
valid_h0_max_x_max_u: 1.0
valid_h0_max_x_mean_u: 0.950090706348
valid_h0_max_x_min_u: 0.620329141617
valid_h0_mean_x_max_u: 0.922675073147
valid_h0_mean_x_mean_u: 0.465976387262
valid_h0_mean_x_min_u: 0.0912392660975
valid_h0_min_x_max_u: 0.378798425198
valid_h0_min_x_mean_u: 0.0380813851953
valid_h0_min_x_min_u: 1.35891120578e-09
valid_h0_row_norms_max: 6.18163251877
valid_h0_row_norms_mean: 3.07301926613
valid_h0_row_norms_min: 0.0938726961613
valid_objective: 0.107352338731
valid_y_col_norms_max: 7.01830482483
valid_y_col_norms_mean: 6.0149974823
valid_y_col_norms_min: 4.83683490753
valid_y_max_max_class: 0.999998867512
valid_y_mean_max_class: 0.959039092064
valid_y_min_max_class: 0.278402447701
valid_y_misclass: 0.0296999998391
valid_y_nll: 0.107352338731
valid_y_row_norms_max: 2.38739275932
valid_y_row_norms_mean: 0.780075967312
valid_y_row_norms_min: 0.130353063345
Time this epoch: 35.302343 seconds
Monitoring step:
Epochs seen: 18
Batches seen: 90
Examples seen: 900000
ave_grad_mult: 1.79280376434
ave_grad_size: 0.0464615598321
ave_step_size: 0.0771328359842
test_h0_col_norms_max: 6.32822799683
test_h0_col_norms_mean: 3.93359160423
test_h0_col_norms_min: 2.10832476616
test_h0_max_x_max_u: 1.0
test_h0_max_x_mean_u: 0.95045799017
test_h0_max_x_min_u: 0.586617648602
test_h0_mean_x_max_u: 0.929918944836
test_h0_mean_x_mean_u: 0.465379714966
test_h0_mean_x_min_u: 0.083888605237
test_h0_min_x_max_u: 0.397964477539
test_h0_min_x_mean_u: 0.0380010083318
test_h0_min_x_min_u: 7.37118366345e-10
test_h0_row_norms_max: 6.20447731018
test_h0_row_norms_mean: 3.08011174202
test_h0_row_norms_min: 0.0980293303728
test_objective: 0.100425355136
test_y_col_norms_max: 7.28403282166
test_y_col_norms_mean: 6.2393155098
test_y_col_norms_min: 4.98830795288
test_y_max_max_class: 0.999997019768
test_y_mean_max_class: 0.959611177444
test_y_min_max_class: 0.283116281033
test_y_misclass: 0.03039999865
test_y_nll: 0.100425355136
test_y_row_norms_max: 2.53001952171
test_y_row_norms_mean: 0.806962490082
test_y_row_norms_min: 0.131183430552
train_h0_col_norms_max: 6.32822799683
train_h0_col_norms_mean: 3.93359088898
train_h0_col_norms_min: 2.10832476616
train_h0_max_x_max_u: 0.999999940395
train_h0_max_x_mean_u: 0.951834976673
train_h0_max_x_min_u: 0.594883143902
train_h0_mean_x_max_u: 0.922610759735
train_h0_mean_x_mean_u: 0.465650081635
train_h0_mean_x_min_u: 0.0870256572962
train_h0_min_x_max_u: 0.393012464046
train_h0_min_x_mean_u: 0.0370769426227
train_h0_min_x_min_u: 9.6733221433e-10
train_h0_row_norms_max: 6.20447683334
train_h0_row_norms_mean: 3.0801115036
train_h0_row_norms_min: 0.0980293378234
train_objective: 0.0801135376096
train_y_col_norms_max: 7.28403186798
train_y_col_norms_mean: 6.23931598663
train_y_col_norms_min: 4.98830747604
train_y_max_max_class: 0.999998509884
train_y_mean_max_class: 0.960199356079
train_y_min_max_class: 0.269580304623
train_y_misclass: 0.0213200002909
train_y_nll: 0.0801135376096
train_y_row_norms_max: 2.53001952171
train_y_row_norms_mean: 0.806962549686
train_y_row_norms_min: 0.131183415651
valid_h0_col_norms_max: 6.32822799683
valid_h0_col_norms_mean: 3.93359160423
valid_h0_col_norms_min: 2.10832476616
valid_h0_max_x_max_u: 1.0
valid_h0_max_x_mean_u: 0.951114416122
valid_h0_max_x_min_u: 0.613163709641
valid_h0_mean_x_max_u: 0.925033152103
valid_h0_mean_x_mean_u: 0.465698361397
valid_h0_mean_x_min_u: 0.0884924307466
valid_h0_min_x_max_u: 0.386198699474
valid_h0_min_x_mean_u: 0.0373685508966
valid_h0_min_x_min_u: 6.73591848965e-10
valid_h0_row_norms_max: 6.20447731018
valid_h0_row_norms_mean: 3.08011174202
valid_h0_row_norms_min: 0.0980293303728
valid_objective: 0.101348236203
valid_y_col_norms_max: 7.28403282166
valid_y_col_norms_mean: 6.2393155098
valid_y_col_norms_min: 4.98830795288
valid_y_max_max_class: 0.99999922514
valid_y_mean_max_class: 0.961142122746
valid_y_min_max_class: 0.255374312401
valid_y_misclass: 0.028299998492
valid_y_nll: 0.101348236203
valid_y_row_norms_max: 2.53001952171
valid_y_row_norms_mean: 0.806962490082
valid_y_row_norms_min: 0.131183430552
Time this epoch: 35.215917 seconds
Monitoring step:
Epochs seen: 19
Batches seen: 95
Examples seen: 950000
ave_grad_mult: 1.94697141647
ave_grad_size: 0.0453744120896
ave_step_size: 0.0806727781892
test_h0_col_norms_max: 6.33764886856
test_h0_col_norms_mean: 3.94183731079
test_h0_col_norms_min: 2.11102938652
test_h0_max_x_max_u: 1.0
test_h0_max_x_mean_u: 0.951863646507
test_h0_max_x_min_u: 0.580549895763
test_h0_mean_x_max_u: 0.932179749012
test_h0_mean_x_mean_u: 0.465434730053
test_h0_mean_x_min_u: 0.0796971917152
test_h0_min_x_max_u: 0.390337795019
test_h0_min_x_mean_u: 0.0372235476971
test_h0_min_x_min_u: 6.10773209786e-10
test_h0_row_norms_max: 6.22417736053
test_h0_row_norms_mean: 3.08713316917
test_h0_row_norms_min: 0.101160049438
test_objective: 0.0948458611965
test_y_col_norms_max: 7.54131317139
test_y_col_norms_mean: 6.45906209946
test_y_col_norms_min: 5.14208126068
test_y_max_max_class: 0.999998509884
test_y_mean_max_class: 0.962593019009
test_y_min_max_class: 0.309717655182
test_y_misclass: 0.0273999981582
test_y_nll: 0.0948458611965
test_y_row_norms_max: 2.65757870674
test_y_row_norms_mean: 0.83378046751
test_y_row_norms_min: 0.132128432393
train_h0_col_norms_max: 6.3376493454
train_h0_col_norms_mean: 3.94183754921
train_h0_col_norms_min: 2.1110291481
train_h0_max_x_max_u: 0.999999940395
train_h0_max_x_mean_u: 0.953146517277
train_h0_max_x_min_u: 0.591403305531
train_h0_mean_x_max_u: 0.925151884556
train_h0_mean_x_mean_u: 0.465704083443
train_h0_mean_x_min_u: 0.0828539133072
train_h0_min_x_max_u: 0.392235994339
train_h0_min_x_mean_u: 0.0363104119897
train_h0_min_x_min_u: 8.38429381478e-10
train_h0_row_norms_max: 6.2241768837
train_h0_row_norms_mean: 3.08713316917
train_h0_row_norms_min: 0.101160041988
train_objective: 0.073119558394
train_y_col_norms_max: 7.54131317139
train_y_col_norms_mean: 6.45906209946
train_y_col_norms_min: 5.14208078384
train_y_max_max_class: 0.999999344349
train_y_mean_max_class: 0.963022887707
train_y_min_max_class: 0.268300741911
train_y_misclass: 0.0194799974561
train_y_nll: 0.073119558394
train_y_row_norms_max: 2.65757846832
train_y_row_norms_mean: 0.833780527115
train_y_row_norms_min: 0.132128432393
valid_h0_col_norms_max: 6.33764886856
valid_h0_col_norms_mean: 3.94183731079
valid_h0_col_norms_min: 2.11102938652
valid_h0_max_x_max_u: 1.0
valid_h0_max_x_mean_u: 0.952474594116
valid_h0_max_x_min_u: 0.606046676636
valid_h0_mean_x_max_u: 0.927380979061
valid_h0_mean_x_mean_u: 0.465753525496
valid_h0_mean_x_min_u: 0.0843893289566
valid_h0_min_x_max_u: 0.386562854052
valid_h0_min_x_mean_u: 0.0366778969765
valid_h0_min_x_min_u: 5.66411417768e-10
valid_h0_row_norms_max: 6.22417736053
valid_h0_row_norms_mean: 3.08713316917
valid_h0_row_norms_min: 0.101160049438
valid_objective: 0.09637324512
valid_y_col_norms_max: 7.54131317139
valid_y_col_norms_mean: 6.45906209946
valid_y_col_norms_min: 5.14208126068
valid_y_max_max_class: 0.999999463558
valid_y_mean_max_class: 0.96346116066
valid_y_min_max_class: 0.277560830116
valid_y_misclass: 0.0262000001967
valid_y_nll: 0.09637324512
valid_y_row_norms_max: 2.65757870674
valid_y_row_norms_mean: 0.83378046751
valid_y_row_norms_min: 0.132128432393
Time this epoch: 34.760706 seconds
Monitoring step:
Epochs seen: 20
Batches seen: 100
Examples seen: 1000000
ave_grad_mult: 2.02213191986
ave_grad_size: 0.0437575168908
ave_step_size: 0.081667304039
test_h0_col_norms_max: 6.34621286392
test_h0_col_norms_mean: 3.94933509827
test_h0_col_norms_min: 2.11350440979
test_h0_max_x_max_u: 1.0
test_h0_max_x_mean_u: 0.953083157539
test_h0_max_x_min_u: 0.574586033821
test_h0_mean_x_max_u: 0.934979915619
test_h0_mean_x_mean_u: 0.465407788754
test_h0_mean_x_min_u: 0.0830942466855
test_h0_min_x_max_u: 0.386586099863
test_h0_min_x_mean_u: 0.0363725870848
test_h0_min_x_min_u: 3.24080540182e-10
test_h0_row_norms_max: 6.2420706749
test_h0_row_norms_mean: 3.09350514412
test_h0_row_norms_min: 0.104648023844
test_objective: 0.0911609381437
test_y_col_norms_max: 7.76595830917
test_y_col_norms_mean: 6.65801715851
test_y_col_norms_min: 5.27815532684
test_y_max_max_class: 0.999998688698
test_y_mean_max_class: 0.964522898197
test_y_min_max_class: 0.28780567646
test_y_misclass: 0.0263999979943
test_y_nll: 0.0911609381437
test_y_row_norms_max: 2.76887655258
test_y_row_norms_mean: 0.858034849167
test_y_row_norms_min: 0.135387971997
train_h0_col_norms_max: 6.34621238708
train_h0_col_norms_mean: 3.94933462143
train_h0_col_norms_min: 2.11350440979
train_h0_max_x_max_u: 0.999999940395
train_h0_max_x_mean_u: 0.95438170433
train_h0_max_x_min_u: 0.584669828415
train_h0_mean_x_max_u: 0.928267598152
train_h0_mean_x_mean_u: 0.465672910213
train_h0_mean_x_min_u: 0.0862845480442
train_h0_min_x_max_u: 0.389768064022
train_h0_min_x_mean_u: 0.0354867391288
train_h0_min_x_min_u: 4.38173886064e-10
train_h0_row_norms_max: 6.2420706749
train_h0_row_norms_mean: 3.0935049057
train_h0_row_norms_min: 0.104648023844
train_objective: 0.0672194138169
train_y_col_norms_max: 7.76595830917
train_y_col_norms_mean: 6.65801715851
train_y_col_norms_min: 5.27815580368
train_y_max_max_class: 0.999999523163
train_y_mean_max_class: 0.965664386749
train_y_min_max_class: 0.276637971401
train_y_misclass: 0.0176799986511
train_y_nll: 0.0672194138169
train_y_row_norms_max: 2.76887631416
train_y_row_norms_mean: 0.858034789562
train_y_row_norms_min: 0.135387957096
valid_h0_col_norms_max: 6.34621286392
valid_h0_col_norms_mean: 3.94933509827
valid_h0_col_norms_min: 2.11350440979
valid_h0_max_x_max_u: 1.0
valid_h0_max_x_mean_u: 0.953744530678
valid_h0_max_x_min_u: 0.597306787968
valid_h0_mean_x_max_u: 0.930288851261
valid_h0_mean_x_mean_u: 0.465717792511
valid_h0_mean_x_min_u: 0.087476670742
valid_h0_min_x_max_u: 0.389485627413
valid_h0_min_x_mean_u: 0.0357559174299
valid_h0_min_x_min_u: 3.05218794683e-10
valid_h0_row_norms_max: 6.2420706749
valid_h0_row_norms_mean: 3.09350514412
valid_h0_row_norms_min: 0.104648023844
valid_objective: 0.0925975292921
valid_y_col_norms_max: 7.76595830917
valid_y_col_norms_mean: 6.65801715851
valid_y_col_norms_min: 5.27815532684
valid_y_max_max_class: 0.999999761581
valid_y_mean_max_class: 0.965861082077
valid_y_min_max_class: 0.303610026836
valid_y_misclass: 0.0258000008762
valid_y_nll: 0.0925975292921
valid_y_row_norms_max: 2.76887655258
valid_y_row_norms_mean: 0.858034849167
valid_y_row_norms_min: 0.135387971997
Time this epoch: 35.213061 seconds
Monitoring step:
Epochs seen: 21
Batches seen: 105
Examples seen: 1050000
ave_grad_mult: 2.08118438721
ave_grad_size: 0.0415316298604
ave_step_size: 0.080756470561
test_h0_col_norms_max: 6.35434007645
test_h0_col_norms_mean: 3.95622348785
test_h0_col_norms_min: 2.11573195457
test_h0_max_x_max_u: 1.0
test_h0_max_x_mean_u: 0.953851401806
test_h0_max_x_min_u: 0.567606449127
test_h0_mean_x_max_u: 0.933193147182
test_h0_mean_x_mean_u: 0.465032488108
test_h0_mean_x_min_u: 0.0830998793244
test_h0_min_x_max_u: 0.383445978165
test_h0_min_x_mean_u: 0.0356372632086
test_h0_min_x_min_u: 2.19485554731e-10
test_h0_row_norms_max: 6.25859546661
test_h0_row_norms_mean: 3.09933209419
test_h0_row_norms_min: 0.107006825507
test_objective: 0.0886002033949
test_y_col_norms_max: 7.9637556076
test_y_col_norms_mean: 6.83463764191
test_y_col_norms_min: 5.3923330307
test_y_max_max_class: 0.99999833107
test_y_mean_max_class: 0.965270340443
test_y_min_max_class: 0.310471683741
test_y_misclass: 0.0262000001967
test_y_nll: 0.0886002033949
test_y_row_norms_max: 2.86672186852
test_y_row_norms_mean: 0.879510939121
test_y_row_norms_min: 0.136433556676
train_h0_col_norms_max: 6.35433912277
train_h0_col_norms_mean: 3.95622301102
train_h0_col_norms_min: 2.11573171616
train_h0_max_x_max_u: 0.999999940395
train_h0_max_x_mean_u: 0.955171644688
train_h0_max_x_min_u: 0.579390406609
train_h0_mean_x_max_u: 0.926286578178
train_h0_mean_x_mean_u: 0.465295374393
train_h0_mean_x_min_u: 0.0863517001271
train_h0_min_x_max_u: 0.384008407593
train_h0_min_x_mean_u: 0.0348346866667
train_h0_min_x_min_u: 2.7790120205e-10
train_h0_row_norms_max: 6.25859498978
train_h0_row_norms_mean: 3.09933185577
train_h0_row_norms_min: 0.107006818056
train_objective: 0.0625123158097
train_y_col_norms_max: 7.96375513077
train_y_col_norms_mean: 6.83463668823
train_y_col_norms_min: 5.3923330307
train_y_max_max_class: 0.99999922514
train_y_mean_max_class: 0.967036545277
train_y_min_max_class: 0.273270666599
train_y_misclass: 0.0158599987626
train_y_nll: 0.0625123158097
train_y_row_norms_max: 2.8667216301
train_y_row_norms_mean: 0.879510939121
train_y_row_norms_min: 0.136433571577
valid_h0_col_norms_max: 6.35434007645
valid_h0_col_norms_mean: 3.95622348785
valid_h0_col_norms_min: 2.11573195457
valid_h0_max_x_max_u: 1.0
valid_h0_max_x_mean_u: 0.95438760519
valid_h0_max_x_min_u: 0.590360045433
valid_h0_mean_x_max_u: 0.928494334221
valid_h0_mean_x_mean_u: 0.465345591307
valid_h0_mean_x_min_u: 0.0878760442138
valid_h0_min_x_max_u: 0.384474813938
valid_h0_min_x_mean_u: 0.0351748354733
valid_h0_min_x_min_u: 1.96166291544e-10
valid_h0_row_norms_max: 6.25859546661
valid_h0_row_norms_mean: 3.09933209419
valid_h0_row_norms_min: 0.107006825507
valid_objective: 0.0909144356847
valid_y_col_norms_max: 7.9637556076
valid_y_col_norms_mean: 6.83463764191
valid_y_col_norms_min: 5.3923330307
valid_y_max_max_class: 0.999999403954
valid_y_mean_max_class: 0.966769099236
valid_y_min_max_class: 0.282997220755
valid_y_misclass: 0.025399999693
valid_y_nll: 0.0909144356847
valid_y_row_norms_max: 2.86672186852
valid_y_row_norms_mean: 0.879510939121
valid_y_row_norms_min: 0.136433556676
Time this epoch: 35.132773 seconds
Monitoring step:
Epochs seen: 22
Batches seen: 110
Examples seen: 1100000
ave_grad_mult: 2.14148879051
ave_grad_size: 0.0403550490737
ave_step_size: 0.0810787156224
test_h0_col_norms_max: 6.3625164032
test_h0_col_norms_mean: 3.96336507797
test_h0_col_norms_min: 2.11782503128
test_h0_max_x_max_u: 1.0
test_h0_max_x_mean_u: 0.954942882061
test_h0_max_x_min_u: 0.564869940281
test_h0_mean_x_max_u: 0.935991108418
test_h0_mean_x_mean_u: 0.465213596821
test_h0_mean_x_min_u: 0.0809470117092
test_h0_min_x_max_u: 0.385282725096
test_h0_min_x_mean_u: 0.0350002162158
test_h0_min_x_min_u: 1.53522847213e-10
test_h0_row_norms_max: 6.27728748322
test_h0_row_norms_mean: 3.10535025597
test_h0_row_norms_min: 0.109762132168
test_objective: 0.0847353041172
test_y_col_norms_max: 8.15684700012
test_y_col_norms_mean: 7.01448202133
test_y_col_norms_min: 5.519551754
test_y_max_max_class: 0.999998867512
test_y_mean_max_class: 0.967154860497
test_y_min_max_class: 0.283250451088
test_y_misclass: 0.0249000005424
test_y_nll: 0.0847353041172
test_y_row_norms_max: 2.96138525009
test_y_row_norms_mean: 0.901844441891
test_y_row_norms_min: 0.138287782669
train_h0_col_norms_max: 6.3625164032
train_h0_col_norms_mean: 3.96336531639
train_h0_col_norms_min: 2.11782479286
train_h0_max_x_max_u: 0.999999940395
train_h0_max_x_mean_u: 0.956164836884
train_h0_max_x_min_u: 0.575154662132
train_h0_mean_x_max_u: 0.929373860359
train_h0_mean_x_mean_u: 0.465472817421
train_h0_mean_x_min_u: 0.0842097327113
train_h0_min_x_max_u: 0.385039448738
train_h0_min_x_mean_u: 0.0342263542116
train_h0_min_x_min_u: 1.99991759264e-10
train_h0_row_norms_max: 6.27728748322
train_h0_row_norms_mean: 3.10535001755
train_h0_row_norms_min: 0.109762117267
train_objective: 0.0575138144195
train_y_col_norms_max: 8.15684700012
train_y_col_norms_mean: 7.01448202133
train_y_col_norms_min: 5.51955223083
train_y_max_max_class: 0.999999582767
train_y_mean_max_class: 0.96871650219
train_y_min_max_class: 0.287014901638
train_y_misclass: 0.0142399985343
train_y_nll: 0.0575138144195
train_y_row_norms_max: 2.96138525009
train_y_row_norms_mean: 0.901844382286
train_y_row_norms_min: 0.138287782669
valid_h0_col_norms_max: 6.3625164032
valid_h0_col_norms_mean: 3.96336507797
valid_h0_col_norms_min: 2.11782503128
valid_h0_max_x_max_u: 1.0
valid_h0_max_x_mean_u: 0.955441474915
valid_h0_max_x_min_u: 0.589357554913
valid_h0_mean_x_max_u: 0.93136715889
valid_h0_mean_x_mean_u: 0.46551990509
valid_h0_mean_x_min_u: 0.086060911417
valid_h0_min_x_max_u: 0.390778958797
valid_h0_min_x_mean_u: 0.0345365256071
valid_h0_min_x_min_u: 1.45148310038e-10
valid_h0_row_norms_max: 6.27728748322
valid_h0_row_norms_mean: 3.10535025597
valid_h0_row_norms_min: 0.109762132168
valid_objective: 0.0865774899721
valid_y_col_norms_max: 8.15684700012
valid_y_col_norms_mean: 7.01448202133
valid_y_col_norms_min: 5.519551754
valid_y_max_max_class: 0.999999761581
valid_y_mean_max_class: 0.96779280901
valid_y_min_max_class: 0.273192465305
valid_y_misclass: 0.0244999974966
valid_y_nll: 0.0865774899721
valid_y_row_norms_max: 2.96138525009
valid_y_row_norms_mean: 0.901844441891
valid_y_row_norms_min: 0.138287782669
Time this epoch: 35.193111 seconds
Monitoring step:
Epochs seen: 23
Batches seen: 115
Examples seen: 1150000
ave_grad_mult: 2.29178571701
ave_grad_size: 0.0395583026111
ave_step_size: 0.0849489048123
test_h0_col_norms_max: 6.37209796906
test_h0_col_norms_mean: 3.97117829323
test_h0_col_norms_min: 2.11957788467
test_h0_max_x_max_u: 1.0
test_h0_max_x_mean_u: 0.956251323223
test_h0_max_x_min_u: 0.561847269535
test_h0_mean_x_max_u: 0.934819757938
test_h0_mean_x_mean_u: 0.465812414885
test_h0_mean_x_min_u: 0.0860762521625
test_h0_min_x_max_u: 0.382852345705
test_h0_min_x_mean_u: 0.0343066453934
test_h0_min_x_min_u: 1.00234549827e-10
test_h0_row_norms_max: 6.29464244843
test_h0_row_norms_mean: 3.11194372177
test_h0_row_norms_min: 0.112373262644
test_objective: 0.0813909471035
test_y_col_norms_max: 8.37556743622
test_y_col_norms_mean: 7.21202421188
test_y_col_norms_min: 5.66676425934
test_y_max_max_class: 0.99999922514
test_y_mean_max_class: 0.969460964203
test_y_min_max_class: 0.304885983467
test_y_misclass: 0.0245999991894
test_y_nll: 0.0813909471035
test_y_row_norms_max: 3.05758142471
test_y_row_norms_mean: 0.926100432873
test_y_row_norms_min: 0.141218408942
train_h0_col_norms_max: 6.37209796906
train_h0_col_norms_mean: 3.97117805481
train_h0_col_norms_min: 2.11957764626
train_h0_max_x_max_u: 0.999999940395
train_h0_max_x_mean_u: 0.957390427589
train_h0_max_x_min_u: 0.571023106575
train_h0_mean_x_max_u: 0.928050994873
train_h0_mean_x_mean_u: 0.466074705124
train_h0_mean_x_min_u: 0.089665055275
train_h0_min_x_max_u: 0.383693158627
train_h0_min_x_mean_u: 0.0335417687893
train_h0_min_x_min_u: 1.25948085294e-10
train_h0_row_norms_max: 6.29464149475
train_h0_row_norms_mean: 3.11194324493
train_h0_row_norms_min: 0.112373247743
train_objective: 0.0530071258545
train_y_col_norms_max: 8.37556743622
train_y_col_norms_mean: 7.21202325821
train_y_col_norms_min: 5.6667637825
train_y_max_max_class: 0.999999761581
train_y_mean_max_class: 0.971323847771
train_y_min_max_class: 0.274939656258
train_y_misclass: 0.0134199988097
train_y_nll: 0.0530071258545
train_y_row_norms_max: 3.05758142471
train_y_row_norms_mean: 0.926100373268
train_y_row_norms_min: 0.141218394041
valid_h0_col_norms_max: 6.37209796906
valid_h0_col_norms_mean: 3.97117829323
valid_h0_col_norms_min: 2.11957788467
valid_h0_max_x_max_u: 1.0
valid_h0_max_x_mean_u: 0.956661045551
valid_h0_max_x_min_u: 0.585293292999
valid_h0_mean_x_max_u: 0.930199086666
valid_h0_mean_x_mean_u: 0.466111898422
valid_h0_mean_x_min_u: 0.0879896134138
valid_h0_min_x_max_u: 0.389526426792
valid_h0_min_x_mean_u: 0.0339160002768
valid_h0_min_x_min_u: 9.11426628614e-11
valid_h0_row_norms_max: 6.29464244843
valid_h0_row_norms_mean: 3.11194372177
valid_h0_row_norms_min: 0.112373262644
valid_objective: 0.0844431295991
valid_y_col_norms_max: 8.37556743622
valid_y_col_norms_mean: 7.21202421188
valid_y_col_norms_min: 5.66676425934
valid_y_max_max_class: 0.999999761581
valid_y_mean_max_class: 0.970053553581
valid_y_min_max_class: 0.252451866865
valid_y_misclass: 0.0244999974966
valid_y_nll: 0.0844431295991
valid_y_row_norms_max: 3.05758142471
valid_y_row_norms_mean: 0.926100432873
valid_y_row_norms_min: 0.141218408942
Time this epoch: 35.101327 seconds
Monitoring step:
Epochs seen: 24
Batches seen: 120
Examples seen: 1200000
ave_grad_mult: 2.4745285511
ave_grad_size: 0.037980530411
ave_step_size: 0.0874084308743
test_h0_col_norms_max: 6.38188457489
test_h0_col_norms_mean: 3.97928380966
test_h0_col_norms_min: 2.12256121635
test_h0_max_x_max_u: 1.0
test_h0_max_x_mean_u: 0.957348406315
test_h0_max_x_min_u: 0.559364676476
test_h0_mean_x_max_u: 0.934902369976
test_h0_mean_x_mean_u: 0.465777903795
test_h0_mean_x_min_u: 0.0820802301168
test_h0_min_x_max_u: 0.37371224165
test_h0_min_x_mean_u: 0.0335819907486
test_h0_min_x_min_u: 6.84129905504e-11
test_h0_row_norms_max: 6.31191539764
test_h0_row_norms_mean: 3.11876320839
test_h0_row_norms_min: 0.114646181464
test_objective: 0.0791404470801
test_y_col_norms_max: 8.59417057037
test_y_col_norms_mean: 7.40912103653
test_y_col_norms_min: 5.81003856659
test_y_max_max_class: 0.99999922514
test_y_mean_max_class: 0.969864010811
test_y_min_max_class: 0.260499119759
test_y_misclass: 0.0230999998748
test_y_nll: 0.0791404470801
test_y_row_norms_max: 3.15858983994
test_y_row_norms_mean: 0.950569629669
test_y_row_norms_min: 0.144145652652
train_h0_col_norms_max: 6.38188409805
train_h0_col_norms_mean: 3.97928357124
train_h0_col_norms_min: 2.12256097794
train_h0_max_x_max_u: 0.999999940395
train_h0_max_x_mean_u: 0.958476305008
train_h0_max_x_min_u: 0.567814290524
train_h0_mean_x_max_u: 0.928138375282
train_h0_mean_x_mean_u: 0.46603512764
train_h0_mean_x_min_u: 0.0855919569731
train_h0_min_x_max_u: 0.379186630249
train_h0_min_x_mean_u: 0.0329259894788
train_h0_min_x_min_u: 8.38127969804e-11
train_h0_row_norms_max: 6.31191492081
train_h0_row_norms_mean: 3.11876296997
train_h0_row_norms_min: 0.114646181464
train_objective: 0.0484027862549
train_y_col_norms_max: 8.59417057037
train_y_col_norms_mean: 7.40912055969
train_y_col_norms_min: 5.81003761292
train_y_max_max_class: 0.999999701977
train_y_mean_max_class: 0.972274065018
train_y_min_max_class: 0.297603964806
train_y_misclass: 0.0116999996826
train_y_nll: 0.0484027862549
train_y_row_norms_max: 3.15858960152
train_y_row_norms_mean: 0.95056951046
train_y_row_norms_min: 0.144145637751
valid_h0_col_norms_max: 6.38188457489
valid_h0_col_norms_mean: 3.97928380966
valid_h0_col_norms_min: 2.12256121635
valid_h0_max_x_max_u: 1.0
valid_h0_max_x_mean_u: 0.957733333111
valid_h0_max_x_min_u: 0.57974678278
valid_h0_mean_x_max_u: 0.930305242538
valid_h0_mean_x_mean_u: 0.466072052717
valid_h0_mean_x_min_u: 0.0875690802932
valid_h0_min_x_max_u: 0.382509231567
valid_h0_min_x_mean_u: 0.0332807153463
valid_h0_min_x_min_u: 6.19569395788e-11
valid_h0_row_norms_max: 6.31191539764
valid_h0_row_norms_mean: 3.11876320839
valid_h0_row_norms_min: 0.114646181464
valid_objective: 0.0832240283489
valid_y_col_norms_max: 8.59417057037
valid_y_col_norms_mean: 7.40912103653
valid_y_col_norms_min: 5.81003856659
valid_y_max_max_class: 0.999999761581
valid_y_mean_max_class: 0.970567047596
valid_y_min_max_class: 0.264748305082
valid_y_misclass: 0.023999998346
valid_y_nll: 0.0832240283489
valid_y_row_norms_max: 3.15858983994
valid_y_row_norms_mean: 0.950569629669
valid_y_row_norms_min: 0.144145652652
Time this epoch: 35.537865 seconds
Monitoring step:
Epochs seen: 25
Batches seen: 125
Examples seen: 1250000
ave_grad_mult: 2.61537218094
ave_grad_size: 0.0366696789861
ave_step_size: 0.0890378654003
test_h0_col_norms_max: 6.39218759537
test_h0_col_norms_mean: 3.98763632774
test_h0_col_norms_min: 2.1254658699
test_h0_max_x_max_u: 1.0
test_h0_max_x_mean_u: 0.958310782909
test_h0_max_x_min_u: 0.550610423088
test_h0_mean_x_max_u: 0.936276137829
test_h0_mean_x_mean_u: 0.46598726511
test_h0_mean_x_min_u: 0.0805417820811
test_h0_min_x_max_u: 0.375468403101
test_h0_min_x_mean_u: 0.0328881442547
test_h0_min_x_min_u: 7.84403653142e-11
test_h0_row_norms_max: 6.33220767975
test_h0_row_norms_mean: 3.1257724762
test_h0_row_norms_min: 0.116853624582
test_objective: 0.0754533782601
test_y_col_norms_max: 8.81067371368
test_y_col_norms_mean: 7.60889148712
test_y_col_norms_min: 5.96597194672
test_y_max_max_class: 0.999999761581
test_y_mean_max_class: 0.971746265888
test_y_min_max_class: 0.288777351379
test_y_misclass: 0.0232999995351
test_y_nll: 0.0754533782601
test_y_row_norms_max: 3.24519085884
test_y_row_norms_mean: 0.975342810154
test_y_row_norms_min: 0.148321658373
train_h0_col_norms_max: 6.3921880722
train_h0_col_norms_mean: 3.98763632774
train_h0_col_norms_min: 2.1254658699
train_h0_max_x_max_u: 0.999999940395
train_h0_max_x_mean_u: 0.959453582764
train_h0_max_x_min_u: 0.561280608177
train_h0_mean_x_max_u: 0.929621398449
train_h0_mean_x_mean_u: 0.466242551804
train_h0_mean_x_min_u: 0.0841432288289
train_h0_min_x_max_u: 0.38017898798
train_h0_min_x_mean_u: 0.0322615392506
train_h0_min_x_min_u: 1.0068777756e-10
train_h0_row_norms_max: 6.33220720291
train_h0_row_norms_mean: 3.1257724762
train_h0_row_norms_min: 0.116853624582
train_objective: 0.043993473053
train_y_col_norms_max: 8.81067276001
train_y_col_norms_mean: 7.60889053345
train_y_col_norms_min: 5.96597194672
train_y_max_max_class: 0.999999821186
train_y_mean_max_class: 0.974251687527
train_y_min_max_class: 0.270618349314
train_y_misclass: 0.0104199992493
train_y_nll: 0.043993473053
train_y_row_norms_max: 3.24519062042
train_y_row_norms_mean: 0.975342690945
train_y_row_norms_min: 0.148321658373
valid_h0_col_norms_max: 6.39218759537
valid_h0_col_norms_mean: 3.98763632774
valid_h0_col_norms_min: 2.1254658699
valid_h0_max_x_max_u: 1.0
valid_h0_max_x_mean_u: 0.958616793156
valid_h0_max_x_min_u: 0.574836432934
valid_h0_mean_x_max_u: 0.931737542152
valid_h0_mean_x_mean_u: 0.466278731823
valid_h0_mean_x_min_u: 0.0861588418484
valid_h0_min_x_max_u: 0.383438080549
valid_h0_min_x_mean_u: 0.032565869391
valid_h0_min_x_min_u: 7.15359646519e-11
valid_h0_row_norms_max: 6.33220767975
valid_h0_row_norms_mean: 3.1257724762
valid_h0_row_norms_min: 0.116853624582
valid_objective: 0.0792490914464
valid_y_col_norms_max: 8.81067371368
valid_y_col_norms_mean: 7.60889148712
valid_y_col_norms_min: 5.96597194672
valid_y_max_max_class: 0.999999821186
valid_y_mean_max_class: 0.972301781178
valid_y_min_max_class: 0.278648257256
valid_y_misclass: 0.0228000003844
valid_y_nll: 0.0792490914464
valid_y_row_norms_max: 3.24519085884
valid_y_row_norms_mean: 0.975342810154
valid_y_row_norms_min: 0.148321658373
Time this epoch: 35.095306 seconds
Monitoring step:
Epochs seen: 26
Batches seen: 130
Examples seen: 1300000
ave_grad_mult: 2.71106290817
ave_grad_size: 0.0348753891885
ave_step_size: 0.0883127823472
test_h0_col_norms_max: 6.4017291069
test_h0_col_norms_mean: 3.99520802498
test_h0_col_norms_min: 2.12854385376
test_h0_max_x_max_u: 1.0
test_h0_max_x_mean_u: 0.959264814854
test_h0_max_x_min_u: 0.55078792572
test_h0_mean_x_max_u: 0.935866773129
test_h0_mean_x_mean_u: 0.466203004122
test_h0_mean_x_min_u: 0.078706741333
test_h0_min_x_max_u: 0.367448121309
test_h0_min_x_mean_u: 0.0321592055261
test_h0_min_x_min_u: 4.60236952715e-11
test_h0_row_norms_max: 6.34829235077
test_h0_row_norms_mean: 3.13210654259
test_h0_row_norms_min: 0.118406176567
test_objective: 0.0733289569616
test_y_col_norms_max: 9.00574874878
test_y_col_norms_mean: 7.78995084763
test_y_col_norms_min: 6.10382938385
test_y_max_max_class: 0.999999761581
test_y_mean_max_class: 0.972537279129
test_y_min_max_class: 0.267330288887
test_y_misclass: 0.0230999998748
test_y_nll: 0.0733289569616
test_y_row_norms_max: 3.33722496033
test_y_row_norms_mean: 0.997784733772
test_y_row_norms_min: 0.151363104582
train_h0_col_norms_max: 6.40172863007
train_h0_col_norms_mean: 3.9952082634
train_h0_col_norms_min: 2.12854361534
train_h0_max_x_max_u: 0.999999940395
train_h0_max_x_mean_u: 0.960306882858
train_h0_max_x_min_u: 0.562210321426
train_h0_mean_x_max_u: 0.929156780243
train_h0_mean_x_mean_u: 0.466451466084
train_h0_mean_x_min_u: 0.0823357179761
train_h0_min_x_max_u: 0.377736270428
train_h0_min_x_mean_u: 0.0315755605698
train_h0_min_x_min_u: 5.56277697517e-11
train_h0_row_norms_max: 6.34829139709
train_h0_row_norms_mean: 3.13210630417
train_h0_row_norms_min: 0.118406184018
train_objective: 0.0409014374018
train_y_col_norms_max: 9.00574874878
train_y_col_norms_mean: 7.78995037079
train_y_col_norms_min: 6.10382938385
train_y_max_max_class: 0.999999821186
train_y_mean_max_class: 0.975649058819
train_y_min_max_class: 0.290484070778
train_y_misclass: 0.00971999950707
train_y_nll: 0.0409014374018
train_y_row_norms_max: 3.33722496033
train_y_row_norms_mean: 0.997784733772
train_y_row_norms_min: 0.151363104582
valid_h0_col_norms_max: 6.4017291069
valid_h0_col_norms_mean: 3.99520802498
valid_h0_col_norms_min: 2.12854385376
valid_h0_max_x_max_u: 1.0
valid_h0_max_x_mean_u: 0.959495425224
valid_h0_max_x_min_u: 0.576115489006
valid_h0_mean_x_max_u: 0.931356489658
valid_h0_mean_x_mean_u: 0.466476142406
valid_h0_mean_x_min_u: 0.084395840764
valid_h0_min_x_max_u: 0.37360149622
valid_h0_min_x_mean_u: 0.0320250503719
valid_h0_min_x_min_u: 4.11951479873e-11
valid_h0_row_norms_max: 6.34829235077
valid_h0_row_norms_mean: 3.13210654259
valid_h0_row_norms_min: 0.118406176567
valid_objective: 0.0791732370853
valid_y_col_norms_max: 9.00574874878
valid_y_col_norms_mean: 7.78995084763
valid_y_col_norms_min: 6.10382938385
valid_y_max_max_class: 0.999999821186
valid_y_mean_max_class: 0.973247587681
valid_y_min_max_class: 0.254454284906
valid_y_misclass: 0.0232999995351
valid_y_nll: 0.0791732370853
valid_y_row_norms_max: 3.33722496033
valid_y_row_norms_mean: 0.997784733772
valid_y_row_norms_min: 0.151363104582
Time this epoch: 35.406078 seconds
Monitoring step:
Epochs seen: 27
Batches seen: 135
Examples seen: 1350000
ave_grad_mult: 2.80285286903
ave_grad_size: 0.0334513224661
ave_step_size: 0.088192678988
test_h0_col_norms_max: 6.41177082062
test_h0_col_norms_mean: 4.00275707245
test_h0_col_norms_min: 2.13091373444
test_h0_max_x_max_u: 1.0
test_h0_max_x_mean_u: 0.960214793682
test_h0_max_x_min_u: 0.547308385372
test_h0_mean_x_max_u: 0.936411142349
test_h0_mean_x_mean_u: 0.466299444437
test_h0_mean_x_min_u: 0.0786798894405
test_h0_min_x_max_u: 0.366961061954
test_h0_min_x_mean_u: 0.0315478779376
test_h0_min_x_min_u: 4.00019496694e-11
test_h0_row_norms_max: 6.36605072021
test_h0_row_norms_mean: 3.13841247559
test_h0_row_norms_min: 0.119428776205
test_objective: 0.0725825279951
test_y_col_norms_max: 9.19264411926
test_y_col_norms_mean: 7.96643924713
test_y_col_norms_min: 6.2465171814
test_y_max_max_class: 0.999999821186
test_y_mean_max_class: 0.974697828293
test_y_min_max_class: 0.284473180771
test_y_misclass: 0.0219000000507
test_y_nll: 0.0725825279951
test_y_row_norms_max: 3.41140413284
test_y_row_norms_mean: 1.01977562904
test_y_row_norms_min: 0.155052781105
train_h0_col_norms_max: 6.41176986694
train_h0_col_norms_mean: 4.00275659561
train_h0_col_norms_min: 2.13091373444
train_h0_max_x_max_u: 0.999999940395
train_h0_max_x_mean_u: 0.96123111248
train_h0_max_x_min_u: 0.556509852409
train_h0_mean_x_max_u: 0.929732501507
train_h0_mean_x_mean_u: 0.466540902853
train_h0_mean_x_min_u: 0.0823666229844
train_h0_min_x_max_u: 0.371994018555
train_h0_min_x_mean_u: 0.0308939814568
train_h0_min_x_min_u: 5.01665688157e-11
train_h0_row_norms_max: 6.36605024338
train_h0_row_norms_mean: 3.13841223717
train_h0_row_norms_min: 0.119428783655
train_objective: 0.0370035469532
train_y_col_norms_max: 9.19264411926
train_y_col_norms_mean: 7.96643972397
train_y_col_norms_min: 6.24651670456
train_y_max_max_class: 0.999999940395
train_y_mean_max_class: 0.977714180946
train_y_min_max_class: 0.2884734869
train_y_misclass: 0.00885999947786
train_y_nll: 0.0370035469532
train_y_row_norms_max: 3.41140389442
train_y_row_norms_mean: 1.01977562904
train_y_row_norms_min: 0.155052781105
valid_h0_col_norms_max: 6.41177082062
valid_h0_col_norms_mean: 4.00275707245
valid_h0_col_norms_min: 2.13091373444
valid_h0_max_x_max_u: 1.0
valid_h0_max_x_mean_u: 0.960400640965
valid_h0_max_x_min_u: 0.574073672295
valid_h0_mean_x_max_u: 0.931940674782
valid_h0_mean_x_mean_u: 0.466566413641
valid_h0_mean_x_min_u: 0.0844538062811
valid_h0_min_x_max_u: 0.369769692421
valid_h0_min_x_mean_u: 0.0312857404351
valid_h0_min_x_min_u: 3.6578275131e-11
valid_h0_row_norms_max: 6.36605072021
valid_h0_row_norms_mean: 3.13841247559
valid_h0_row_norms_min: 0.119428776205
valid_objective: 0.0765716135502
valid_y_col_norms_max: 9.19264411926
valid_y_col_norms_mean: 7.96643924713
valid_y_col_norms_min: 6.2465171814
valid_y_max_max_class: 1.0
valid_y_mean_max_class: 0.974825143814
valid_y_min_max_class: 0.268961429596
valid_y_misclass: 0.0228999983519
valid_y_nll: 0.0765716135502
valid_y_row_norms_max: 3.41140413284
valid_y_row_norms_mean: 1.01977562904
valid_y_row_norms_min: 0.155052781105
Time this epoch: 34.780491 seconds
Monitoring step:
Epochs seen: 28
Batches seen: 140
Examples seen: 1400000
ave_grad_mult: 3.07722043991
ave_grad_size: 0.0323846936226
ave_step_size: 0.0927985981107
test_h0_col_norms_max: 6.42322683334
test_h0_col_norms_mean: 4.01182746887
test_h0_col_norms_min: 2.13467645645
test_h0_max_x_max_u: 1.0
test_h0_max_x_mean_u: 0.961281061172
test_h0_max_x_min_u: 0.549638330936
test_h0_mean_x_max_u: 0.939934492111
test_h0_mean_x_mean_u: 0.466522186995
test_h0_mean_x_min_u: 0.0823039337993
test_h0_min_x_max_u: 0.357347339392
test_h0_min_x_mean_u: 0.030734334141
test_h0_min_x_min_u: 5.08886266459e-11
test_h0_row_norms_max: 6.38524675369
test_h0_row_norms_mean: 3.1459903717
test_h0_row_norms_min: 0.121352598071
test_objective: 0.0716430544853
test_y_col_norms_max: 9.41203117371
test_y_col_norms_mean: 8.17550086975
test_y_col_norms_min: 6.39991140366
test_y_max_max_class: 0.999999821186
test_y_mean_max_class: 0.974777877331
test_y_min_max_class: 0.270264923573
test_y_misclass: 0.0223999992013
test_y_nll: 0.0716430544853
test_y_row_norms_max: 3.5011806488
test_y_row_norms_mean: 1.04629290104
test_y_row_norms_min: 0.159884780645
train_h0_col_norms_max: 6.42322635651
train_h0_col_norms_mean: 4.01182699203
train_h0_col_norms_min: 2.13467645645
train_h0_max_x_max_u: 0.999999940395
train_h0_max_x_mean_u: 0.962265074253
train_h0_max_x_min_u: 0.556583762169
train_h0_mean_x_max_u: 0.93360298872
train_h0_mean_x_mean_u: 0.4667532444
train_h0_mean_x_min_u: 0.0861736312509
train_h0_min_x_max_u: 0.366083562374
train_h0_min_x_mean_u: 0.0301264487207
train_h0_min_x_min_u: 6.59544779902e-11
train_h0_row_norms_max: 6.38524627686
train_h0_row_norms_mean: 3.1459903717
train_h0_row_norms_min: 0.121352590621
train_objective: 0.0347100757062
train_y_col_norms_max: 9.41203117371
train_y_col_norms_mean: 8.17550086975
train_y_col_norms_min: 6.39991092682
train_y_max_max_class: 0.999999940395
train_y_mean_max_class: 0.978323638439
train_y_min_max_class: 0.29167419672
train_y_misclass: 0.00763999950141
train_y_nll: 0.0347100757062
train_y_row_norms_max: 3.50118041039
train_y_row_norms_mean: 1.04629290104
train_y_row_norms_min: 0.159884765744
valid_h0_col_norms_max: 6.42322683334
valid_h0_col_norms_mean: 4.01182746887
valid_h0_col_norms_min: 2.13467645645
valid_h0_max_x_max_u: 1.0
valid_h0_max_x_mean_u: 0.961273550987
valid_h0_max_x_min_u: 0.570200264454
valid_h0_mean_x_max_u: 0.935558438301
valid_h0_mean_x_mean_u: 0.466782003641
valid_h0_mean_x_min_u: 0.0883660390973
valid_h0_min_x_max_u: 0.355543404818
valid_h0_min_x_mean_u: 0.0304867494851
valid_h0_min_x_min_u: 4.62212004781e-11
valid_h0_row_norms_max: 6.38524675369
valid_h0_row_norms_mean: 3.1459903717
valid_h0_row_norms_min: 0.121352598071
valid_objective: 0.0746665000916
valid_y_col_norms_max: 9.41203117371
valid_y_col_norms_mean: 8.17550086975
valid_y_col_norms_min: 6.39991140366
valid_y_max_max_class: 0.999999821186
valid_y_mean_max_class: 0.975300252438
valid_y_min_max_class: 0.280865699053
valid_y_misclass: 0.0222999975085
valid_y_nll: 0.0746665000916
valid_y_row_norms_max: 3.5011806488
valid_y_row_norms_mean: 1.04629290104
valid_y_row_norms_min: 0.159884780645
Time this epoch: 35.278322 seconds
Monitoring step:
Epochs seen: 29
Batches seen: 145
Examples seen: 1450000
ave_grad_mult: 3.31815242767
ave_grad_size: 0.0319525785744
ave_step_size: 0.0989938527346
test_h0_col_norms_max: 6.43632364273
test_h0_col_norms_mean: 4.02131462097
test_h0_col_norms_min: 2.13684439659
test_h0_max_x_max_u: 1.0
test_h0_max_x_mean_u: 0.96216905117
test_h0_max_x_min_u: 0.548579275608
test_h0_mean_x_max_u: 0.941554307938
test_h0_mean_x_mean_u: 0.46652469039
test_h0_mean_x_min_u: 0.0799239650369
test_h0_min_x_max_u: 0.357737779617
test_h0_min_x_mean_u: 0.0300854835659
test_h0_min_x_min_u: 2.19761518011e-11
test_h0_row_norms_max: 6.40747022629
test_h0_row_norms_mean: 3.15389037132
test_h0_row_norms_min: 0.123720750213
test_objective: 0.0690323263407
test_y_col_norms_max: 9.64226436615
test_y_col_norms_mean: 8.3891248703
test_y_col_norms_min: 6.57722139359
test_y_max_max_class: 1.0
test_y_mean_max_class: 0.976467430592
test_y_min_max_class: 0.293188840151
test_y_misclass: 0.0212999973446
test_y_nll: 0.0690323263407
test_y_row_norms_max: 3.5816681385
test_y_row_norms_mean: 1.07294213772
test_y_row_norms_min: 0.162085324526
train_h0_col_norms_max: 6.43632364273
train_h0_col_norms_mean: 4.02131462097
train_h0_col_norms_min: 2.13684439659
train_h0_max_x_max_u: 0.999999940395
train_h0_max_x_mean_u: 0.963165700436
train_h0_max_x_min_u: 0.55665397644
train_h0_mean_x_max_u: 0.935373008251
train_h0_mean_x_mean_u: 0.466756403446
train_h0_mean_x_min_u: 0.0838716328144
train_h0_min_x_max_u: 0.36525374651
train_h0_min_x_mean_u: 0.0295039452612
train_h0_min_x_min_u: 2.68275124338e-11
train_h0_row_norms_max: 6.40747022629
train_h0_row_norms_mean: 3.15389037132
train_h0_row_norms_min: 0.123720750213
train_objective: 0.0306258164346
train_y_col_norms_max: 9.64226341248
train_y_col_norms_mean: 8.3891248703
train_y_col_norms_min: 6.57722091675
train_y_max_max_class: 0.999999940395
train_y_mean_max_class: 0.980324864388
train_y_min_max_class: 0.283443570137
train_y_misclass: 0.00647999951616
train_y_nll: 0.0306258164346
train_y_row_norms_max: 3.58166790009
train_y_row_norms_mean: 1.07294213772
train_y_row_norms_min: 0.162085309625
valid_h0_col_norms_max: 6.43632364273
valid_h0_col_norms_mean: 4.02131462097
valid_h0_col_norms_min: 2.13684439659
valid_h0_max_x_max_u: 1.0
valid_h0_max_x_mean_u: 0.962217152119
valid_h0_max_x_min_u: 0.571269631386
valid_h0_mean_x_max_u: 0.937248468399
valid_h0_mean_x_mean_u: 0.466767311096
valid_h0_mean_x_min_u: 0.0859551951289
valid_h0_min_x_max_u: 0.352124005556
valid_h0_min_x_mean_u: 0.0299664791673
valid_h0_min_x_min_u: 2.06817705323e-11
valid_h0_row_norms_max: 6.40747022629
valid_h0_row_norms_mean: 3.15389037132
valid_h0_row_norms_min: 0.123720750213
valid_objective: 0.0733132436872
valid_y_col_norms_max: 9.64226436615
valid_y_col_norms_mean: 8.3891248703
valid_y_col_norms_min: 6.57722139359
valid_y_max_max_class: 1.0
valid_y_mean_max_class: 0.976790785789
valid_y_min_max_class: 0.291347831488
valid_y_misclass: 0.0217000003904
valid_y_nll: 0.0733132436872
valid_y_row_norms_max: 3.5816681385
valid_y_row_norms_mean: 1.07294213772
valid_y_row_norms_min: 0.162085324526
Time this epoch: 35.082858 seconds
Monitoring step:
Epochs seen: 30
Batches seen: 150
Examples seen: 1500000
ave_grad_mult: 3.39051413536
ave_grad_size: 0.0302216522396
ave_step_size: 0.0965146124363
test_h0_col_norms_max: 6.44657659531
test_h0_col_norms_mean: 4.02922582626
test_h0_col_norms_min: 2.14045143127
test_h0_max_x_max_u: 1.0
test_h0_max_x_mean_u: 0.963280200958
test_h0_max_x_min_u: 0.548351347446
test_h0_mean_x_max_u: 0.940559446812
test_h0_mean_x_mean_u: 0.466341674328
test_h0_mean_x_min_u: 0.0799687504768
test_h0_min_x_max_u: 0.357499152422
test_h0_min_x_mean_u: 0.0293492469937
test_h0_min_x_min_u: 2.20450845079e-11
test_h0_row_norms_max: 6.4218788147
test_h0_row_norms_mean: 3.16045355797
test_h0_row_norms_min: 0.124831520021
test_objective: 0.0672078579664
test_y_col_norms_max: 9.82299423218
test_y_col_norms_mean: 8.56633377075
test_y_col_norms_min: 6.71553707123
test_y_max_max_class: 1.0
test_y_mean_max_class: 0.977503836155
test_y_min_max_class: 0.255842655897
test_y_misclass: 0.0198999978602
test_y_nll: 0.0672078579664
test_y_row_norms_max: 3.65550899506
test_y_row_norms_mean: 1.09536457062
test_y_row_norms_min: 0.163716614246
train_h0_col_norms_max: 6.44657611847
train_h0_col_norms_mean: 4.02922534943
train_h0_col_norms_min: 2.14045143127
train_h0_max_x_max_u: 0.999999940395
train_h0_max_x_mean_u: 0.964230835438
train_h0_max_x_min_u: 0.55356913805
train_h0_mean_x_max_u: 0.934283614159
train_h0_mean_x_mean_u: 0.466563820839
train_h0_mean_x_min_u: 0.0839123427868
train_h0_min_x_max_u: 0.358219176531
train_h0_min_x_mean_u: 0.0287974383682
train_h0_min_x_min_u: 2.7636832059e-11
train_h0_row_norms_max: 6.42187833786
train_h0_row_norms_mean: 3.16045331955
train_h0_row_norms_min: 0.12483151257
train_objective: 0.0282621402293
train_y_col_norms_max: 9.82299423218
train_y_col_norms_mean: 8.56633377075
train_y_col_norms_min: 6.71553659439
train_y_max_max_class: 0.999999940395
train_y_mean_max_class: 0.981470048428
train_y_min_max_class: 0.304827183485
train_y_misclass: 0.00591999944299
train_y_nll: 0.0282621402293
train_y_row_norms_max: 3.65550875664
train_y_row_norms_mean: 1.09536445141
train_y_row_norms_min: 0.163716599345
valid_h0_col_norms_max: 6.44657659531
valid_h0_col_norms_mean: 4.02922582626
valid_h0_col_norms_min: 2.14045143127
valid_h0_max_x_max_u: 1.0
valid_h0_max_x_mean_u: 0.963276088238
valid_h0_max_x_min_u: 0.567677497864
valid_h0_mean_x_max_u: 0.936259627342
valid_h0_mean_x_mean_u: 0.466583073139
valid_h0_mean_x_min_u: 0.0857717692852
valid_h0_min_x_max_u: 0.344594448805
valid_h0_min_x_mean_u: 0.0292918123305
valid_h0_min_x_min_u: 2.15348190669e-11
valid_h0_row_norms_max: 6.4218788147
valid_h0_row_norms_mean: 3.16045355797
valid_h0_row_norms_min: 0.124831520021
valid_objective: 0.0722089111805
valid_y_col_norms_max: 9.82299423218
valid_y_col_norms_mean: 8.56633377075
valid_y_col_norms_min: 6.71553707123
valid_y_max_max_class: 1.0
valid_y_mean_max_class: 0.977750241756
valid_y_min_max_class: 0.297483742237
valid_y_misclass: 0.021099999547
valid_y_nll: 0.0722089111805
valid_y_row_norms_max: 3.65550899506
valid_y_row_norms_mean: 1.09536457062
valid_y_row_norms_min: 0.163716614246
Time this epoch: 35.012923 seconds
Monitoring step:
Epochs seen: 31
Batches seen: 155
Examples seen: 1550000
ave_grad_mult: 3.48831152916
ave_grad_size: 0.0287408661097
ave_step_size: 0.0940494984388
test_h0_col_norms_max: 6.45725250244
test_h0_col_norms_mean: 4.03711128235
test_h0_col_norms_min: 2.14304447174
test_h0_max_x_max_u: 1.0
test_h0_max_x_mean_u: 0.963537156582
test_h0_max_x_min_u: 0.54846316576
test_h0_mean_x_max_u: 0.940077364445
test_h0_mean_x_mean_u: 0.466366380453
test_h0_mean_x_min_u: 0.0763043165207
test_h0_min_x_max_u: 0.358212590218
test_h0_min_x_mean_u: 0.0289474800229
test_h0_min_x_min_u: 2.33178042847e-11
test_h0_row_norms_max: 6.44061088562
test_h0_row_norms_mean: 3.16697835922
test_h0_row_norms_min: 0.125991553068
test_objective: 0.0662763118744
test_y_col_norms_max: 10.0062093735
test_y_col_norms_mean: 8.73837566376
test_y_col_norms_min: 6.84926891327
test_y_max_max_class: 1.0
test_y_mean_max_class: 0.978273510933
test_y_min_max_class: 0.285628795624
test_y_misclass: 0.0193999987096
test_y_nll: 0.0662763118744
test_y_row_norms_max: 3.72132134438
test_y_row_norms_mean: 1.11716985703
test_y_row_norms_min: 0.168285727501
train_h0_col_norms_max: 6.45725250244
train_h0_col_norms_mean: 4.03711175919
train_h0_col_norms_min: 2.14304423332
train_h0_max_x_max_u: 0.999999940395
train_h0_max_x_mean_u: 0.964494287968
train_h0_max_x_min_u: 0.551624715328
train_h0_mean_x_max_u: 0.933692634106
train_h0_mean_x_mean_u: 0.466590344906
train_h0_mean_x_min_u: 0.0803528800607
train_h0_min_x_max_u: 0.359871029854
train_h0_min_x_mean_u: 0.0284454971552
train_h0_min_x_min_u: 3.10430431361e-11
train_h0_row_norms_max: 6.44061088562
train_h0_row_norms_mean: 3.16697835922
train_h0_row_norms_min: 0.125991553068
train_objective: 0.0254283007234
train_y_col_norms_max: 10.0062084198
train_y_col_norms_mean: 8.73837471008
train_y_col_norms_min: 6.84926795959
train_y_max_max_class: 0.999999940395
train_y_mean_max_class: 0.982455551624
train_y_min_max_class: 0.295360028744
train_y_misclass: 0.00481999944896
train_y_nll: 0.0254283007234
train_y_row_norms_max: 3.72132158279
train_y_row_norms_mean: 1.11716985703
train_y_row_norms_min: 0.168285742402
valid_h0_col_norms_max: 6.45725250244
valid_h0_col_norms_mean: 4.03711128235
valid_h0_col_norms_min: 2.14304447174
valid_h0_max_x_max_u: 1.0
valid_h0_max_x_mean_u: 0.96359193325
valid_h0_max_x_min_u: 0.566685140133
valid_h0_mean_x_max_u: 0.935809135437
valid_h0_mean_x_mean_u: 0.466593444347
valid_h0_mean_x_min_u: 0.0823497697711
valid_h0_min_x_max_u: 0.342845439911
valid_h0_min_x_mean_u: 0.0289610140026
valid_h0_min_x_min_u: 2.35798464088e-11
valid_h0_row_norms_max: 6.44061088562
valid_h0_row_norms_mean: 3.16697835922
valid_h0_row_norms_min: 0.125991553068
valid_objective: 0.0720023438334
valid_y_col_norms_max: 10.0062093735
valid_y_col_norms_mean: 8.73837566376
valid_y_col_norms_min: 6.84926891327
valid_y_max_max_class: 1.0
valid_y_mean_max_class: 0.978184223175
valid_y_min_max_class: 0.33930772543
valid_y_misclass: 0.0208999998868
valid_y_nll: 0.0720023438334
valid_y_row_norms_max: 3.72132134438
valid_y_row_norms_mean: 1.11716985703
valid_y_row_norms_min: 0.168285727501
Time this epoch: 35.375439 seconds
Monitoring step:
Epochs seen: 32
Batches seen: 160
Examples seen: 1600000
ave_grad_mult: 3.63046574593
ave_grad_size: 0.0268990695477
ave_step_size: 0.091931194067
test_h0_col_norms_max: 6.46750497818
test_h0_col_norms_mean: 4.04480981827
test_h0_col_norms_min: 2.1463572979
test_h0_max_x_max_u: 1.0
test_h0_max_x_mean_u: 0.96414077282
test_h0_max_x_min_u: 0.551202893257
test_h0_mean_x_max_u: 0.938820004463
test_h0_mean_x_mean_u: 0.466768145561
test_h0_mean_x_min_u: 0.0774406716228
test_h0_min_x_max_u: 0.35613399744
test_h0_min_x_mean_u: 0.0284414924681
test_h0_min_x_min_u: 2.09623499114e-11
test_h0_row_norms_max: 6.45651197433
test_h0_row_norms_mean: 3.17332720757
test_h0_row_norms_min: 0.126796171069
test_objective: 0.0663162916899
test_y_col_norms_max: 10.1816034317
test_y_col_norms_mean: 8.90622425079
test_y_col_norms_min: 6.98152685165
test_y_max_max_class: 1.0
test_y_mean_max_class: 0.978770077229
test_y_min_max_class: 0.294499635696
test_y_misclass: 0.0207000002265
test_y_nll: 0.0663162916899
test_y_row_norms_max: 3.78270602226
test_y_row_norms_mean: 1.13857710361
test_y_row_norms_min: 0.170636937022
train_h0_col_norms_max: 6.46750450134
train_h0_col_norms_mean: 4.04480981827
train_h0_col_norms_min: 2.1463572979
train_h0_max_x_max_u: 0.999999940395
train_h0_max_x_mean_u: 0.965125858784
train_h0_max_x_min_u: 0.553418159485
train_h0_mean_x_max_u: 0.932287812233
train_h0_mean_x_mean_u: 0.466983139515
train_h0_mean_x_min_u: 0.0815795511007
train_h0_min_x_max_u: 0.351988613605
train_h0_min_x_mean_u: 0.0279593002051
train_h0_min_x_min_u: 2.79454966112e-11
train_h0_row_norms_max: 6.4565114975
train_h0_row_norms_mean: 3.17332744598
train_h0_row_norms_min: 0.126796171069
train_objective: 0.0232511665672
train_y_col_norms_max: 10.1816034317
train_y_col_norms_mean: 8.90622425079
train_y_col_norms_min: 6.98152732849
train_y_max_max_class: 0.999999940395
train_y_mean_max_class: 0.98363161087
train_y_min_max_class: 0.305530905724
train_y_misclass: 0.00421999953687
train_y_nll: 0.0232511665672
train_y_row_norms_max: 3.78270626068
train_y_row_norms_mean: 1.13857698441
train_y_row_norms_min: 0.170636937022
valid_h0_col_norms_max: 6.46750497818
valid_h0_col_norms_mean: 4.04480981827
valid_h0_col_norms_min: 2.1463572979
valid_h0_max_x_max_u: 1.0
valid_h0_max_x_mean_u: 0.964271306992
valid_h0_max_x_min_u: 0.568406701088
valid_h0_mean_x_max_u: 0.934549808502
valid_h0_mean_x_mean_u: 0.466980487108
valid_h0_mean_x_min_u: 0.0836460664868
valid_h0_min_x_max_u: 0.336699128151
valid_h0_min_x_mean_u: 0.028556285426
valid_h0_min_x_min_u: 2.1281049839e-11
valid_h0_row_norms_max: 6.45651197433
valid_h0_row_norms_mean: 3.17332720757
valid_h0_row_norms_min: 0.126796171069
valid_objective: 0.0705031752586
valid_y_col_norms_max: 10.1816034317
valid_y_col_norms_mean: 8.90622425079
valid_y_col_norms_min: 6.98152685165
valid_y_max_max_class: 1.0
valid_y_mean_max_class: 0.978758752346
valid_y_min_max_class: 0.291737556458
valid_y_misclass: 0.0208999998868
valid_y_nll: 0.0705031752586
valid_y_row_norms_max: 3.78270602226
valid_y_row_norms_mean: 1.13857710361
valid_y_row_norms_min: 0.170636937022
Time this epoch: 35.379330 seconds
Monitoring step:
Epochs seen: 33
Batches seen: 165
Examples seen: 1650000
ave_grad_mult: 3.85002589226
ave_grad_size: 0.0255950912833
ave_step_size: 0.0920957773924
test_h0_col_norms_max: 6.47780418396
test_h0_col_norms_mean: 4.05291509628
test_h0_col_norms_min: 2.14965701103
test_h0_max_x_max_u: 1.0
test_h0_max_x_mean_u: 0.965208768845
test_h0_max_x_min_u: 0.553891956806
test_h0_mean_x_max_u: 0.941352784634
test_h0_mean_x_mean_u: 0.467216670513
test_h0_mean_x_min_u: 0.0769760459661
test_h0_min_x_max_u: 0.357422113419
test_h0_min_x_mean_u: 0.027681870386
test_h0_min_x_min_u: 1.6821729773e-11
test_h0_row_norms_max: 6.47196292877
test_h0_row_norms_mean: 3.18000507355
test_h0_row_norms_min: 0.127480790019
test_objective: 0.0658261179924
test_y_col_norms_max: 10.3589458466
test_y_col_norms_mean: 9.08145141602
test_y_col_norms_min: 7.12754154205
test_y_max_max_class: 1.0
test_y_mean_max_class: 0.980045855045
test_y_min_max_class: 0.275538861752
test_y_misclass: 0.019999999553
test_y_nll: 0.0658261179924
test_y_row_norms_max: 3.84528589249
test_y_row_norms_mean: 1.16080152988
test_y_row_norms_min: 0.173613965511
train_h0_col_norms_max: 6.47780418396
train_h0_col_norms_mean: 4.05291461945
train_h0_col_norms_min: 2.14965701103
train_h0_max_x_max_u: 0.999999940395
train_h0_max_x_mean_u: 0.966172575951
train_h0_max_x_min_u: 0.551838994026
train_h0_mean_x_max_u: 0.935074448586
train_h0_mean_x_mean_u: 0.46742233634
train_h0_mean_x_min_u: 0.0811282843351
train_h0_min_x_max_u: 0.346001476049
train_h0_min_x_mean_u: 0.0272373519838
train_h0_min_x_min_u: 2.30680283902e-11
train_h0_row_norms_max: 6.47196340561
train_h0_row_norms_mean: 3.18000459671
train_h0_row_norms_min: 0.127480790019
train_objective: 0.02110886015
train_y_col_norms_max: 10.3589458466
train_y_col_norms_mean: 9.08145141602
train_y_col_norms_min: 7.12754058838
train_y_max_max_class: 0.999999940395
train_y_mean_max_class: 0.984871923923
train_y_min_max_class: 0.292335510254
train_y_misclass: 0.00331999990158
train_y_nll: 0.02110886015
train_y_row_norms_max: 3.84528613091
train_y_row_norms_mean: 1.16080152988
train_y_row_norms_min: 0.173613965511
valid_h0_col_norms_max: 6.47780418396
valid_h0_col_norms_mean: 4.05291509628
valid_h0_col_norms_min: 2.14965701103
valid_h0_max_x_max_u: 1.0
valid_h0_max_x_mean_u: 0.965333819389
valid_h0_max_x_min_u: 0.567090988159
valid_h0_mean_x_max_u: 0.937143027782
valid_h0_mean_x_mean_u: 0.467420905828
valid_h0_mean_x_min_u: 0.0815225914121
valid_h0_min_x_max_u: 0.317524284124
valid_h0_min_x_mean_u: 0.0278288982809
valid_h0_min_x_min_u: 1.7383304865e-11
valid_h0_row_norms_max: 6.47196292877
valid_h0_row_norms_mean: 3.18000507355
valid_h0_row_norms_min: 0.127480790019
valid_objective: 0.0706555917859
valid_y_col_norms_max: 10.3589458466
valid_y_col_norms_mean: 9.08145141602
valid_y_col_norms_min: 7.12754154205
valid_y_max_max_class: 1.0
valid_y_mean_max_class: 0.979981780052
valid_y_min_max_class: 0.314534544945
valid_y_misclass: 0.0206000003964
valid_y_nll: 0.0706555917859
valid_y_row_norms_max: 3.84528589249
valid_y_row_norms_mean: 1.16080152988
valid_y_row_norms_min: 0.173613965511
Time this epoch: 35.182908 seconds
Monitoring step:
Epochs seen: 34
Batches seen: 170
Examples seen: 1700000
ave_grad_mult: 4.07905960083
ave_grad_size: 0.0242200661451
ave_step_size: 0.0924715399742
test_h0_col_norms_max: 6.48900747299
test_h0_col_norms_mean: 4.06139850616
test_h0_col_norms_min: 2.1522192955
test_h0_max_x_max_u: 1.0
test_h0_max_x_mean_u: 0.965900540352
test_h0_max_x_min_u: 0.551373183727
test_h0_mean_x_max_u: 0.942069590092
test_h0_mean_x_mean_u: 0.467438340187
test_h0_mean_x_min_u: 0.0787537544966
test_h0_min_x_max_u: 0.359593838453
test_h0_min_x_mean_u: 0.0271878745407
test_h0_min_x_min_u: 1.29720010775e-11
test_h0_row_norms_max: 6.49045753479
test_h0_row_norms_mean: 3.18700146675
test_h0_row_norms_min: 0.128459200263
test_objective: 0.0644877254963
test_y_col_norms_max: 10.5396261215
test_y_col_norms_mean: 9.26142787933
test_y_col_norms_min: 7.278901577
test_y_max_max_class: 1.0
test_y_mean_max_class: 0.98046040535
test_y_min_max_class: 0.25162255764
test_y_misclass: 0.0206000003964
test_y_nll: 0.0644877254963
test_y_row_norms_max: 3.90689897537
test_y_row_norms_mean: 1.18369758129
test_y_row_norms_min: 0.177592679858
train_h0_col_norms_max: 6.48900747299
train_h0_col_norms_mean: 4.06139850616
train_h0_col_norms_min: 2.15221905708
train_h0_max_x_max_u: 0.999999940395
train_h0_max_x_mean_u: 0.966917276382
train_h0_max_x_min_u: 0.551108419895
train_h0_mean_x_max_u: 0.935860812664
train_h0_mean_x_mean_u: 0.467652916908
train_h0_mean_x_min_u: 0.0830486863852
train_h0_min_x_max_u: 0.34869286418
train_h0_min_x_mean_u: 0.0267758108675
train_h0_min_x_min_u: 1.72074004351e-11
train_h0_row_norms_max: 6.49045705795
train_h0_row_norms_mean: 3.18700098991
train_h0_row_norms_min: 0.128459185362
train_objective: 0.0193602163345
train_y_col_norms_max: 10.5396251678
train_y_col_norms_mean: 9.26142692566
train_y_col_norms_min: 7.27890205383
train_y_max_max_class: 0.999999940395
train_y_mean_max_class: 0.985767424107
train_y_min_max_class: 0.336476325989
train_y_misclass: 0.00289999973029
train_y_nll: 0.0193602163345
train_y_row_norms_max: 3.90689897537
train_y_row_norms_mean: 1.18369758129
train_y_row_norms_min: 0.177592664957
valid_h0_col_norms_max: 6.48900747299
valid_h0_col_norms_mean: 4.06139850616
valid_h0_col_norms_min: 2.1522192955
valid_h0_max_x_max_u: 1.0
valid_h0_max_x_mean_u: 0.966026246548
valid_h0_max_x_min_u: 0.568945586681
valid_h0_mean_x_max_u: 0.937908649445
valid_h0_mean_x_mean_u: 0.467648357153
valid_h0_mean_x_min_u: 0.0835038796067
valid_h0_min_x_max_u: 0.32678771019
valid_h0_min_x_mean_u: 0.0274151265621
valid_h0_min_x_min_u: 1.33190177637e-11
valid_h0_row_norms_max: 6.49045753479
valid_h0_row_norms_mean: 3.18700146675
valid_h0_row_norms_min: 0.128459200263
valid_objective: 0.0707407668233
valid_y_col_norms_max: 10.5396261215
valid_y_col_norms_mean: 9.26142787933
valid_y_col_norms_min: 7.278901577
valid_y_max_max_class: 1.0
valid_y_mean_max_class: 0.980556607246
valid_y_min_max_class: 0.298538506031
valid_y_misclass: 0.0219000000507
valid_y_nll: 0.0707407668233
valid_y_row_norms_max: 3.90689897537
valid_y_row_norms_mean: 1.18369758129
valid_y_row_norms_min: 0.177592679858
Time this epoch: 35.400439 seconds
Monitoring step:
Epochs seen: 35
Batches seen: 175
Examples seen: 1750000
ave_grad_mult: 4.3184633255
ave_grad_size: 0.022776318714
ave_step_size: 0.0920493155718
test_h0_col_norms_max: 6.49970197678
test_h0_col_norms_mean: 4.06945180893
test_h0_col_norms_min: 2.15374016762
test_h0_max_x_max_u: 1.0
test_h0_max_x_mean_u: 0.966324806213
test_h0_max_x_min_u: 0.55324202776
test_h0_mean_x_max_u: 0.94215297699
test_h0_mean_x_mean_u: 0.466998904943
test_h0_mean_x_min_u: 0.076045922935
test_h0_min_x_max_u: 0.358031690121
test_h0_min_x_mean_u: 0.0268286950886
test_h0_min_x_min_u: 1.09078423377e-11
test_h0_row_norms_max: 6.50839042664
test_h0_row_norms_mean: 3.19361257553
test_h0_row_norms_min: 0.129599049687
test_objective: 0.0635969266295
test_y_col_norms_max: 10.7156534195
test_y_col_norms_mean: 9.42882728577
test_y_col_norms_min: 7.41432905197
test_y_max_max_class: 1.0
test_y_mean_max_class: 0.980757176876
test_y_min_max_class: 0.248226299882
test_y_misclass: 0.0193999987096
test_y_nll: 0.0635969266295
test_y_row_norms_max: 3.96717524529
test_y_row_norms_mean: 1.20508480072
test_y_row_norms_min: 0.180099412799
train_h0_col_norms_max: 6.4997010231
train_h0_col_norms_mean: 4.06945180893
train_h0_col_norms_min: 2.1537399292
train_h0_max_x_max_u: 0.999999940395
train_h0_max_x_mean_u: 0.967268288136
train_h0_max_x_min_u: 0.551735162735
train_h0_mean_x_max_u: 0.935940921307
train_h0_mean_x_mean_u: 0.467215240002
train_h0_mean_x_min_u: 0.0804425179958
train_h0_min_x_max_u: 0.343524694443
train_h0_min_x_mean_u: 0.0264016315341
train_h0_min_x_min_u: 1.46074211754e-11
train_h0_row_norms_max: 6.50839090347
train_h0_row_norms_mean: 3.19361257553
train_h0_row_norms_min: 0.129599064589
train_objective: 0.0171565413475
train_y_col_norms_max: 10.7156524658
train_y_col_norms_mean: 9.42882633209
train_y_col_norms_min: 7.41432905197
train_y_max_max_class: 0.999999940395
train_y_mean_max_class: 0.986733615398
train_y_min_max_class: 0.337424963713
train_y_misclass: 0.00196000002325
train_y_nll: 0.0171565413475
train_y_row_norms_max: 3.96717500687
train_y_row_norms_mean: 1.20508468151
train_y_row_norms_min: 0.1800994277
valid_h0_col_norms_max: 6.49970197678
valid_h0_col_norms_mean: 4.06945180893
valid_h0_col_norms_min: 2.15374016762
valid_h0_max_x_max_u: 1.0
valid_h0_max_x_mean_u: 0.966409444809
valid_h0_max_x_min_u: 0.564903616905
valid_h0_mean_x_max_u: 0.938026130199
valid_h0_mean_x_mean_u: 0.467201501131
valid_h0_mean_x_min_u: 0.0823206305504
valid_h0_min_x_max_u: 0.31958258152
valid_h0_min_x_mean_u: 0.0270514041185
valid_h0_min_x_min_u: 1.14466379084e-11
valid_h0_row_norms_max: 6.50839042664
valid_h0_row_norms_mean: 3.19361257553
valid_h0_row_norms_min: 0.129599049687
valid_objective: 0.0689148977399
valid_y_col_norms_max: 10.7156534195
valid_y_col_norms_mean: 9.42882728577
valid_y_col_norms_min: 7.41432905197
valid_y_max_max_class: 1.0
valid_y_mean_max_class: 0.980640649796
valid_y_min_max_class: 0.264637023211
valid_y_misclass: 0.021099999547
valid_y_nll: 0.0689148977399
valid_y_row_norms_max: 3.96717524529
valid_y_row_norms_mean: 1.20508480072
valid_y_row_norms_min: 0.180099412799
Time this epoch: 35.392445 seconds
Monitoring step:
Epochs seen: 36
Batches seen: 180
Examples seen: 1800000
ave_grad_mult: 4.55049180984
ave_grad_size: 0.0215135067701
ave_step_size: 0.0914682373405
test_h0_col_norms_max: 6.5103468895
test_h0_col_norms_mean: 4.07773399353
test_h0_col_norms_min: 2.15385961533
test_h0_max_x_max_u: 1.0
test_h0_max_x_mean_u: 0.967212915421
test_h0_max_x_min_u: 0.559629559517
test_h0_mean_x_max_u: 0.943103969097
test_h0_mean_x_mean_u: 0.466701477766
test_h0_mean_x_min_u: 0.0809521302581
test_h0_min_x_max_u: 0.355958789587
test_h0_min_x_mean_u: 0.0261587612331
test_h0_min_x_min_u: 8.34188967902e-12
test_h0_row_norms_max: 6.52430438995
test_h0_row_norms_mean: 3.20039582253
test_h0_row_norms_min: 0.130876362324
test_objective: 0.0621786899865
test_y_col_norms_max: 10.8925733566
test_y_col_norms_mean: 9.60350131989
test_y_col_norms_min: 7.55749177933
test_y_max_max_class: 1.0
test_y_mean_max_class: 0.981753587723
test_y_min_max_class: 0.330662488937
test_y_misclass: 0.0190999973565
test_y_nll: 0.0621786899865
test_y_row_norms_max: 4.02781057358
test_y_row_norms_mean: 1.22741234303
test_y_row_norms_min: 0.181874185801
train_h0_col_norms_max: 6.51034593582
train_h0_col_norms_mean: 4.07773399353
train_h0_col_norms_min: 2.15385937691
train_h0_max_x_max_u: 0.999999940395
train_h0_max_x_mean_u: 0.968181371689
train_h0_max_x_min_u: 0.555752873421
train_h0_mean_x_max_u: 0.936968684196
train_h0_mean_x_mean_u: 0.466913819313
train_h0_mean_x_min_u: 0.0854497775435
train_h0_min_x_max_u: 0.338039875031
train_h0_min_x_mean_u: 0.0257331542671
train_h0_min_x_min_u: 1.09208597027e-11
train_h0_row_norms_max: 6.52430438995
train_h0_row_norms_mean: 3.20039534569
train_h0_row_norms_min: 0.130876347423
train_objective: 0.0157043337822
train_y_col_norms_max: 10.892572403
train_y_col_norms_mean: 9.60350131989
train_y_col_norms_min: 7.55749130249
train_y_max_max_class: 0.999999940395
train_y_mean_max_class: 0.987937808037
train_y_min_max_class: 0.323045521975
train_y_misclass: 0.00203999993391
train_y_nll: 0.0157043337822
train_y_row_norms_max: 4.02781057358
train_y_row_norms_mean: 1.22741222382
train_y_row_norms_min: 0.181874185801
valid_h0_col_norms_max: 6.5103468895
valid_h0_col_norms_mean: 4.07773399353
valid_h0_col_norms_min: 2.15385961533
valid_h0_max_x_max_u: 1.0
valid_h0_max_x_mean_u: 0.967329084873
valid_h0_max_x_min_u: 0.567208707333
valid_h0_mean_x_max_u: 0.939025402069
valid_h0_mean_x_mean_u: 0.466897398233
valid_h0_mean_x_min_u: 0.0875384286046
valid_h0_min_x_max_u: 0.311605006456
valid_h0_min_x_mean_u: 0.0264036990702
valid_h0_min_x_min_u: 8.81185142215e-12
valid_h0_row_norms_max: 6.52430438995
valid_h0_row_norms_mean: 3.20039582253
valid_h0_row_norms_min: 0.130876362324
valid_objective: 0.0682094246149
valid_y_col_norms_max: 10.8925733566
valid_y_col_norms_mean: 9.60350131989
valid_y_col_norms_min: 7.55749177933
valid_y_max_max_class: 1.0
valid_y_mean_max_class: 0.982199847698
valid_y_min_max_class: 0.324392050505
valid_y_misclass: 0.0208000000566
valid_y_nll: 0.0682094246149
valid_y_row_norms_max: 4.02781057358
valid_y_row_norms_mean: 1.22741234303
valid_y_row_norms_min: 0.181874185801
Time this epoch: 34.710048 seconds
Monitoring step:
Epochs seen: 37
Batches seen: 185
Examples seen: 1850000
ave_grad_mult: 4.72839355469
ave_grad_size: 0.0204669237137
ave_step_size: 0.0900116711855
test_h0_col_norms_max: 6.52083969116
test_h0_col_norms_mean: 4.08554124832
test_h0_col_norms_min: 2.15409636497
test_h0_max_x_max_u: 1.0
test_h0_max_x_mean_u: 0.967603981495
test_h0_max_x_min_u: 0.557713389397
test_h0_mean_x_max_u: 0.943118810654
test_h0_mean_x_mean_u: 0.466896891594
test_h0_mean_x_min_u: 0.0787230879068
test_h0_min_x_max_u: 0.356404840946
test_h0_min_x_mean_u: 0.0257684588432
test_h0_min_x_min_u: 9.0589402299e-12
test_h0_row_norms_max: 6.53848934174
test_h0_row_norms_mean: 3.206792593
test_h0_row_norms_min: 0.131754085422
test_objective: 0.0623081922531
test_y_col_norms_max: 11.052611351
test_y_col_norms_mean: 9.76351451874
test_y_col_norms_min: 7.68663883209
test_y_max_max_class: 1.0
test_y_mean_max_class: 0.982231199741
test_y_min_max_class: 0.287253022194
test_y_misclass: 0.0188999995589
test_y_nll: 0.0623081922531
test_y_row_norms_max: 4.08399629593
test_y_row_norms_mean: 1.24770605564
test_y_row_norms_min: 0.185720145702
train_h0_col_norms_max: 6.52083921432
train_h0_col_norms_mean: 4.08554124832
train_h0_col_norms_min: 2.15409636497
train_h0_max_x_max_u: 0.999999940395
train_h0_max_x_mean_u: 0.968553900719
train_h0_max_x_min_u: 0.553573608398
train_h0_mean_x_max_u: 0.937007129192
train_h0_mean_x_mean_u: 0.467112243176
train_h0_mean_x_min_u: 0.0832107812166
train_h0_min_x_max_u: 0.334134042263
train_h0_min_x_mean_u: 0.0253749713302
train_h0_min_x_min_u: 1.24377470129e-11
train_h0_row_norms_max: 6.5384888649
train_h0_row_norms_mean: 3.20679235458
train_h0_row_norms_min: 0.131754085422
train_objective: 0.0140552837402
train_y_col_norms_max: 11.0526103973
train_y_col_norms_mean: 9.76351451874
train_y_col_norms_min: 7.68663883209
train_y_max_max_class: 0.999999940395
train_y_mean_max_class: 0.988768100739
train_y_min_max_class: 0.329038023949
train_y_misclass: 0.00163999991491
train_y_nll: 0.0140552837402
train_y_row_norms_max: 4.08399581909
train_y_row_norms_mean: 1.24770605564
train_y_row_norms_min: 0.1857201159
valid_h0_col_norms_max: 6.52083969116
valid_h0_col_norms_mean: 4.08554124832
valid_h0_col_norms_min: 2.15409636497
valid_h0_max_x_max_u: 1.0
valid_h0_max_x_mean_u: 0.967728018761
valid_h0_max_x_min_u: 0.569734930992
valid_h0_mean_x_max_u: 0.939063310623
valid_h0_mean_x_mean_u: 0.467084676027
valid_h0_mean_x_min_u: 0.0852277651429
valid_h0_min_x_max_u: 0.310160905123
valid_h0_min_x_mean_u: 0.0260545928031
valid_h0_min_x_min_u: 9.77232097327e-12
valid_h0_row_norms_max: 6.53848934174
valid_h0_row_norms_mean: 3.206792593
valid_h0_row_norms_min: 0.131754085422
valid_objective: 0.0679266303778
valid_y_col_norms_max: 11.052611351
valid_y_col_norms_mean: 9.76351451874
valid_y_col_norms_min: 7.68663883209
valid_y_max_max_class: 1.0
valid_y_mean_max_class: 0.982333242893
valid_y_min_max_class: 0.319318085909
valid_y_misclass: 0.0204000007361
valid_y_nll: 0.0679266303778
valid_y_row_norms_max: 4.08399629593
valid_y_row_norms_mean: 1.24770605564
valid_y_row_norms_min: 0.185720145702
Time this epoch: 35.364850 seconds
Monitoring step:
Epochs seen: 38
Batches seen: 190
Examples seen: 1900000
ave_grad_mult: 5.14290428162
ave_grad_size: 0.0190559756011
ave_step_size: 0.0913925841451
test_h0_col_norms_max: 6.53183841705
test_h0_col_norms_mean: 4.09429168701
test_h0_col_norms_min: 2.1546792984
test_h0_max_x_max_u: 1.0
test_h0_max_x_mean_u: 0.968341529369
test_h0_max_x_min_u: 0.560349822044
test_h0_mean_x_max_u: 0.94111353159
test_h0_mean_x_mean_u: 0.466603428125
test_h0_mean_x_min_u: 0.0797407329082
test_h0_min_x_max_u: 0.351446330547
test_h0_min_x_mean_u: 0.0251497104764
test_h0_min_x_min_u: 7.31677132076e-12
test_h0_row_norms_max: 6.55737257004
test_h0_row_norms_mean: 3.21393060684
test_h0_row_norms_min: 0.132736563683
test_objective: 0.0633104071021
test_y_col_norms_max: 11.2310876846
test_y_col_norms_mean: 9.94289398193
test_y_col_norms_min: 7.82843732834
test_y_max_max_class: 1.0
test_y_mean_max_class: 0.982944607735
test_y_min_max_class: 0.318380922079
test_y_misclass: 0.0193999987096
test_y_nll: 0.0633104071021
test_y_row_norms_max: 4.14330053329
test_y_row_norms_mean: 1.27068781853
test_y_row_norms_min: 0.189937055111
train_h0_col_norms_max: 6.53183746338
train_h0_col_norms_mean: 4.09429121017
train_h0_col_norms_min: 2.15467905998
train_h0_max_x_max_u: 0.999999940395
train_h0_max_x_mean_u: 0.969276428223
train_h0_max_x_min_u: 0.554496645927
train_h0_mean_x_max_u: 0.934813499451
train_h0_mean_x_mean_u: 0.466816186905
train_h0_mean_x_min_u: 0.0843253731728
train_h0_min_x_max_u: 0.332267045975
train_h0_min_x_mean_u: 0.0247781910002
train_h0_min_x_min_u: 9.73409200467e-12
train_h0_row_norms_max: 6.5573720932
train_h0_row_norms_mean: 3.21393036842
train_h0_row_norms_min: 0.132736548781
train_objective: 0.0125638237223
train_y_col_norms_max: 11.231086731
train_y_col_norms_mean: 9.94289398193
train_y_col_norms_min: 7.82843637466
train_y_max_max_class: 0.999999940395
train_y_mean_max_class: 0.989765167236
train_y_min_max_class: 0.37343031168
train_y_misclass: 0.00133999995887
train_y_nll: 0.0125638237223
train_y_row_norms_max: 4.14330005646
train_y_row_norms_mean: 1.27068758011
train_y_row_norms_min: 0.189937055111
valid_h0_col_norms_max: 6.53183841705
valid_h0_col_norms_mean: 4.09429168701
valid_h0_col_norms_min: 2.1546792984
valid_h0_max_x_max_u: 1.0
valid_h0_max_x_mean_u: 0.968490362167
valid_h0_max_x_min_u: 0.566503345966
valid_h0_mean_x_max_u: 0.93706715107
valid_h0_mean_x_mean_u: 0.466789364815
valid_h0_mean_x_min_u: 0.0863413140178
valid_h0_min_x_max_u: 0.307645887136
valid_h0_min_x_mean_u: 0.0253705345094
valid_h0_min_x_min_u: 7.80784985277e-12
valid_h0_row_norms_max: 6.55737257004
valid_h0_row_norms_mean: 3.21393060684
valid_h0_row_norms_min: 0.132736563683
valid_objective: 0.0684154629707
valid_y_col_norms_max: 11.2310876846
valid_y_col_norms_mean: 9.94289398193
valid_y_col_norms_min: 7.82843732834
valid_y_max_max_class: 1.0
valid_y_mean_max_class: 0.983206391335
valid_y_min_max_class: 0.354223191738
valid_y_misclass: 0.0201999973506
valid_y_nll: 0.0684154629707
valid_y_row_norms_max: 4.14330053329
valid_y_row_norms_mean: 1.27068781853
valid_y_row_norms_min: 0.189937055111
As the model trained, it should have printed out progress messages. Most of these are the values of the various channels being monitored throughout training.
We can use the print_monitor script to print the last monitoring entry of a saved model. By running it on "mlp_best.pkl", we can see the performance of the model at the point where it did the best on the validation set.
In [4]:
!print_monitor.py mlp_best.pkl | grep test_y_misclass
Using gpu device 2: GeForce GTX 285
/u/goodfeli/pylearn2/models/mlp.py:36: UserWarning: MLP changing the recursion limit.
warnings.warn("MLP changing the recursion limit.")
test_y_misclass : 0.0193999987096
The test set error has dropped to 1.94%! This is a big improvement over softmax regression.
Another common way of analyzing trained models is to look at their weights. Here we use the show_weights script to visualize $W$:
In [5]:
!show_weights.py mlp_best.pkl
Using gpu device 0: GeForce GTX 285
making weights report
loading model
loading done
loading dataset...
...done
smallest enc weight magnitude: 0.0
mean enc weight magnitude: 0.0409141770966
max enc weight magnitude: 4.76068
min norm: 2.15468
mean norm: 4.09429199219
max norm: 6.53184
So far in these tutorials, there has not been much benefit to using pylearn2, rather than some other machine learning library, or even just an implementation of softmax regression or an MLP without an accompanying library.
Now it's time to see some of why pylearn2 is useful. We're going to make several changes to our experimental setup, while still re-using most of the code. The beauty of pylearn2 is that it is built from interchangeable parts, so that if you want to create a new machine learning experiment, you don't need to rewrite the whole experiment from scratch.
We're going to take the MLP example above and change it in three major ways:
-Instead of training just a two layer MLP, we'll train a three layer MLP. We can do this just by putting one more layer in the "layers" list. We don't need to change the training algorithm or the main MLP model.
-Instead of using the Sigmoid Layer class, we'll use a different kind of layer, called a rectified linear layer. The rectified linear layer uses the usual affine function $z = x^T W + b$ to compute the presynaptic inputs, then passes each element of $z$ through the function $g(z) = \mathbb{I}_{z > 0} z$. In other words, values greater than 0 are left unchanged, while negative values are replaced with zeros. In pylearn2, we can do this just by loading a different class in the layers list. We don't need to change the training algorithm or the main MLP model.
-Instead of optimizing the log likelihood using the nonlinear conjugate gradient descent algorithm, we will optimize it using a minibatch version of stochastic gradient descent. We can do this just by passing in a different TrainingAlgorithm object. No changes to the model or the code for the cost are needed.
Here is the updated YAML description of the experiment:
In [6]:
import os
import pylearn2
path = os.path.join(pylearn2.__path__[0], 'scripts', 'tutorials', 'multilayer_perceptron', 'mlp_tutorial_part_3.yaml')
with open(path, 'r') as f:
train_2 = f.read()
hyper_params = {'train_stop' : 50000,
'valid_stop' : 60000,
'dim_h0' : 500,
'dim_h1' : 1000,
'sparse_init_h1' : 15,
'max_epochs' : 10000,
'save_path' : '.'}
train_2 = train_2 % (hyper_params)
print train_2
!obj:pylearn2.train.Train {
dataset: &train !obj:pylearn2.datasets.mnist.MNIST {
which_set: 'train',
start: 0,
stop: 50000
},
model: !obj:pylearn2.models.mlp.MLP {
layers: [ !obj:pylearn2.models.mlp.RectifiedLinear {
layer_name: 'h0',
dim: 500,
sparse_init: 15
}, !obj:pylearn2.models.mlp.RectifiedLinear {
layer_name: 'h1',
dim: 1000,
sparse_init: 15
}, !obj:pylearn2.models.mlp.Softmax {
layer_name: 'y',
n_classes: 10,
irange: 0.
}
],
nvis: 784,
},
algorithm: !obj:pylearn2.training_algorithms.sgd.SGD {
batch_size: 100,
learning_rate: .01,
monitoring_dataset:
{
'train' : *train,
'valid' : !obj:pylearn2.datasets.mnist.MNIST {
which_set: 'train',
start: 50000,
stop: 60000
},
'test' : !obj:pylearn2.datasets.mnist.MNIST {
which_set: 'test',
}
},
learning_rule: !obj:pylearn2.training_algorithms.learning_rule.Momentum {
init_momentum: .5
},
termination_criterion: !obj:pylearn2.termination_criteria.And {
criteria: [
!obj:pylearn2.termination_criteria.MonitorBased {
channel_name: "valid_y_misclass",
prop_decrease: 0.,
N: 10
},
!obj:pylearn2.termination_criteria.EpochCounter {
max_epochs: 10000
}
]
}
},
extensions: [ !obj:pylearn2.train_extensions.best_params.MonitorBasedSaveBest {
channel_name: 'valid_y_misclass',
save_path: "mlp_2_best.pkl"
}, !obj:pylearn2.training_algorithms.learning_rule.MomentumAdjustor {
start: 1,
saturate: 10,
final_momentum: .99
}
]
}
This YAML config file also introduces another use of extensions to the Train object. Here, we add the MomentumAdjustor. It uses a callback to adjust the momentum setting of the SGD algorithm at the end of each epoch. Here, we configure it to start increasing the momentum after 1 epoch, and to continue increasing it until it reaches a value of .99 at the end of the tenth epoch. See the docstring for the SGD class for more information on what this momentum setting does.
In [7]:
from pylearn2.config import yaml_parse
train_2 = yaml_parse.load(train_2)
train_2.main_loop()
Parameter and initial learning rate summary:
h0_W: 0.00999999977648
h0_b: 0.00999999977648
h1_W: 0.00999999977648
h1_b: 0.00999999977648
softmax_b: 0.00999999977648
softmax_W: 0.00999999977648
Compiling sgd_update...
Compiling sgd_update done. Time elapsed: 2.516152 seconds
compiling begin_record_entry...
compiling begin_record_entry done. Time elapsed: 0.395491 seconds
Monitored channels:
learning_rate
momentum
test_h0_col_norms_max
test_h0_col_norms_mean
test_h0_col_norms_min
test_h0_row_norms_max
test_h0_row_norms_mean
test_h0_row_norms_min
test_h1_col_norms_max
test_h1_col_norms_mean
test_h1_col_norms_min
test_h1_row_norms_max
test_h1_row_norms_mean
test_h1_row_norms_min
test_objective
test_y_col_norms_max
test_y_col_norms_mean
test_y_col_norms_min
test_y_max_max_class
test_y_mean_max_class
test_y_min_max_class
test_y_misclass
test_y_nll
test_y_row_norms_max
test_y_row_norms_mean
test_y_row_norms_min
train_h0_col_norms_max
train_h0_col_norms_mean
train_h0_col_norms_min
train_h0_row_norms_max
train_h0_row_norms_mean
train_h0_row_norms_min
train_h1_col_norms_max
train_h1_col_norms_mean
train_h1_col_norms_min
train_h1_row_norms_max
train_h1_row_norms_mean
train_h1_row_norms_min
train_objective
train_y_col_norms_max
train_y_col_norms_mean
train_y_col_norms_min
train_y_max_max_class
train_y_mean_max_class
train_y_min_max_class
train_y_misclass
train_y_nll
train_y_row_norms_max
train_y_row_norms_mean
train_y_row_norms_min
valid_h0_col_norms_max
valid_h0_col_norms_mean
valid_h0_col_norms_min
valid_h0_row_norms_max
valid_h0_row_norms_mean
valid_h0_row_norms_min
valid_h1_col_norms_max
valid_h1_col_norms_mean
valid_h1_col_norms_min
valid_h1_row_norms_max
valid_h1_row_norms_mean
valid_h1_row_norms_min
valid_objective
valid_y_col_norms_max
valid_y_col_norms_mean
valid_y_col_norms_min
valid_y_max_max_class
valid_y_mean_max_class
valid_y_min_max_class
valid_y_misclass
valid_y_nll
valid_y_row_norms_max
valid_y_row_norms_mean
valid_y_row_norms_min
Compiling accum...
graph size: 165
graph size: 163
graph size: 163
Compiling accum done. Time elapsed: 11.563393 seconds
Monitoring step:
Epochs seen: 0
Batches seen: 0
Examples seen: 0
learning_rate: 0.00999999046326
momentum: 0.499999672174
test_h0_col_norms_max: 6.23503017426
test_h0_col_norms_mean: 3.82356023788
test_h0_col_norms_min: 2.06193947792
test_h0_row_norms_max: 5.89326524734
test_h0_row_norms_mean: 2.98549389839
test_h0_row_norms_min: 0.0
test_h1_col_norms_max: 5.99438333511
test_h1_col_norms_mean: 3.80721712112
test_h1_col_norms_min: 1.71524214745
test_h1_row_norms_max: 7.80886650085
test_h1_row_norms_mean: 5.40815734863
test_h1_row_norms_min: 2.97773504257
test_objective: 2.30258488655
test_y_col_norms_max: 0.0
test_y_col_norms_mean: 0.0
test_y_col_norms_min: 0.0
test_y_max_max_class: 0.100000023842
test_y_mean_max_class: 0.100000031292
test_y_min_max_class: 0.100000023842
test_y_misclass: 0.901999890804
test_y_nll: 2.30258488655
test_y_row_norms_max: 0.0
test_y_row_norms_mean: 0.0
test_y_row_norms_min: 0.0
train_h0_col_norms_max: 6.23505115509
train_h0_col_norms_mean: 3.82354259491
train_h0_col_norms_min: 2.0619494915
train_h0_row_norms_max: 5.89324569702
train_h0_row_norms_mean: 2.98548007011
train_h0_row_norms_min: 0.0
train_h1_col_norms_max: 5.99438095093
train_h1_col_norms_mean: 3.80721092224
train_h1_col_norms_min: 1.71524274349
train_h1_row_norms_max: 7.80887794495
train_h1_row_norms_mean: 5.40813541412
train_h1_row_norms_min: 2.97772955894
train_objective: 2.30257916451
train_y_col_norms_max: 0.0
train_y_col_norms_mean: 0.0
train_y_col_norms_min: 0.0
train_y_max_max_class: 0.100000545382
train_y_mean_max_class: 0.100000545382
train_y_min_max_class: 0.100000545382
train_y_misclass: 0.901360213757
train_y_nll: 2.30257916451
train_y_row_norms_max: 0.0
train_y_row_norms_mean: 0.0
train_y_row_norms_min: 0.0
valid_h0_col_norms_max: 6.23503017426
valid_h0_col_norms_mean: 3.82356023788
valid_h0_col_norms_min: 2.06193947792
valid_h0_row_norms_max: 5.89326524734
valid_h0_row_norms_mean: 2.98549389839
valid_h0_row_norms_min: 0.0
valid_h1_col_norms_max: 5.99438333511
valid_h1_col_norms_mean: 3.80721712112
valid_h1_col_norms_min: 1.71524214745
valid_h1_row_norms_max: 7.80886650085
valid_h1_row_norms_mean: 5.40815734863
valid_h1_row_norms_min: 2.97773504257
valid_objective: 2.30258488655
valid_y_col_norms_max: 0.0
valid_y_col_norms_mean: 0.0
valid_y_col_norms_min: 0.0
valid_y_max_max_class: 0.100000023842
valid_y_mean_max_class: 0.100000031292
valid_y_min_max_class: 0.100000023842
valid_y_misclass: 0.90089994669
valid_y_nll: 2.30258488655
valid_y_row_norms_max: 0.0
valid_y_row_norms_mean: 0.0
valid_y_row_norms_min: 0.0
Time this epoch: 3.343442 seconds
Monitoring step:
Epochs seen: 1
Batches seen: 500
Examples seen: 50000
learning_rate: 0.00999999046326
momentum: 0.499999672174
test_h0_col_norms_max: 6.23488473892
test_h0_col_norms_mean: 3.82359194756
test_h0_col_norms_min: 2.06265735626
test_h0_row_norms_max: 5.89264249802
test_h0_row_norms_mean: 2.98556685448
test_h0_row_norms_min: 0.00163861282635
test_h1_col_norms_max: 5.99485731125
test_h1_col_norms_mean: 3.80723309517
test_h1_col_norms_min: 1.71526324749
test_h1_row_norms_max: 7.80893564224
test_h1_row_norms_mean: 5.40817546844
test_h1_row_norms_min: 2.97778272629
test_objective: 0.268750548363
test_y_col_norms_max: 0.645500898361
test_y_col_norms_mean: 0.596350252628
test_y_col_norms_min: 0.520334303379
test_y_max_max_class: 0.999946475029
test_y_mean_max_class: 0.904475390911
test_y_min_max_class: 0.38064879179
test_y_misclass: 0.0812000110745
test_y_nll: 0.268750548363
test_y_row_norms_max: 0.17966529727
test_y_row_norms_mean: 0.0518538914621
test_y_row_norms_min: 0.000149252169649
train_h0_col_norms_max: 6.23488473892
train_h0_col_norms_mean: 3.82361268997
train_h0_col_norms_min: 2.06266713142
train_h0_row_norms_max: 5.89267301559
train_h0_row_norms_mean: 2.98556661606
train_h0_row_norms_min: 0.001638607122
train_h1_col_norms_max: 5.99485683441
train_h1_col_norms_mean: 3.80721235275
train_h1_col_norms_min: 1.71525621414
train_h1_row_norms_max: 7.80892753601
train_h1_row_norms_mean: 5.4081993103
train_h1_row_norms_min: 2.97776818275
train_objective: 0.264730095863
train_y_col_norms_max: 0.645499527454
train_y_col_norms_mean: 0.596347033978
train_y_col_norms_min: 0.520334303379
train_y_max_max_class: 0.999963521957
train_y_mean_max_class: 0.899078428745
train_y_min_max_class: 0.361695259809
train_y_misclass: 0.0793600603938
train_y_nll: 0.264730095863
train_y_row_norms_max: 0.179665282369
train_y_row_norms_mean: 0.051854070276
train_y_row_norms_min: 0.000149251762195
valid_h0_col_norms_max: 6.23488473892
valid_h0_col_norms_mean: 3.82359194756
valid_h0_col_norms_min: 2.06265735626
valid_h0_row_norms_max: 5.89264249802
valid_h0_row_norms_mean: 2.98556685448
valid_h0_row_norms_min: 0.00163861282635
valid_h1_col_norms_max: 5.99485731125
valid_h1_col_norms_mean: 3.80723309517
valid_h1_col_norms_min: 1.71526324749
valid_h1_row_norms_max: 7.80893564224
valid_h1_row_norms_mean: 5.40817546844
valid_h1_row_norms_min: 2.97778272629
valid_objective: 0.252131432295
valid_y_col_norms_max: 0.645500898361
valid_y_col_norms_mean: 0.596350252628
valid_y_col_norms_min: 0.520334303379
valid_y_max_max_class: 0.999965012074
valid_y_mean_max_class: 0.907301902771
valid_y_min_max_class: 0.362495720387
valid_y_misclass: 0.0754000097513
valid_y_nll: 0.252131432295
valid_y_row_norms_max: 0.17966529727
valid_y_row_norms_mean: 0.0518538914621
valid_y_row_norms_min: 0.000149252169649
Time this epoch: 3.325040 seconds
Monitoring step:
Epochs seen: 2
Batches seen: 1000
Examples seen: 100000
learning_rate: 0.00999999046326
momentum: 0.554444551468
test_h0_col_norms_max: 6.2346944809
test_h0_col_norms_mean: 3.82387781143
test_h0_col_norms_min: 2.06334352493
test_h0_row_norms_max: 5.89264249802
test_h0_row_norms_mean: 2.98581314087
test_h0_row_norms_min: 0.00337248062715
test_h1_col_norms_max: 5.99546384811
test_h1_col_norms_mean: 3.80735421181
test_h1_col_norms_min: 1.71530222893
test_h1_row_norms_max: 7.80887699127
test_h1_row_norms_mean: 5.40835094452
test_h1_row_norms_min: 2.97777676582
test_objective: 0.209201917052
test_y_col_norms_max: 0.849824726582
test_y_col_norms_mean: 0.752399742603
test_y_col_norms_min: 0.648707330227
test_y_max_max_class: 0.999981224537
test_y_mean_max_class: 0.928354024887
test_y_min_max_class: 0.417280673981
test_y_misclass: 0.0621000118554
test_y_nll: 0.209201917052
test_y_row_norms_max: 0.202846974134
test_y_row_norms_mean: 0.0668164640665
test_y_row_norms_min: 0.000276584294625
train_h0_col_norms_max: 6.23466491699
train_h0_col_norms_mean: 3.82387685776
train_h0_col_norms_min: 2.06333851814
train_h0_row_norms_max: 5.89267301559
train_h0_row_norms_mean: 2.98582696915
train_h0_row_norms_min: 0.00337246293202
train_h1_col_norms_max: 5.99549293518
train_h1_col_norms_mean: 3.80733585358
train_h1_col_norms_min: 1.71530234814
train_h1_row_norms_max: 7.80891132355
train_h1_row_norms_mean: 5.4083533287
train_h1_row_norms_min: 2.97776651382
train_objective: 0.192548781633
train_y_col_norms_max: 0.849820315838
train_y_col_norms_mean: 0.752397358418
train_y_col_norms_min: 0.648707211018
train_y_max_max_class: 0.999981343746
train_y_mean_max_class: 0.925991177559
train_y_min_max_class: 0.379428476095
train_y_misclass: 0.0572400614619
train_y_nll: 0.192548781633
train_y_row_norms_max: 0.202847748995
train_y_row_norms_mean: 0.0668167173862
train_y_row_norms_min: 0.000276583392406
valid_h0_col_norms_max: 6.2346944809
valid_h0_col_norms_mean: 3.82387781143
valid_h0_col_norms_min: 2.06334352493
valid_h0_row_norms_max: 5.89264249802
valid_h0_row_norms_mean: 2.98581314087
valid_h0_row_norms_min: 0.00337248062715
valid_h1_col_norms_max: 5.99546384811
valid_h1_col_norms_mean: 3.80735421181
valid_h1_col_norms_min: 1.71530222893
valid_h1_row_norms_max: 7.80887699127
valid_h1_row_norms_mean: 5.40835094452
valid_h1_row_norms_min: 2.97777676582
valid_objective: 0.201314240694
valid_y_col_norms_max: 0.849824726582
valid_y_col_norms_mean: 0.752399742603
valid_y_col_norms_min: 0.648707330227
valid_y_max_max_class: 0.999982595444
valid_y_mean_max_class: 0.93180680275
valid_y_min_max_class: 0.40289413929
valid_y_misclass: 0.0579000003636
valid_y_nll: 0.201314240694
valid_y_row_norms_max: 0.202846974134
valid_y_row_norms_mean: 0.0668164640665
valid_y_row_norms_min: 0.000276584294625
Time this epoch: 3.321143 seconds
Monitoring step:
Epochs seen: 3
Batches seen: 1500
Examples seen: 150000
learning_rate: 0.00999999046326
momentum: 0.608888924122
test_h0_col_norms_max: 6.23464679718
test_h0_col_norms_mean: 3.82416844368
test_h0_col_norms_min: 2.06404829025
test_h0_row_norms_max: 5.89243221283
test_h0_row_norms_mean: 2.98607397079
test_h0_row_norms_min: 0.00511313043535
test_h1_col_norms_max: 5.99604940414
test_h1_col_norms_mean: 3.80747485161
test_h1_col_norms_min: 1.71535277367
test_h1_row_norms_max: 7.80883836746
test_h1_row_norms_mean: 5.40852594376
test_h1_row_norms_min: 2.97782230377
test_objective: 0.18524043262
test_y_col_norms_max: 1.00719892979
test_y_col_norms_mean: 0.879001736641
test_y_col_norms_min: 0.748181402683
test_y_max_max_class: 0.999993741512
test_y_mean_max_class: 0.939781844616
test_y_min_max_class: 0.445061296225
test_y_misclass: 0.0548000186682
test_y_nll: 0.18524043262
test_y_row_norms_max: 0.216917276382
test_y_row_norms_mean: 0.0788432434201
test_y_row_norms_min: 0.000395227049012
train_h0_col_norms_max: 6.23464632034
train_h0_col_norms_mean: 3.82414579391
train_h0_col_norms_min: 2.06404733658
train_h0_row_norms_max: 5.89245033264
train_h0_row_norms_mean: 2.98607373238
train_h0_row_norms_min: 0.00511312671006
train_h1_col_norms_max: 5.99604892731
train_h1_col_norms_mean: 3.80745625496
train_h1_col_norms_min: 1.71535873413
train_h1_row_norms_max: 7.80887460709
train_h1_row_norms_mean: 5.40852594376
train_h1_row_norms_min: 2.9778380394
train_objective: 0.161898091435
train_y_col_norms_max: 1.00719916821
train_y_col_norms_mean: 0.87899774313
train_y_col_norms_min: 0.748184919357
train_y_max_max_class: 0.999991238117
train_y_mean_max_class: 0.93733805418
train_y_min_max_class: 0.405598640442
train_y_misclass: 0.0483000576496
train_y_nll: 0.161898091435
train_y_row_norms_max: 0.216916337609
train_y_row_norms_mean: 0.0788431763649
train_y_row_norms_min: 0.000395228940761
valid_h0_col_norms_max: 6.23464679718
valid_h0_col_norms_mean: 3.82416844368
valid_h0_col_norms_min: 2.06404829025
valid_h0_row_norms_max: 5.89243221283
valid_h0_row_norms_mean: 2.98607397079
valid_h0_row_norms_min: 0.00511313043535
valid_h1_col_norms_max: 5.99604940414
valid_h1_col_norms_mean: 3.80747485161
valid_h1_col_norms_min: 1.71535277367
valid_h1_row_norms_max: 7.80883836746
valid_h1_row_norms_mean: 5.40852594376
valid_h1_row_norms_min: 2.97782230377
valid_objective: 0.174453571439
valid_y_col_norms_max: 1.00719892979
valid_y_col_norms_mean: 0.879001736641
valid_y_col_norms_min: 0.748181402683
valid_y_max_max_class: 0.999995052814
valid_y_mean_max_class: 0.94245827198
valid_y_min_max_class: 0.418575078249
valid_y_misclass: 0.0514000207186
valid_y_nll: 0.174453571439
valid_y_row_norms_max: 0.216917276382
valid_y_row_norms_mean: 0.0788432434201
valid_y_row_norms_min: 0.000395227049012
Time this epoch: 3.407873 seconds
Monitoring step:
Epochs seen: 4
Batches seen: 2000
Examples seen: 200000
learning_rate: 0.00999999046326
momentum: 0.663333714008
test_h0_col_norms_max: 6.23483276367
test_h0_col_norms_mean: 3.82449483871
test_h0_col_norms_min: 2.06498026848
test_h0_row_norms_max: 5.89247989655
test_h0_row_norms_mean: 2.98636126518
test_h0_row_norms_min: 0.00637936964631
test_h1_col_norms_max: 5.99670314789
test_h1_col_norms_mean: 3.80761146545
test_h1_col_norms_min: 1.71540987492
test_h1_row_norms_max: 7.80886650085
test_h1_row_norms_mean: 5.40871572495
test_h1_row_norms_min: 2.97799134254
test_objective: 0.167924150825
test_y_col_norms_max: 1.14452064037
test_y_col_norms_mean: 0.995063841343
test_y_col_norms_min: 0.840617954731
test_y_max_max_class: 0.99999588728
test_y_mean_max_class: 0.946992635727
test_y_min_max_class: 0.455186247826
test_y_misclass: 0.0552000291646
test_y_nll: 0.167924150825
test_y_row_norms_max: 0.23083357513
test_y_row_norms_mean: 0.08986672014
test_y_row_norms_min: 0.000483248528326
train_h0_col_norms_max: 6.2348651886
train_h0_col_norms_mean: 3.82447862625
train_h0_col_norms_min: 2.06498932838
train_h0_row_norms_max: 5.89249992371
train_h0_row_norms_mean: 2.98634982109
train_h0_row_norms_min: 0.00637934077531
train_h1_col_norms_max: 5.99670362473
train_h1_col_norms_mean: 3.80763316154
train_h1_col_norms_min: 1.71541762352
train_h1_row_norms_max: 7.80887794495
train_h1_row_norms_mean: 5.40874290466
train_h1_row_norms_min: 2.97797679901
train_objective: 0.138446286321
train_y_col_norms_max: 1.1445235014
train_y_col_norms_mean: 0.995067954063
train_y_col_norms_min: 0.840613126755
train_y_max_max_class: 0.999992251396
train_y_mean_max_class: 0.945943057537
train_y_min_max_class: 0.423846125603
train_y_misclass: 0.0430600605905
train_y_nll: 0.138446286321
train_y_row_norms_max: 0.230833858252
train_y_row_norms_mean: 0.0898664072156
train_y_row_norms_min: 0.000483250943944
valid_h0_col_norms_max: 6.23483276367
valid_h0_col_norms_mean: 3.82449483871
valid_h0_col_norms_min: 2.06498026848
valid_h0_row_norms_max: 5.89247989655
valid_h0_row_norms_mean: 2.98636126518
valid_h0_row_norms_min: 0.00637936964631
valid_h1_col_norms_max: 5.99670314789
valid_h1_col_norms_mean: 3.80761146545
valid_h1_col_norms_min: 1.71540987492
valid_h1_row_norms_max: 7.80886650085
valid_h1_row_norms_mean: 5.40871572495
valid_h1_row_norms_min: 2.97799134254
valid_objective: 0.157675400376
valid_y_col_norms_max: 1.14452064037
valid_y_col_norms_mean: 0.995063841343
valid_y_col_norms_min: 0.840617954731
valid_y_max_max_class: 0.999996602535
valid_y_mean_max_class: 0.949966013432
valid_y_min_max_class: 0.442742049694
valid_y_misclass: 0.046300008893
valid_y_nll: 0.157675400376
valid_y_row_norms_max: 0.23083357513
valid_y_row_norms_mean: 0.08986672014
valid_y_row_norms_min: 0.000483248528326
Time this epoch: 3.220654 seconds
Monitoring step:
Epochs seen: 5
Batches seen: 2500
Examples seen: 250000
learning_rate: 0.00999999046326
momentum: 0.717777192593
test_h0_col_norms_max: 6.23521852493
test_h0_col_norms_mean: 3.82483482361
test_h0_col_norms_min: 2.06603121758
test_h0_row_norms_max: 5.89207363129
test_h0_row_norms_mean: 2.98667144775
test_h0_row_norms_min: 0.00797319039702
test_h1_col_norms_max: 5.99737501144
test_h1_col_norms_mean: 3.80774116516
test_h1_col_norms_min: 1.71550190449
test_h1_row_norms_max: 7.80892467499
test_h1_row_norms_mean: 5.40890693665
test_h1_row_norms_min: 2.97820734978
test_objective: 0.13814201951
test_y_col_norms_max: 1.26785862446
test_y_col_norms_mean: 1.10942089558
test_y_col_norms_min: 0.9239538908
test_y_max_max_class: 0.999995410442
test_y_mean_max_class: 0.953776538372
test_y_min_max_class: 0.461881011724
test_y_misclass: 0.0431000031531
test_y_nll: 0.13814201951
test_y_row_norms_max: 0.258687496185
test_y_row_norms_mean: 0.10072222352
test_y_row_norms_min: 0.000603844528086
train_h0_col_norms_max: 6.23519468307
train_h0_col_norms_mean: 3.82483053207
train_h0_col_norms_min: 2.06602716446
train_h0_row_norms_max: 5.89205408096
train_h0_row_norms_mean: 2.98667001724
train_h0_row_norms_min: 0.0079732267186
train_h1_col_norms_max: 5.99740314484
train_h1_col_norms_mean: 3.80775809288
train_h1_col_norms_min: 1.71549510956
train_h1_row_norms_max: 7.80892419815
train_h1_row_norms_mean: 5.40891933441
train_h1_row_norms_min: 2.97820615768
train_objective: 0.104295127094
train_y_col_norms_max: 1.26785480976
train_y_col_norms_mean: 1.109421134
train_y_col_norms_min: 0.923955321312
train_y_max_max_class: 0.999992787838
train_y_mean_max_class: 0.954641282558
train_y_min_max_class: 0.442351669073
train_y_misclass: 0.0312000326812
train_y_nll: 0.104295127094
train_y_row_norms_max: 0.258685946465
train_y_row_norms_mean: 0.100721813738
train_y_row_norms_min: 0.000603846099693
valid_h0_col_norms_max: 6.23521852493
valid_h0_col_norms_mean: 3.82483482361
valid_h0_col_norms_min: 2.06603121758
valid_h0_row_norms_max: 5.89207363129
valid_h0_row_norms_mean: 2.98667144775
valid_h0_row_norms_min: 0.00797319039702
valid_h1_col_norms_max: 5.99737501144
valid_h1_col_norms_mean: 3.80774116516
valid_h1_col_norms_min: 1.71550190449
valid_h1_row_norms_max: 7.80892467499
valid_h1_row_norms_mean: 5.40890693665
valid_h1_row_norms_min: 2.97820734978
valid_objective: 0.136576414108
valid_y_col_norms_max: 1.26785862446
valid_y_col_norms_mean: 1.10942089558
valid_y_col_norms_min: 0.9239538908
valid_y_max_max_class: 0.999996840954
valid_y_mean_max_class: 0.956140458584
valid_y_min_max_class: 0.448911756277
valid_y_misclass: 0.0386999994516
valid_y_nll: 0.136576414108
valid_y_row_norms_max: 0.258687496185
valid_y_row_norms_mean: 0.10072222352
valid_y_row_norms_min: 0.000603844528086
Time this epoch: 3.204515 seconds
Monitoring step:
Epochs seen: 6
Batches seen: 3000
Examples seen: 300000
learning_rate: 0.00999999046326
momentum: 0.772221684456
test_h0_col_norms_max: 6.23541164398
test_h0_col_norms_mean: 3.82526040077
test_h0_col_norms_min: 2.0674469471
test_h0_row_norms_max: 5.89197492599
test_h0_row_norms_mean: 2.98706746101
test_h0_row_norms_min: 0.00963484868407
test_h1_col_norms_max: 5.9978518486
test_h1_col_norms_mean: 3.80790233612
test_h1_col_norms_min: 1.71558940411
test_h1_row_norms_max: 7.80901002884
test_h1_row_norms_mean: 5.40913200378
test_h1_row_norms_min: 2.97820520401
test_objective: 0.12612003088
test_y_col_norms_max: 1.39495909214
test_y_col_norms_mean: 1.23315572739
test_y_col_norms_min: 1.02864944935
test_y_max_max_class: 0.999998807907
test_y_mean_max_class: 0.961598396301
test_y_min_max_class: 0.503333091736
test_y_misclass: 0.040100004524
test_y_nll: 0.12612003088
test_y_row_norms_max: 0.288501292467
test_y_row_norms_mean: 0.112407810986
test_y_row_norms_min: 0.000765459961258
train_h0_col_norms_max: 6.23538017273
train_h0_col_norms_mean: 3.82528162003
train_h0_col_norms_min: 2.0674469471
train_h0_row_norms_max: 5.89197683334
train_h0_row_norms_mean: 2.98705887794
train_h0_row_norms_min: 0.00963485334069
train_h1_col_norms_max: 5.99787998199
train_h1_col_norms_mean: 3.80790233612
train_h1_col_norms_min: 1.7155970335
train_h1_row_norms_max: 7.80897331238
train_h1_row_norms_mean: 5.40915393829
train_h1_row_norms_min: 2.97820544243
train_objective: 0.0812869444489
train_y_col_norms_max: 1.39496576786
train_y_col_norms_mean: 1.23315918446
train_y_col_norms_min: 1.02865147591
train_y_max_max_class: 0.99999409914
train_y_mean_max_class: 0.963725090027
train_y_min_max_class: 0.476592302322
train_y_misclass: 0.0230800136924
train_y_nll: 0.0812869444489
train_y_row_norms_max: 0.288501352072
train_y_row_norms_mean: 0.112407691777
train_y_row_norms_min: 0.00076545990305
valid_h0_col_norms_max: 6.23541164398
valid_h0_col_norms_mean: 3.82526040077
valid_h0_col_norms_min: 2.0674469471
valid_h0_row_norms_max: 5.89197492599
valid_h0_row_norms_mean: 2.98706746101
valid_h0_row_norms_min: 0.00963484868407
valid_h1_col_norms_max: 5.9978518486
valid_h1_col_norms_mean: 3.80790233612
valid_h1_col_norms_min: 1.71558940411
valid_h1_row_norms_max: 7.80901002884
valid_h1_row_norms_mean: 5.40913200378
valid_h1_row_norms_min: 2.97820520401
valid_objective: 0.127863824368
valid_y_col_norms_max: 1.39495909214
valid_y_col_norms_mean: 1.23315572739
valid_y_col_norms_min: 1.02864944935
valid_y_max_max_class: 0.999999046326
valid_y_mean_max_class: 0.964188098907
valid_y_min_max_class: 0.480807334185
valid_y_misclass: 0.0376999974251
valid_y_nll: 0.127863824368
valid_y_row_norms_max: 0.288501292467
valid_y_row_norms_mean: 0.112407810986
valid_y_row_norms_min: 0.000765459961258
Time this epoch: 3.235264 seconds
Monitoring step:
Epochs seen: 7
Batches seen: 3500
Examples seen: 350000
learning_rate: 0.00999999046326
momentum: 0.826667308807
test_h0_col_norms_max: 6.23617553711
test_h0_col_norms_mean: 3.82576131821
test_h0_col_norms_min: 2.06955361366
test_h0_row_norms_max: 5.8926115036
test_h0_row_norms_mean: 2.98752951622
test_h0_row_norms_min: 0.011014319025
test_h1_col_norms_max: 5.99838781357
test_h1_col_norms_mean: 3.8080675602
test_h1_col_norms_min: 1.71574032307
test_h1_row_norms_max: 7.80883789062
test_h1_row_norms_mean: 5.40936756134
test_h1_row_norms_min: 2.97880935669
test_objective: 0.127731248736
test_y_col_norms_max: 1.54538154602
test_y_col_norms_mean: 1.37167823315
test_y_col_norms_min: 1.13854420185
test_y_max_max_class: 0.999999046326
test_y_mean_max_class: 0.9629342556
test_y_min_max_class: 0.519809484482
test_y_misclass: 0.0402000173926
test_y_nll: 0.127731248736
test_y_row_norms_max: 0.32344275713
test_y_row_norms_mean: 0.125407382846
test_y_row_norms_min: 0.000886962865479
train_h0_col_norms_max: 6.23615169525
train_h0_col_norms_mean: 3.82577753067
train_h0_col_norms_min: 2.06954622269
train_h0_row_norms_max: 5.89259195328
train_h0_row_norms_mean: 2.98751401901
train_h0_row_norms_min: 0.0110142948106
train_h1_col_norms_max: 5.99837732315
train_h1_col_norms_mean: 3.80804681778
train_h1_col_norms_min: 1.71573352814
train_h1_row_norms_max: 7.80887413025
train_h1_row_norms_mean: 5.4093914032
train_h1_row_norms_min: 2.97880387306
train_objective: 0.0784979835153
train_y_col_norms_max: 1.54537415504
train_y_col_norms_mean: 1.37168061733
train_y_col_norms_min: 1.13854324818
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.965206980705
train_y_min_max_class: 0.486533343792
train_y_misclass: 0.0245000198483
train_y_nll: 0.0784979835153
train_y_row_norms_max: 0.323444247246
train_y_row_norms_mean: 0.125407934189
train_y_row_norms_min: 0.000886966707185
valid_h0_col_norms_max: 6.23617553711
valid_h0_col_norms_mean: 3.82576131821
valid_h0_col_norms_min: 2.06955361366
valid_h0_row_norms_max: 5.8926115036
valid_h0_row_norms_mean: 2.98752951622
valid_h0_row_norms_min: 0.011014319025
valid_h1_col_norms_max: 5.99838781357
valid_h1_col_norms_mean: 3.8080675602
valid_h1_col_norms_min: 1.71574032307
valid_h1_row_norms_max: 7.80883789062
valid_h1_row_norms_mean: 5.40936756134
valid_h1_row_norms_min: 2.97880935669
valid_objective: 0.126347467303
valid_y_col_norms_max: 1.54538154602
valid_y_col_norms_mean: 1.37167823315
valid_y_col_norms_min: 1.13854420185
valid_y_max_max_class: 0.999999165535
valid_y_mean_max_class: 0.966301620007
valid_y_min_max_class: 0.483229219913
valid_y_misclass: 0.0362999886274
valid_y_nll: 0.126347467303
valid_y_row_norms_max: 0.32344275713
valid_y_row_norms_mean: 0.125407382846
valid_y_row_norms_min: 0.000886962865479
Time this epoch: 3.324166 seconds
Monitoring step:
Epochs seen: 8
Batches seen: 4000
Examples seen: 400000
learning_rate: 0.00999999046326
momentum: 0.881111502647
test_h0_col_norms_max: 6.23693847656
test_h0_col_norms_mean: 3.8264799118
test_h0_col_norms_min: 2.07238268852
test_h0_row_norms_max: 5.89200305939
test_h0_row_norms_mean: 2.98819732666
test_h0_row_norms_min: 0.0122548062354
test_h1_col_norms_max: 5.99879837036
test_h1_col_norms_mean: 3.80823135376
test_h1_col_norms_min: 1.71583795547
test_h1_row_norms_max: 7.80892133713
test_h1_row_norms_mean: 5.40960502625
test_h1_row_norms_min: 2.97916102409
test_objective: 0.121290750802
test_y_col_norms_max: 1.74212527275
test_y_col_norms_mean: 1.55456089973
test_y_col_norms_min: 1.29530310631
test_y_max_max_class: 0.999999284744
test_y_mean_max_class: 0.970344901085
test_y_min_max_class: 0.541184604168
test_y_misclass: 0.0355000011623
test_y_nll: 0.121290750802
test_y_row_norms_max: 0.393140137196
test_y_row_norms_mean: 0.142595127225
test_y_row_norms_min: 0.00119761796668
train_h0_col_norms_max: 6.23696804047
train_h0_col_norms_mean: 3.82649302483
train_h0_col_norms_min: 2.07238888741
train_h0_row_norms_max: 5.89202260971
train_h0_row_norms_mean: 2.98821163177
train_h0_row_norms_min: 0.0122548071668
train_h1_col_norms_max: 5.99882984161
train_h1_col_norms_mean: 3.80823636055
train_h1_col_norms_min: 1.71583855152
train_h1_row_norms_max: 7.80892324448
train_h1_row_norms_mean: 5.40962982178
train_h1_row_norms_min: 2.979159832
train_objective: 0.0608208738267
train_y_col_norms_max: 1.7421246767
train_y_col_norms_mean: 1.55455350876
train_y_col_norms_min: 1.29530549049
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.97307318449
train_y_min_max_class: 0.52649885416
train_y_misclass: 0.018220026046
train_y_nll: 0.0608208738267
train_y_row_norms_max: 0.393138289452
train_y_row_norms_mean: 0.142595857382
train_y_row_norms_min: 0.00119762122631
valid_h0_col_norms_max: 6.23693847656
valid_h0_col_norms_mean: 3.8264799118
valid_h0_col_norms_min: 2.07238268852
valid_h0_row_norms_max: 5.89200305939
valid_h0_row_norms_mean: 2.98819732666
valid_h0_row_norms_min: 0.0122548062354
valid_h1_col_norms_max: 5.99879837036
valid_h1_col_norms_mean: 3.80823135376
valid_h1_col_norms_min: 1.71583795547
valid_h1_row_norms_max: 7.80892133713
valid_h1_row_norms_mean: 5.40960502625
valid_h1_row_norms_min: 2.97916102409
valid_objective: 0.120653524995
valid_y_col_norms_max: 1.74212527275
valid_y_col_norms_mean: 1.55456089973
valid_y_col_norms_min: 1.29530310631
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.971736133099
valid_y_min_max_class: 0.502751410007
valid_y_misclass: 0.0357999950647
valid_y_nll: 0.120653524995
valid_y_row_norms_max: 0.393140137196
valid_y_row_norms_mean: 0.142595127225
valid_y_row_norms_min: 0.00119761796668
Time this epoch: 3.219467 seconds
Monitoring step:
Epochs seen: 9
Batches seen: 4500
Examples seen: 450000
learning_rate: 0.00999999046326
momentum: 0.935554862022
test_h0_col_norms_max: 6.23974847794
test_h0_col_norms_mean: 3.82828760147
test_h0_col_norms_min: 2.07858109474
test_h0_row_norms_max: 5.89074993134
test_h0_row_norms_mean: 2.98990464211
test_h0_row_norms_min: 0.0139329638332
test_h1_col_norms_max: 6.00128126144
test_h1_col_norms_mean: 3.80823659897
test_h1_col_norms_min: 1.71664977074
test_h1_row_norms_max: 7.80959177017
test_h1_row_norms_mean: 5.40965270996
test_h1_row_norms_min: 2.98309516907
test_objective: 0.133454963565
test_y_col_norms_max: 2.09113478661
test_y_col_norms_mean: 1.89531803131
test_y_col_norms_min: 1.55502259731
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.972993254662
test_y_min_max_class: 0.555838704109
test_y_misclass: 0.03900000453
test_y_nll: 0.133454963565
test_y_row_norms_max: 0.505987465382
test_y_row_norms_mean: 0.174324646592
test_y_row_norms_min: 0.00215850048698
train_h0_col_norms_max: 6.23972511292
train_h0_col_norms_mean: 3.82828736305
train_h0_col_norms_min: 2.07858753204
train_h0_row_norms_max: 5.89076900482
train_h0_row_norms_mean: 2.98989081383
train_h0_row_norms_min: 0.0139330253005
train_h1_col_norms_max: 6.00125265121
train_h1_col_norms_mean: 3.80825352669
train_h1_col_norms_min: 1.7166570425
train_h1_row_norms_max: 7.80962467194
train_h1_row_norms_mean: 5.40965032578
train_h1_row_norms_min: 2.98309373856
train_objective: 0.0678227543831
train_y_col_norms_max: 2.09112644196
train_y_col_norms_mean: 1.8953114748
train_y_col_norms_min: 1.55502521992
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.976901352406
train_y_min_max_class: 0.541133284569
train_y_misclass: 0.0215600207448
train_y_nll: 0.0678227543831
train_y_row_norms_max: 0.505986630917
train_y_row_norms_mean: 0.174323886633
train_y_row_norms_min: 0.00215849909
valid_h0_col_norms_max: 6.23974847794
valid_h0_col_norms_mean: 3.82828760147
valid_h0_col_norms_min: 2.07858109474
valid_h0_row_norms_max: 5.89074993134
valid_h0_row_norms_mean: 2.98990464211
valid_h0_row_norms_min: 0.0139329638332
valid_h1_col_norms_max: 6.00128126144
valid_h1_col_norms_mean: 3.80823659897
valid_h1_col_norms_min: 1.71664977074
valid_h1_row_norms_max: 7.80959177017
valid_h1_row_norms_mean: 5.40965270996
valid_h1_row_norms_min: 2.98309516907
valid_objective: 0.14155356586
valid_y_col_norms_max: 2.09113478661
valid_y_col_norms_mean: 1.89531803131
valid_y_col_norms_min: 1.55502259731
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.975651443005
valid_y_min_max_class: 0.524011075497
valid_y_misclass: 0.0348999835551
valid_y_nll: 0.14155356586
valid_y_row_norms_max: 0.505987465382
valid_y_row_norms_mean: 0.174324646592
valid_y_row_norms_min: 0.00215850048698
Time this epoch: 3.242812 seconds
Monitoring step:
Epochs seen: 10
Batches seen: 5000
Examples seen: 500000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 6.33813095093
test_h0_col_norms_mean: 4.00221395493
test_h0_col_norms_min: 2.23122644424
test_h0_row_norms_max: 6.13888168335
test_h0_row_norms_mean: 3.13162064552
test_h0_row_norms_min: 0.0540144480765
test_h1_col_norms_max: 5.99460268021
test_h1_col_norms_mean: 3.81764769554
test_h1_col_norms_min: 1.72675585747
test_h1_row_norms_max: 7.80806827545
test_h1_row_norms_mean: 5.42556667328
test_h1_row_norms_min: 3.22008705139
test_objective: 0.242982923985
test_y_col_norms_max: 4.86701011658
test_y_col_norms_mean: 4.50406503677
test_y_col_norms_min: 3.79116678238
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.970284223557
test_y_min_max_class: 0.494895517826
test_y_misclass: 0.0614000074565
test_y_nll: 0.242982923985
test_y_row_norms_max: 1.25091540813
test_y_row_norms_mean: 0.422105878592
test_y_row_norms_min: 0.00902531389147
train_h0_col_norms_max: 6.33812093735
train_h0_col_norms_mean: 4.00221395493
train_h0_col_norms_min: 2.23123693466
train_h0_row_norms_max: 6.13886117935
train_h0_row_norms_mean: 3.13162612915
train_h0_row_norms_min: 0.0540147125721
train_h1_col_norms_max: 5.99457454681
train_h1_col_norms_mean: 3.81765389442
train_h1_col_norms_min: 1.726749897
train_h1_row_norms_max: 7.80803012848
train_h1_row_norms_mean: 5.42554092407
train_h1_row_norms_min: 3.2200820446
train_objective: 0.216101527214
train_y_col_norms_max: 4.86700248718
train_y_col_norms_mean: 4.50406646729
train_y_col_norms_min: 3.7911875248
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.971834897995
train_y_min_max_class: 0.494699120522
train_y_misclass: 0.0546000786126
train_y_nll: 0.216101527214
train_y_row_norms_max: 1.25092113018
train_y_row_norms_mean: 0.422105282545
train_y_row_norms_min: 0.00902529340237
valid_h0_col_norms_max: 6.33813095093
valid_h0_col_norms_mean: 4.00221395493
valid_h0_col_norms_min: 2.23122644424
valid_h0_row_norms_max: 6.13888168335
valid_h0_row_norms_mean: 3.13162064552
valid_h0_row_norms_min: 0.0540144480765
valid_h1_col_norms_max: 5.99460268021
valid_h1_col_norms_mean: 3.81764769554
valid_h1_col_norms_min: 1.72675585747
valid_h1_row_norms_max: 7.80806827545
valid_h1_row_norms_mean: 5.42556667328
valid_h1_row_norms_min: 3.22008705139
valid_objective: 0.262977838516
valid_y_col_norms_max: 4.86701011658
valid_y_col_norms_mean: 4.50406503677
valid_y_col_norms_min: 3.79116678238
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.972873926163
valid_y_min_max_class: 0.484322339296
valid_y_misclass: 0.0602999925613
valid_y_nll: 0.262977838516
valid_y_row_norms_max: 1.25091540813
valid_y_row_norms_mean: 0.422105878592
valid_y_row_norms_min: 0.00902531389147
Time this epoch: 3.246498 seconds
Monitoring step:
Epochs seen: 11
Batches seen: 5500
Examples seen: 550000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 6.34423732758
test_h0_col_norms_mean: 4.09757995605
test_h0_col_norms_min: 2.23610663414
test_h0_row_norms_max: 6.29168701172
test_h0_row_norms_mean: 3.20701622963
test_h0_row_norms_min: 0.0794842615724
test_h1_col_norms_max: 5.99344968796
test_h1_col_norms_mean: 3.83266830444
test_h1_col_norms_min: 1.72617077827
test_h1_row_norms_max: 7.81531667709
test_h1_row_norms_mean: 5.44732666016
test_h1_row_norms_min: 3.22785973549
test_objective: 0.149660229683
test_y_col_norms_max: 5.28322935104
test_y_col_norms_mean: 4.86907577515
test_y_col_norms_min: 4.24763870239
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.971002280712
test_y_min_max_class: 0.500847101212
test_y_misclass: 0.0448000095785
test_y_nll: 0.149660229683
test_y_row_norms_max: 1.53015840054
test_y_row_norms_mean: 0.458264380693
test_y_row_norms_min: 0.0079955086112
train_h0_col_norms_max: 6.34425830841
train_h0_col_norms_mean: 4.09757804871
train_h0_col_norms_min: 2.23611760139
train_h0_row_norms_max: 6.29168462753
train_h0_row_norms_mean: 3.20700359344
train_h0_row_norms_min: 0.0794841647148
train_h1_col_norms_max: 5.99343013763
train_h1_col_norms_mean: 3.83267450333
train_h1_col_norms_min: 1.72616374493
train_h1_row_norms_max: 7.81534910202
train_h1_row_norms_mean: 5.44732189178
train_h1_row_norms_min: 3.22785782814
train_objective: 0.115495532751
train_y_col_norms_max: 5.2832069397
train_y_col_norms_mean: 4.86906385422
train_y_col_norms_min: 4.24765825272
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.974365890026
train_y_min_max_class: 0.503006339073
train_y_misclass: 0.0362600125372
train_y_nll: 0.115495532751
train_y_row_norms_max: 1.53016579151
train_y_row_norms_mean: 0.458266496658
train_y_row_norms_min: 0.00799546111375
valid_h0_col_norms_max: 6.34423732758
valid_h0_col_norms_mean: 4.09757995605
valid_h0_col_norms_min: 2.23610663414
valid_h0_row_norms_max: 6.29168701172
valid_h0_row_norms_mean: 3.20701622963
valid_h0_row_norms_min: 0.0794842615724
valid_h1_col_norms_max: 5.99344968796
valid_h1_col_norms_mean: 3.83266830444
valid_h1_col_norms_min: 1.72617077827
valid_h1_row_norms_max: 7.81531667709
valid_h1_row_norms_mean: 5.44732666016
valid_h1_row_norms_min: 3.22785973549
valid_objective: 0.1691185534
valid_y_col_norms_max: 5.28322935104
valid_y_col_norms_mean: 4.86907577515
valid_y_col_norms_min: 4.24763870239
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.974966526031
valid_y_min_max_class: 0.529185950756
valid_y_misclass: 0.0438999943435
valid_y_nll: 0.1691185534
valid_y_row_norms_max: 1.53015840054
valid_y_row_norms_mean: 0.458264380693
valid_y_row_norms_min: 0.0079955086112
Time this epoch: 3.224365 seconds
Monitoring step:
Epochs seen: 12
Batches seen: 6000
Examples seen: 600000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 6.34843397141
test_h0_col_norms_mean: 4.13394451141
test_h0_col_norms_min: 2.23612523079
test_h0_row_norms_max: 6.36067008972
test_h0_row_norms_mean: 3.23545217514
test_h0_row_norms_min: 0.111102260649
test_h1_col_norms_max: 5.99360513687
test_h1_col_norms_mean: 3.8399875164
test_h1_col_norms_min: 1.72649633884
test_h1_row_norms_max: 7.9447259903
test_h1_row_norms_mean: 5.45721006393
test_h1_row_norms_min: 3.23267006874
test_objective: 0.138930052519
test_y_col_norms_max: 5.38853263855
test_y_col_norms_mean: 4.97749423981
test_y_col_norms_min: 4.37515115738
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.976369380951
test_y_min_max_class: 0.539442539215
test_y_misclass: 0.0395999997854
test_y_nll: 0.138930052519
test_y_row_norms_max: 1.5151270628
test_y_row_norms_mean: 0.468785196543
test_y_row_norms_min: 0.00989222805947
train_h0_col_norms_max: 6.34842920303
train_h0_col_norms_mean: 4.13394021988
train_h0_col_norms_min: 2.23612689972
train_h0_row_norms_max: 6.36069536209
train_h0_row_norms_mean: 3.23545718193
train_h0_row_norms_min: 0.111102797091
train_h1_col_norms_max: 5.99360513687
train_h1_col_norms_mean: 3.83997154236
train_h1_col_norms_min: 1.72650408745
train_h1_row_norms_max: 7.94476556778
train_h1_row_norms_mean: 5.45718336105
train_h1_row_norms_min: 3.23268294334
train_objective: 0.0762413665652
train_y_col_norms_max: 5.38851499557
train_y_col_norms_mean: 4.97747087479
train_y_col_norms_min: 4.37513685226
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.980703771114
train_y_min_max_class: 0.538486421108
train_y_misclass: 0.0236600115895
train_y_nll: 0.0762413665652
train_y_row_norms_max: 1.51513409615
train_y_row_norms_mean: 0.46878734231
train_y_row_norms_min: 0.00989221502095
valid_h0_col_norms_max: 6.34843397141
valid_h0_col_norms_mean: 4.13394451141
valid_h0_col_norms_min: 2.23612523079
valid_h0_row_norms_max: 6.36067008972
valid_h0_row_norms_mean: 3.23545217514
valid_h0_row_norms_min: 0.111102260649
valid_h1_col_norms_max: 5.99360513687
valid_h1_col_norms_mean: 3.8399875164
valid_h1_col_norms_min: 1.72649633884
valid_h1_row_norms_max: 7.9447259903
valid_h1_row_norms_mean: 5.45721006393
valid_h1_row_norms_min: 3.23267006874
valid_objective: 0.158047273755
valid_y_col_norms_max: 5.38853263855
valid_y_col_norms_mean: 4.97749423981
valid_y_col_norms_min: 4.37515115738
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.978621006012
valid_y_min_max_class: 0.533575236797
valid_y_misclass: 0.0357999950647
valid_y_nll: 0.158047273755
valid_y_row_norms_max: 1.5151270628
valid_y_row_norms_mean: 0.468785196543
valid_y_row_norms_min: 0.00989222805947
Time this epoch: 3.233253 seconds
Monitoring step:
Epochs seen: 13
Batches seen: 6500
Examples seen: 650000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 6.34697389603
test_h0_col_norms_mean: 4.15769052505
test_h0_col_norms_min: 2.23618888855
test_h0_row_norms_max: 6.40273475647
test_h0_row_norms_mean: 3.25405025482
test_h0_row_norms_min: 0.113349400461
test_h1_col_norms_max: 5.99226903915
test_h1_col_norms_mean: 3.84424233437
test_h1_col_norms_min: 1.7265651226
test_h1_row_norms_max: 8.25644397736
test_h1_row_norms_mean: 5.46302652359
test_h1_row_norms_min: 3.24811220169
test_objective: 0.126156955957
test_y_col_norms_max: 5.49813652039
test_y_col_norms_mean: 5.06592178345
test_y_col_norms_min: 4.50360441208
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.983133792877
test_y_min_max_class: 0.586351394653
test_y_misclass: 0.0298999845982
test_y_nll: 0.126156955957
test_y_row_norms_max: 1.57926058769
test_y_row_norms_mean: 0.477343022823
test_y_row_norms_min: 0.0155787682161
train_h0_col_norms_max: 6.34694576263
train_h0_col_norms_mean: 4.15768814087
train_h0_col_norms_min: 2.23619389534
train_h0_row_norms_max: 6.40273189545
train_h0_row_norms_mean: 3.25405526161
train_h0_row_norms_min: 0.113349400461
train_h1_col_norms_max: 5.99224901199
train_h1_col_norms_mean: 3.84425520897
train_h1_col_norms_min: 1.72656738758
train_h1_row_norms_max: 8.25643634796
train_h1_row_norms_mean: 5.46304035187
train_h1_row_norms_min: 3.24809789658
train_objective: 0.0474301576614
train_y_col_norms_max: 5.49815416336
train_y_col_norms_mean: 5.06591844559
train_y_col_norms_min: 4.5035943985
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.986293017864
train_y_min_max_class: 0.579336941242
train_y_misclass: 0.0156600344926
train_y_nll: 0.0474301576614
train_y_row_norms_max: 1.57926678658
train_y_row_norms_mean: 0.477343559265
train_y_row_norms_min: 0.0155787058175
valid_h0_col_norms_max: 6.34697389603
valid_h0_col_norms_mean: 4.15769052505
valid_h0_col_norms_min: 2.23618888855
valid_h0_row_norms_max: 6.40273475647
valid_h0_row_norms_mean: 3.25405025482
valid_h0_row_norms_min: 0.113349400461
valid_h1_col_norms_max: 5.99226903915
valid_h1_col_norms_mean: 3.84424233437
valid_h1_col_norms_min: 1.7265651226
valid_h1_row_norms_max: 8.25644397736
valid_h1_row_norms_mean: 5.46302652359
valid_h1_row_norms_min: 3.24811220169
valid_objective: 0.136303275824
valid_y_col_norms_max: 5.49813652039
valid_y_col_norms_mean: 5.06592178345
valid_y_col_norms_min: 4.50360441208
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.983997404575
valid_y_min_max_class: 0.568609714508
valid_y_misclass: 0.0302999857813
valid_y_nll: 0.136303275824
valid_y_row_norms_max: 1.57926058769
valid_y_row_norms_mean: 0.477343022823
valid_y_row_norms_min: 0.0155787682161
Time this epoch: 3.243910 seconds
Monitoring step:
Epochs seen: 14
Batches seen: 7000
Examples seen: 700000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 6.3465590477
test_h0_col_norms_mean: 4.17470979691
test_h0_col_norms_min: 2.23621320724
test_h0_row_norms_max: 6.44742536545
test_h0_row_norms_mean: 3.26748609543
test_h0_row_norms_min: 0.117137983441
test_h1_col_norms_max: 5.99374818802
test_h1_col_norms_mean: 3.84760499001
test_h1_col_norms_min: 1.7263559103
test_h1_row_norms_max: 8.39470767975
test_h1_row_norms_mean: 5.46778011322
test_h1_row_norms_min: 3.26342630386
test_objective: 0.107709117234
test_y_col_norms_max: 5.54377269745
test_y_col_norms_mean: 5.13376808167
test_y_col_norms_min: 4.58503246307
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.985660552979
test_y_min_max_class: 0.598270595074
test_y_misclass: 0.0245999917388
test_y_nll: 0.107709117234
test_y_row_norms_max: 1.54413878918
test_y_row_norms_mean: 0.484508126974
test_y_row_norms_min: 0.0144754517823
train_h0_col_norms_max: 6.346534729
train_h0_col_norms_mean: 4.17470979691
train_h0_col_norms_min: 2.23620676994
train_h0_row_norms_max: 6.44738912582
train_h0_row_norms_mean: 3.26749873161
train_h0_row_norms_min: 0.117138013244
train_h1_col_norms_max: 5.99376821518
train_h1_col_norms_mean: 3.84760093689
train_h1_col_norms_min: 1.72634637356
train_h1_row_norms_max: 8.39471530914
train_h1_row_norms_mean: 5.46780490875
train_h1_row_norms_min: 3.26344394684
train_objective: 0.0289139077067
train_y_col_norms_max: 5.54377365112
train_y_col_norms_mean: 5.13377904892
train_y_col_norms_min: 4.58502912521
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.990230798721
train_y_min_max_class: 0.636234402657
train_y_misclass: 0.0095800133422
train_y_nll: 0.0289139077067
train_y_row_norms_max: 1.54413354397
train_y_row_norms_mean: 0.484510302544
train_y_row_norms_min: 0.0144755160436
valid_h0_col_norms_max: 6.3465590477
valid_h0_col_norms_mean: 4.17470979691
valid_h0_col_norms_min: 2.23621320724
valid_h0_row_norms_max: 6.44742536545
valid_h0_row_norms_mean: 3.26748609543
valid_h0_row_norms_min: 0.117137983441
valid_h1_col_norms_max: 5.99374818802
valid_h1_col_norms_mean: 3.84760499001
valid_h1_col_norms_min: 1.7263559103
valid_h1_row_norms_max: 8.39470767975
valid_h1_row_norms_mean: 5.46778011322
valid_h1_row_norms_min: 3.26342630386
valid_objective: 0.118425898254
valid_y_col_norms_max: 5.54377269745
valid_y_col_norms_mean: 5.13376808167
valid_y_col_norms_min: 4.58503246307
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.987525939941
valid_y_min_max_class: 0.608628451824
valid_y_misclass: 0.0258999839425
valid_y_nll: 0.118425898254
valid_y_row_norms_max: 1.54413878918
valid_y_row_norms_mean: 0.484508126974
valid_y_row_norms_min: 0.0144754517823
Time this epoch: 3.231089 seconds
Monitoring step:
Epochs seen: 15
Batches seen: 7500
Examples seen: 750000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 6.34646034241
test_h0_col_norms_mean: 4.18901968002
test_h0_col_norms_min: 2.23616552353
test_h0_row_norms_max: 6.49371194839
test_h0_row_norms_mean: 3.27877855301
test_h0_row_norms_min: 0.122728899121
test_h1_col_norms_max: 5.9948592186
test_h1_col_norms_mean: 3.85039448738
test_h1_col_norms_min: 1.72630560398
test_h1_row_norms_max: 8.49246692657
test_h1_row_norms_mean: 5.47177028656
test_h1_row_norms_min: 3.27335119247
test_objective: 0.120883144438
test_y_col_norms_max: 5.61785268784
test_y_col_norms_mean: 5.21456623077
test_y_col_norms_min: 4.61228704453
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.985354423523
test_y_min_max_class: 0.593527436256
test_y_misclass: 0.0276999864727
test_y_nll: 0.120883144438
test_y_row_norms_max: 1.59560739994
test_y_row_norms_mean: 0.492057174444
test_y_row_norms_min: 0.0153611358255
train_h0_col_norms_max: 6.34646320343
train_h0_col_norms_mean: 4.18901586533
train_h0_col_norms_min: 2.23617053032
train_h0_row_norms_max: 6.49373817444
train_h0_row_norms_mean: 3.27876186371
train_h0_row_norms_min: 0.122729450464
train_h1_col_norms_max: 5.99485731125
train_h1_col_norms_mean: 3.8503715992
train_h1_col_norms_min: 1.72631311417
train_h1_row_norms_max: 8.49246883392
train_h1_row_norms_mean: 5.47177219391
train_h1_row_norms_min: 3.27336573601
train_objective: 0.0283282585442
train_y_col_norms_max: 5.61785554886
train_y_col_norms_mean: 5.21454381943
train_y_col_norms_min: 4.61229038239
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.990546524525
train_y_min_max_class: 0.649133205414
train_y_misclass: 0.00910001061857
train_y_nll: 0.0283282585442
train_y_row_norms_max: 1.59561276436
train_y_row_norms_mean: 0.492054820061
train_y_row_norms_min: 0.0153612047434
valid_h0_col_norms_max: 6.34646034241
valid_h0_col_norms_mean: 4.18901968002
valid_h0_col_norms_min: 2.23616552353
valid_h0_row_norms_max: 6.49371194839
valid_h0_row_norms_mean: 3.27877855301
valid_h0_row_norms_min: 0.122728899121
valid_h1_col_norms_max: 5.9948592186
valid_h1_col_norms_mean: 3.85039448738
valid_h1_col_norms_min: 1.72630560398
valid_h1_row_norms_max: 8.49246692657
valid_h1_row_norms_mean: 5.47177028656
valid_h1_row_norms_min: 3.27335119247
valid_objective: 0.126225486398
valid_y_col_norms_max: 5.61785268784
valid_y_col_norms_mean: 5.21456623077
valid_y_col_norms_min: 4.61228704453
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.987080872059
valid_y_min_max_class: 0.583371043205
valid_y_misclass: 0.0265999827534
valid_y_nll: 0.126225486398
valid_y_row_norms_max: 1.59560739994
valid_y_row_norms_mean: 0.492057174444
valid_y_row_norms_min: 0.0153611358255
Time this epoch: 3.249726 seconds
Monitoring step:
Epochs seen: 16
Batches seen: 8000
Examples seen: 800000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 6.34652328491
test_h0_col_norms_mean: 4.20311164856
test_h0_col_norms_min: 2.23617291451
test_h0_row_norms_max: 6.52294111252
test_h0_row_norms_mean: 3.28994369507
test_h0_row_norms_min: 0.123597666621
test_h1_col_norms_max: 5.99608755112
test_h1_col_norms_mean: 3.85341596603
test_h1_col_norms_min: 1.72634136677
test_h1_row_norms_max: 8.52042388916
test_h1_row_norms_mean: 5.47620201111
test_h1_row_norms_min: 3.27072739601
test_objective: 0.140668272972
test_y_col_norms_max: 5.69501256943
test_y_col_norms_mean: 5.31268548965
test_y_col_norms_min: 4.74868249893
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.989543557167
test_y_min_max_class: 0.626976370811
test_y_misclass: 0.0258999932557
test_y_nll: 0.140668272972
test_y_row_norms_max: 1.60322284698
test_y_row_norms_mean: 0.500980615616
test_y_row_norms_min: 0.0168750006706
train_h0_col_norms_max: 6.34652090073
train_h0_col_norms_mean: 4.20310306549
train_h0_col_norms_min: 2.23617100716
train_h0_row_norms_max: 6.52291107178
train_h0_row_norms_mean: 3.28993988037
train_h0_row_norms_min: 0.123597674072
train_h1_col_norms_max: 5.99606466293
train_h1_col_norms_mean: 3.85343289375
train_h1_col_norms_min: 1.72633349895
train_h1_row_norms_max: 8.52042198181
train_h1_row_norms_mean: 5.47621965408
train_h1_row_norms_min: 3.27073836327
train_objective: 0.0259083565325
train_y_col_norms_max: 5.69503641129
train_y_col_norms_mean: 5.31268262863
train_y_col_norms_min: 4.74867868423
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.993756473064
train_y_min_max_class: 0.699871182442
train_y_misclass: 0.0079400036484
train_y_nll: 0.0259083565325
train_y_row_norms_max: 1.6032307148
train_y_row_norms_mean: 0.500979840755
train_y_row_norms_min: 0.0168750379235
valid_h0_col_norms_max: 6.34652328491
valid_h0_col_norms_mean: 4.20311164856
valid_h0_col_norms_min: 2.23617291451
valid_h0_row_norms_max: 6.52294111252
valid_h0_row_norms_mean: 3.28994369507
valid_h0_row_norms_min: 0.123597666621
valid_h1_col_norms_max: 5.99608755112
valid_h1_col_norms_mean: 3.85341596603
valid_h1_col_norms_min: 1.72634136677
valid_h1_row_norms_max: 8.52042388916
valid_h1_row_norms_mean: 5.47620201111
valid_h1_row_norms_min: 3.27072739601
valid_objective: 0.140435069799
valid_y_col_norms_max: 5.69501256943
valid_y_col_norms_mean: 5.31268548965
valid_y_col_norms_min: 4.74868249893
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.990495383739
valid_y_min_max_class: 0.633842229843
valid_y_misclass: 0.0265999827534
valid_y_nll: 0.140435069799
valid_y_row_norms_max: 1.60322284698
valid_y_row_norms_mean: 0.500980615616
valid_y_row_norms_min: 0.0168750006706
Time this epoch: 3.211907 seconds
Monitoring step:
Epochs seen: 17
Batches seen: 8500
Examples seen: 850000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 6.34764194489
test_h0_col_norms_mean: 4.21877479553
test_h0_col_norms_min: 2.23619961739
test_h0_row_norms_max: 6.5714468956
test_h0_row_norms_mean: 3.30228757858
test_h0_row_norms_min: 0.13643656671
test_h1_col_norms_max: 5.99594020844
test_h1_col_norms_mean: 3.85699319839
test_h1_col_norms_min: 1.72638630867
test_h1_row_norms_max: 8.61135101318
test_h1_row_norms_mean: 5.48117828369
test_h1_row_norms_min: 3.27077460289
test_objective: 0.152983635664
test_y_col_norms_max: 5.81860494614
test_y_col_norms_mean: 5.40938711166
test_y_col_norms_min: 4.81085681915
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.990412473679
test_y_min_max_class: 0.641472399235
test_y_misclass: 0.0277999881655
test_y_nll: 0.152983635664
test_y_row_norms_max: 1.66027259827
test_y_row_norms_mean: 0.509944438934
test_y_row_norms_min: 0.0174780637026
train_h0_col_norms_max: 6.3476524353
train_h0_col_norms_mean: 4.21879482269
train_h0_col_norms_min: 2.23619699478
train_h0_row_norms_max: 6.57147264481
train_h0_row_norms_mean: 3.30230164528
train_h0_row_norms_min: 0.136435881257
train_h1_col_norms_max: 5.9959692955
train_h1_col_norms_mean: 3.85701036453
train_h1_col_norms_min: 1.72638809681
train_h1_row_norms_max: 8.61137866974
train_h1_row_norms_mean: 5.48117685318
train_h1_row_norms_min: 3.27077269554
train_objective: 0.0280419886112
train_y_col_norms_max: 5.81860494614
train_y_col_norms_mean: 5.40940761566
train_y_col_norms_min: 4.81083345413
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.993593096733
train_y_min_max_class: 0.69740664959
train_y_misclass: 0.00842001195997
train_y_nll: 0.0280419886112
train_y_row_norms_max: 1.66027379036
train_y_row_norms_mean: 0.509946644306
train_y_row_norms_min: 0.0174781102687
valid_h0_col_norms_max: 6.34764194489
valid_h0_col_norms_mean: 4.21877479553
valid_h0_col_norms_min: 2.23619961739
valid_h0_row_norms_max: 6.5714468956
valid_h0_row_norms_mean: 3.30228757858
valid_h0_row_norms_min: 0.13643656671
valid_h1_col_norms_max: 5.99594020844
valid_h1_col_norms_mean: 3.85699319839
valid_h1_col_norms_min: 1.72638630867
valid_h1_row_norms_max: 8.61135101318
valid_h1_row_norms_mean: 5.48117828369
valid_h1_row_norms_min: 3.27077460289
valid_objective: 0.156515717506
valid_y_col_norms_max: 5.81860494614
valid_y_col_norms_mean: 5.40938711166
valid_y_col_norms_min: 4.81085681915
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.991046726704
valid_y_min_max_class: 0.649928092957
valid_y_misclass: 0.0286999810487
valid_y_nll: 0.156515717506
valid_y_row_norms_max: 1.66027259827
valid_y_row_norms_mean: 0.509944438934
valid_y_row_norms_min: 0.0174780637026
Time this epoch: 3.213883 seconds
Monitoring step:
Epochs seen: 18
Batches seen: 9000
Examples seen: 900000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 6.34813308716
test_h0_col_norms_mean: 4.2329621315
test_h0_col_norms_min: 2.23619866371
test_h0_row_norms_max: 6.60563611984
test_h0_row_norms_mean: 3.31354284286
test_h0_row_norms_min: 0.142215177417
test_h1_col_norms_max: 5.99625921249
test_h1_col_norms_mean: 3.86032938957
test_h1_col_norms_min: 1.72629284859
test_h1_row_norms_max: 8.70863246918
test_h1_row_norms_mean: 5.48595952988
test_h1_row_norms_min: 3.27107739449
test_objective: 0.1266990453
test_y_col_norms_max: 5.94182395935
test_y_col_norms_mean: 5.48706197739
test_y_col_norms_min: 4.85955810547
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.991642594337
test_y_min_max_class: 0.680797755718
test_y_misclass: 0.0225999932736
test_y_nll: 0.1266990453
test_y_row_norms_max: 1.65575671196
test_y_row_norms_mean: 0.517508506775
test_y_row_norms_min: 0.0219007991254
train_h0_col_norms_max: 6.34813261032
train_h0_col_norms_mean: 4.23297452927
train_h0_col_norms_min: 2.23619627953
train_h0_row_norms_max: 6.6056265831
train_h0_row_norms_mean: 3.31354165077
train_h0_row_norms_min: 0.142215907574
train_h1_col_norms_max: 5.99623250961
train_h1_col_norms_mean: 3.86034679413
train_h1_col_norms_min: 1.72628378868
train_h1_row_norms_max: 8.70865249634
train_h1_row_norms_mean: 5.48598957062
train_h1_row_norms_min: 3.27109384537
train_objective: 0.0143134472892
train_y_col_norms_max: 5.94185161591
train_y_col_norms_mean: 5.48704767227
train_y_col_norms_min: 4.85954427719
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.996097743511
train_y_min_max_class: 0.768372476101
train_y_misclass: 0.00466000149027
train_y_nll: 0.0143134472892
train_y_row_norms_max: 1.6557571888
train_y_row_norms_mean: 0.517506301403
train_y_row_norms_min: 0.0219009146094
valid_h0_col_norms_max: 6.34813308716
valid_h0_col_norms_mean: 4.2329621315
valid_h0_col_norms_min: 2.23619866371
valid_h0_row_norms_max: 6.60563611984
valid_h0_row_norms_mean: 3.31354284286
valid_h0_row_norms_min: 0.142215177417
valid_h1_col_norms_max: 5.99625921249
valid_h1_col_norms_mean: 3.86032938957
valid_h1_col_norms_min: 1.72629284859
valid_h1_row_norms_max: 8.70863246918
valid_h1_row_norms_mean: 5.48595952988
valid_h1_row_norms_min: 3.27107739449
valid_objective: 0.158007115126
valid_y_col_norms_max: 5.94182395935
valid_y_col_norms_mean: 5.48706197739
valid_y_col_norms_min: 4.85955810547
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.99270170927
valid_y_min_max_class: 0.685421526432
valid_y_misclass: 0.0256999861449
valid_y_nll: 0.158007115126
valid_y_row_norms_max: 1.65575671196
valid_y_row_norms_mean: 0.517508506775
valid_y_row_norms_min: 0.0219007991254
Time this epoch: 3.216884 seconds
Monitoring step:
Epochs seen: 19
Batches seen: 9500
Examples seen: 950000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 6.35075473785
test_h0_col_norms_mean: 4.24279737473
test_h0_col_norms_min: 2.23619127274
test_h0_row_norms_max: 6.61569023132
test_h0_row_norms_mean: 3.32117795944
test_h0_row_norms_min: 0.160097524524
test_h1_col_norms_max: 5.99536848068
test_h1_col_norms_mean: 3.86252450943
test_h1_col_norms_min: 1.72685301304
test_h1_row_norms_max: 8.74706554413
test_h1_row_norms_mean: 5.48911523819
test_h1_row_norms_min: 3.27158546448
test_objective: 0.128275766969
test_y_col_norms_max: 6.00630426407
test_y_col_norms_mean: 5.54901790619
test_y_col_norms_min: 4.95159053802
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.992789447308
test_y_min_max_class: 0.689596951008
test_y_misclass: 0.0208999924362
test_y_nll: 0.128275766969
test_y_row_norms_max: 1.56810212135
test_y_row_norms_mean: 0.523332059383
test_y_row_norms_min: 0.0221748072654
train_h0_col_norms_max: 6.35075521469
train_h0_col_norms_mean: 4.24279022217
train_h0_col_norms_min: 2.23619437218
train_h0_row_norms_max: 6.61565685272
train_h0_row_norms_mean: 3.32117271423
train_h0_row_norms_min: 0.160097926855
train_h1_col_norms_max: 5.99534845352
train_h1_col_norms_mean: 3.86250782013
train_h1_col_norms_min: 1.7268614769
train_h1_row_norms_max: 8.74709033966
train_h1_row_norms_mean: 5.48911237717
train_h1_row_norms_min: 3.27158045769
train_objective: 0.0107667120174
train_y_col_norms_max: 6.00630140305
train_y_col_norms_mean: 5.54901885986
train_y_col_norms_min: 4.95157289505
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.996949017048
train_y_min_max_class: 0.813183248043
train_y_misclass: 0.00347999692895
train_y_nll: 0.0107667120174
train_y_row_norms_max: 1.56809437275
train_y_row_norms_mean: 0.523329675198
train_y_row_norms_min: 0.0221747960895
valid_h0_col_norms_max: 6.35075473785
valid_h0_col_norms_mean: 4.24279737473
valid_h0_col_norms_min: 2.23619127274
valid_h0_row_norms_max: 6.61569023132
valid_h0_row_norms_mean: 3.32117795944
valid_h0_row_norms_min: 0.160097524524
valid_h1_col_norms_max: 5.99536848068
valid_h1_col_norms_mean: 3.86252450943
valid_h1_col_norms_min: 1.72685301304
valid_h1_row_norms_max: 8.74706554413
valid_h1_row_norms_mean: 5.48911523819
valid_h1_row_norms_min: 3.27158546448
valid_objective: 0.152880609035
valid_y_col_norms_max: 6.00630426407
valid_y_col_norms_mean: 5.54901790619
valid_y_col_norms_min: 4.95159053802
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.992956161499
valid_y_min_max_class: 0.687586247921
valid_y_misclass: 0.0238999892026
valid_y_nll: 0.152880609035
valid_y_row_norms_max: 1.56810212135
valid_y_row_norms_mean: 0.523332059383
valid_y_row_norms_min: 0.0221748072654
Time this epoch: 3.381361 seconds
Monitoring step:
Epochs seen: 20
Batches seen: 10000
Examples seen: 1000000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 6.36764955521
test_h0_col_norms_mean: 4.25145339966
test_h0_col_norms_min: 2.23608016968
test_h0_row_norms_max: 6.65068340302
test_h0_row_norms_mean: 3.32794356346
test_h0_row_norms_min: 0.160930916667
test_h1_col_norms_max: 5.99686193466
test_h1_col_norms_mean: 3.86456871033
test_h1_col_norms_min: 1.72680532932
test_h1_row_norms_max: 8.77167224884
test_h1_row_norms_mean: 5.49206733704
test_h1_row_norms_min: 3.27174091339
test_objective: 0.135456323624
test_y_col_norms_max: 6.06686162949
test_y_col_norms_mean: 5.60846662521
test_y_col_norms_min: 5.02197170258
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.992590069771
test_y_min_max_class: 0.687869131565
test_y_misclass: 0.0230999924242
test_y_nll: 0.135456323624
test_y_row_norms_max: 1.63880228996
test_y_row_norms_mean: 0.528553962708
test_y_row_norms_min: 0.0211624447256
train_h0_col_norms_max: 6.36767864227
train_h0_col_norms_mean: 4.25147294998
train_h0_col_norms_min: 2.23607754707
train_h0_row_norms_max: 6.65068531036
train_h0_row_norms_mean: 3.32795858383
train_h0_row_norms_min: 0.160931810737
train_h1_col_norms_max: 5.9968791008
train_h1_col_norms_mean: 3.86455130577
train_h1_col_norms_min: 1.72680592537
train_h1_row_norms_max: 8.77168178558
train_h1_row_norms_mean: 5.49205350876
train_h1_row_norms_min: 3.27172803879
train_objective: 0.0139410560951
train_y_col_norms_max: 6.06685829163
train_y_col_norms_mean: 5.60847902298
train_y_col_norms_min: 5.02197313309
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.997198402882
train_y_min_max_class: 0.82025551796
train_y_misclass: 0.00411999737844
train_y_nll: 0.0139410560951
train_y_row_norms_max: 1.63881158829
train_y_row_norms_mean: 0.528555572033
train_y_row_norms_min: 0.0211624447256
valid_h0_col_norms_max: 6.36764955521
valid_h0_col_norms_mean: 4.25145339966
valid_h0_col_norms_min: 2.23608016968
valid_h0_row_norms_max: 6.65068340302
valid_h0_row_norms_mean: 3.32794356346
valid_h0_row_norms_min: 0.160930916667
valid_h1_col_norms_max: 5.99686193466
valid_h1_col_norms_mean: 3.86456871033
valid_h1_col_norms_min: 1.72680532932
valid_h1_row_norms_max: 8.77167224884
valid_h1_row_norms_mean: 5.49206733704
valid_h1_row_norms_min: 3.27174091339
valid_objective: 0.154028758407
valid_y_col_norms_max: 6.06686162949
valid_y_col_norms_mean: 5.60846662521
valid_y_col_norms_min: 5.02197170258
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.993701696396
valid_y_min_max_class: 0.705734312534
valid_y_misclass: 0.0234999880195
valid_y_nll: 0.154028758407
valid_y_row_norms_max: 1.63880228996
valid_y_row_norms_mean: 0.528553962708
valid_y_row_norms_min: 0.0211624447256
Time this epoch: 3.224501 seconds
Monitoring step:
Epochs seen: 21
Batches seen: 10500
Examples seen: 1050000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 6.36559724808
test_h0_col_norms_mean: 4.25865936279
test_h0_col_norms_min: 2.23606491089
test_h0_row_norms_max: 6.65287876129
test_h0_row_norms_mean: 3.33374094963
test_h0_row_norms_min: 0.160923495889
test_h1_col_norms_max: 5.9981341362
test_h1_col_norms_mean: 3.866314888
test_h1_col_norms_min: 1.72683930397
test_h1_row_norms_max: 8.78785800934
test_h1_row_norms_mean: 5.49455070496
test_h1_row_norms_min: 3.27166962624
test_objective: 0.132553175092
test_y_col_norms_max: 6.10146903992
test_y_col_norms_mean: 5.65123224258
test_y_col_norms_min: 5.06749105453
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.992962539196
test_y_min_max_class: 0.700105249882
test_y_misclass: 0.0216999929398
test_y_nll: 0.132553175092
test_y_row_norms_max: 1.6686950922
test_y_row_norms_mean: 0.532644450665
test_y_row_norms_min: 0.0201863590628
train_h0_col_norms_max: 6.36559391022
train_h0_col_norms_mean: 4.25863981247
train_h0_col_norms_min: 2.23606181145
train_h0_row_norms_max: 6.6528468132
train_h0_row_norms_mean: 3.33372306824
train_h0_row_norms_min: 0.160924375057
train_h1_col_norms_max: 5.9981341362
train_h1_col_norms_mean: 3.86631464958
train_h1_col_norms_min: 1.7268487215
train_h1_row_norms_max: 8.78784656525
train_h1_row_norms_mean: 5.494576931
train_h1_row_norms_min: 3.27167153358
train_objective: 0.00576127693057
train_y_col_norms_max: 6.10144424438
train_y_col_norms_mean: 5.65123510361
train_y_col_norms_min: 5.06749773026
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.998003363609
train_y_min_max_class: 0.865698575974
train_y_misclass: 0.00196000072174
train_y_nll: 0.00576127693057
train_y_row_norms_max: 1.66869258881
train_y_row_norms_mean: 0.532643556595
train_y_row_norms_min: 0.0201863981783
valid_h0_col_norms_max: 6.36559724808
valid_h0_col_norms_mean: 4.25865936279
valid_h0_col_norms_min: 2.23606491089
valid_h0_row_norms_max: 6.65287876129
valid_h0_row_norms_mean: 3.33374094963
valid_h0_row_norms_min: 0.160923495889
valid_h1_col_norms_max: 5.9981341362
valid_h1_col_norms_mean: 3.866314888
valid_h1_col_norms_min: 1.72683930397
valid_h1_row_norms_max: 8.78785800934
valid_h1_row_norms_mean: 5.49455070496
valid_h1_row_norms_min: 3.27166962624
valid_objective: 0.149952054024
valid_y_col_norms_max: 6.10146903992
valid_y_col_norms_mean: 5.65123224258
valid_y_col_norms_min: 5.06749105453
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.993861615658
valid_y_min_max_class: 0.696114599705
valid_y_misclass: 0.0218999926001
valid_y_nll: 0.149952054024
valid_y_row_norms_max: 1.6686950922
valid_y_row_norms_mean: 0.532644450665
valid_y_row_norms_min: 0.0201863590628
Time this epoch: 3.191485 seconds
Monitoring step:
Epochs seen: 22
Batches seen: 11000
Examples seen: 1100000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 6.37540435791
test_h0_col_norms_mean: 4.26554250717
test_h0_col_norms_min: 2.23606491089
test_h0_row_norms_max: 6.68969488144
test_h0_row_norms_mean: 3.33926701546
test_h0_row_norms_min: 0.15927760303
test_h1_col_norms_max: 5.99918460846
test_h1_col_norms_mean: 3.86793661118
test_h1_col_norms_min: 1.72684121132
test_h1_row_norms_max: 8.80519104004
test_h1_row_norms_mean: 5.49690055847
test_h1_row_norms_min: 3.27168059349
test_objective: 0.129877910018
test_y_col_norms_max: 6.1563615799
test_y_col_norms_mean: 5.69634532928
test_y_col_norms_min: 5.04322528839
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.993051469326
test_y_min_max_class: 0.701347351074
test_y_misclass: 0.0222999919206
test_y_nll: 0.129877910018
test_y_row_norms_max: 1.71757590771
test_y_row_norms_mean: 0.536909937859
test_y_row_norms_min: 0.019919058308
train_h0_col_norms_max: 6.37537336349
train_h0_col_norms_mean: 4.26554632187
train_h0_col_norms_min: 2.23606181145
train_h0_row_norms_max: 6.68972921371
train_h0_row_norms_mean: 3.33928227425
train_h0_row_norms_min: 0.159278333187
train_h1_col_norms_max: 5.99916362762
train_h1_col_norms_mean: 3.86795496941
train_h1_col_norms_min: 1.72684931755
train_h1_row_norms_max: 8.80523300171
train_h1_row_norms_mean: 5.4969124794
train_h1_row_norms_min: 3.27169203758
train_objective: 0.00547823868692
train_y_col_norms_max: 6.15638256073
train_y_col_norms_mean: 5.69631719589
train_y_col_norms_min: 5.04325008392
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.998302519321
train_y_min_max_class: 0.880661785603
train_y_misclass: 0.00176000059582
train_y_nll: 0.00547823868692
train_y_row_norms_max: 1.71756851673
train_y_row_norms_mean: 0.53690803051
train_y_row_norms_min: 0.0199191085994
valid_h0_col_norms_max: 6.37540435791
valid_h0_col_norms_mean: 4.26554250717
valid_h0_col_norms_min: 2.23606491089
valid_h0_row_norms_max: 6.68969488144
valid_h0_row_norms_mean: 3.33926701546
valid_h0_row_norms_min: 0.15927760303
valid_h1_col_norms_max: 5.99918460846
valid_h1_col_norms_mean: 3.86793661118
valid_h1_col_norms_min: 1.72684121132
valid_h1_row_norms_max: 8.80519104004
valid_h1_row_norms_mean: 5.49690055847
valid_h1_row_norms_min: 3.27168059349
valid_objective: 0.151706501842
valid_y_col_norms_max: 6.1563615799
valid_y_col_norms_mean: 5.69634532928
valid_y_col_norms_min: 5.04322528839
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.99382263422
valid_y_min_max_class: 0.683702290058
valid_y_misclass: 0.0223999917507
valid_y_nll: 0.151706501842
valid_y_row_norms_max: 1.71757590771
valid_y_row_norms_mean: 0.536909937859
valid_y_row_norms_min: 0.019919058308
Time this epoch: 3.206554 seconds
Monitoring step:
Epochs seen: 23
Batches seen: 11500
Examples seen: 1150000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 6.38593149185
test_h0_col_norms_mean: 4.27048826218
test_h0_col_norms_min: 2.23606491089
test_h0_row_norms_max: 6.67957162857
test_h0_row_norms_mean: 3.34312343597
test_h0_row_norms_min: 0.159357041121
test_h1_col_norms_max: 5.99570322037
test_h1_col_norms_mean: 3.8691701889
test_h1_col_norms_min: 1.72683918476
test_h1_row_norms_max: 8.81508731842
test_h1_row_norms_mean: 5.49859952927
test_h1_row_norms_min: 3.27181625366
test_objective: 0.123887695372
test_y_col_norms_max: 6.22244215012
test_y_col_norms_mean: 5.73378896713
test_y_col_norms_min: 5.06025886536
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.993882775307
test_y_min_max_class: 0.726153492928
test_y_misclass: 0.0201999936253
test_y_nll: 0.123887695372
test_y_row_norms_max: 1.69931674004
test_y_row_norms_mean: 0.540387809277
test_y_row_norms_min: 0.020066447556
train_h0_col_norms_max: 6.38596725464
train_h0_col_norms_mean: 4.27046966553
train_h0_col_norms_min: 2.23606181145
train_h0_row_norms_max: 6.67954874039
train_h0_row_norms_mean: 3.34311199188
train_h0_row_norms_min: 0.159357577562
train_h1_col_norms_max: 5.9957318306
train_h1_col_norms_mean: 3.86917424202
train_h1_col_norms_min: 1.72684860229
train_h1_row_norms_max: 8.81507587433
train_h1_row_norms_mean: 5.49862718582
train_h1_row_norms_min: 3.27181768417
train_objective: 0.00308265769854
train_y_col_norms_max: 6.22247123718
train_y_col_norms_mean: 5.73379087448
train_y_col_norms_min: 5.06026697159
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.998538374901
train_y_min_max_class: 0.894672214985
train_y_misclass: 0.000919999612961
train_y_nll: 0.00308265769854
train_y_row_norms_max: 1.69932484627
train_y_row_norms_mean: 0.540388822556
train_y_row_norms_min: 0.0200663488358
valid_h0_col_norms_max: 6.38593149185
valid_h0_col_norms_mean: 4.27048826218
valid_h0_col_norms_min: 2.23606491089
valid_h0_row_norms_max: 6.67957162857
valid_h0_row_norms_mean: 3.34312343597
valid_h0_row_norms_min: 0.159357041121
valid_h1_col_norms_max: 5.99570322037
valid_h1_col_norms_mean: 3.8691701889
valid_h1_col_norms_min: 1.72683918476
valid_h1_row_norms_max: 8.81508731842
valid_h1_row_norms_mean: 5.49859952927
valid_h1_row_norms_min: 3.27181625366
valid_objective: 0.14809820056
valid_y_col_norms_max: 6.22244215012
valid_y_col_norms_mean: 5.73378896713
valid_y_col_norms_min: 5.06025886536
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.993686497211
valid_y_min_max_class: 0.684677302837
valid_y_misclass: 0.0215999912471
valid_y_nll: 0.14809820056
valid_y_row_norms_max: 1.69931674004
valid_y_row_norms_mean: 0.540387809277
valid_y_row_norms_min: 0.020066447556
Time this epoch: 3.230241 seconds
Monitoring step:
Epochs seen: 24
Batches seen: 12000
Examples seen: 1200000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 6.392578125
test_h0_col_norms_mean: 4.27436256409
test_h0_col_norms_min: 2.23606491089
test_h0_row_norms_max: 6.68859195709
test_h0_row_norms_mean: 3.34622907639
test_h0_row_norms_min: 0.159570723772
test_h1_col_norms_max: 5.99895811081
test_h1_col_norms_mean: 3.87013435364
test_h1_col_norms_min: 1.72682142258
test_h1_row_norms_max: 8.82981967926
test_h1_row_norms_mean: 5.49993467331
test_h1_row_norms_min: 3.27214646339
test_objective: 0.123282536864
test_y_col_norms_max: 6.26617622375
test_y_col_norms_mean: 5.76239967346
test_y_col_norms_min: 5.08875703812
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.994056642056
test_y_min_max_class: 0.738864719868
test_y_misclass: 0.0195999927819
test_y_nll: 0.123282536864
test_y_row_norms_max: 1.71165382862
test_y_row_norms_mean: 0.542974531651
test_y_row_norms_min: 0.0200950335711
train_h0_col_norms_max: 6.39255237579
train_h0_col_norms_mean: 4.27436685562
train_h0_col_norms_min: 2.23606181145
train_h0_row_norms_max: 6.68859434128
train_h0_row_norms_mean: 3.34621357918
train_h0_row_norms_min: 0.159569814801
train_h1_col_norms_max: 5.99892854691
train_h1_col_norms_mean: 3.8701300621
train_h1_col_norms_min: 1.72681927681
train_h1_row_norms_max: 8.82980918884
train_h1_row_norms_mean: 5.49992132187
train_h1_row_norms_min: 3.27214837074
train_objective: 0.00190907681827
train_y_col_norms_max: 6.26617431641
train_y_col_norms_mean: 5.76240110397
train_y_col_norms_min: 5.08878278732
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.998930156231
train_y_min_max_class: 0.918284237385
train_y_misclass: 0.000559999898542
train_y_nll: 0.00190907681827
train_y_row_norms_max: 1.71166217327
train_y_row_norms_mean: 0.542971789837
train_y_row_norms_min: 0.0200951248407
valid_h0_col_norms_max: 6.392578125
valid_h0_col_norms_mean: 4.27436256409
valid_h0_col_norms_min: 2.23606491089
valid_h0_row_norms_max: 6.68859195709
valid_h0_row_norms_mean: 3.34622907639
valid_h0_row_norms_min: 0.159570723772
valid_h1_col_norms_max: 5.99895811081
valid_h1_col_norms_mean: 3.87013435364
valid_h1_col_norms_min: 1.72682142258
valid_h1_row_norms_max: 8.82981967926
valid_h1_row_norms_mean: 5.49993467331
valid_h1_row_norms_min: 3.27214646339
valid_objective: 0.146879151464
valid_y_col_norms_max: 6.26617622375
valid_y_col_norms_mean: 5.76239967346
valid_y_col_norms_min: 5.08875703812
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.994406104088
valid_y_min_max_class: 0.706291854382
valid_y_misclass: 0.0211999956518
valid_y_nll: 0.146879151464
valid_y_row_norms_max: 1.71165382862
valid_y_row_norms_mean: 0.542974531651
valid_y_row_norms_min: 0.0200950335711
Time this epoch: 3.222738 seconds
Monitoring step:
Epochs seen: 25
Batches seen: 12500
Examples seen: 1250000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 6.4013338089
test_h0_col_norms_mean: 4.27870225906
test_h0_col_norms_min: 2.2360560894
test_h0_row_norms_max: 6.69665718079
test_h0_row_norms_mean: 3.34976291656
test_h0_row_norms_min: 0.160002231598
test_h1_col_norms_max: 6.00171422958
test_h1_col_norms_mean: 3.87124419212
test_h1_col_norms_min: 1.72680687904
test_h1_row_norms_max: 8.85285282135
test_h1_row_norms_mean: 5.50152254105
test_h1_row_norms_min: 3.27291631699
test_objective: 0.121946468949
test_y_col_norms_max: 6.27880191803
test_y_col_norms_mean: 5.80026340485
test_y_col_norms_min: 5.12123060226
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.993581533432
test_y_min_max_class: 0.695117354393
test_y_misclass: 0.0199999921024
test_y_nll: 0.121946468949
test_y_row_norms_max: 1.76450884342
test_y_row_norms_mean: 0.546270668507
test_y_row_norms_min: 0.0209660548717
train_h0_col_norms_max: 6.40130519867
train_h0_col_norms_mean: 4.2786822319
train_h0_col_norms_min: 2.23605871201
train_h0_row_norms_max: 6.69668722153
train_h0_row_norms_mean: 3.34977436066
train_h0_row_norms_min: 0.160001769662
train_h1_col_norms_max: 6.00171136856
train_h1_col_norms_mean: 3.87122607231
train_h1_col_norms_min: 1.726806283
train_h1_row_norms_max: 8.85290527344
train_h1_row_norms_mean: 5.5015130043
train_h1_row_norms_min: 3.27290010452
train_objective: 0.0036760433577
train_y_col_norms_max: 6.27877187729
train_y_col_norms_mean: 5.80025196075
train_y_col_norms_min: 5.12123060226
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.998859524727
train_y_min_max_class: 0.912291646004
train_y_misclass: 0.00121999997646
train_y_nll: 0.0036760433577
train_y_row_norms_max: 1.76451909542
train_y_row_norms_mean: 0.546273350716
train_y_row_norms_min: 0.0209659561515
valid_h0_col_norms_max: 6.4013338089
valid_h0_col_norms_mean: 4.27870225906
valid_h0_col_norms_min: 2.2360560894
valid_h0_row_norms_max: 6.69665718079
valid_h0_row_norms_mean: 3.34976291656
valid_h0_row_norms_min: 0.160002231598
valid_h1_col_norms_max: 6.00171422958
valid_h1_col_norms_mean: 3.87124419212
valid_h1_col_norms_min: 1.72680687904
valid_h1_row_norms_max: 8.85285282135
valid_h1_row_norms_mean: 5.50152254105
valid_h1_row_norms_min: 3.27291631699
valid_objective: 0.137758076191
valid_y_col_norms_max: 6.27880191803
valid_y_col_norms_mean: 5.80026340485
valid_y_col_norms_min: 5.12123060226
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.994390308857
valid_y_min_max_class: 0.728678107262
valid_y_misclass: 0.019999993965
valid_y_nll: 0.137758076191
valid_y_row_norms_max: 1.76450884342
valid_y_row_norms_mean: 0.546270668507
valid_y_row_norms_min: 0.0209660548717
Time this epoch: 3.272793 seconds
Monitoring step:
Epochs seen: 26
Batches seen: 13000
Examples seen: 1300000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 6.4121389389
test_h0_col_norms_mean: 4.28374528885
test_h0_col_norms_min: 2.2360560894
test_h0_row_norms_max: 6.71324443817
test_h0_row_norms_mean: 3.35392951965
test_h0_row_norms_min: 0.1600792557
test_h1_col_norms_max: 6.00099658966
test_h1_col_norms_mean: 3.87249565125
test_h1_col_norms_min: 1.72674298286
test_h1_row_norms_max: 8.85911655426
test_h1_row_norms_mean: 5.50325918198
test_h1_row_norms_min: 3.27451777458
test_objective: 0.148935392499
test_y_col_norms_max: 6.33092308044
test_y_col_norms_mean: 5.83676052094
test_y_col_norms_min: 5.21046447754
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.993810713291
test_y_min_max_class: 0.718041598797
test_y_misclass: 0.0221999920905
test_y_nll: 0.148935392499
test_y_row_norms_max: 1.7590252161
test_y_row_norms_mean: 0.54948079586
test_y_row_norms_min: 0.020847639069
train_h0_col_norms_max: 6.41210317612
train_h0_col_norms_mean: 4.2837562561
train_h0_col_norms_min: 2.23605871201
train_h0_row_norms_max: 6.71320962906
train_h0_row_norms_mean: 3.35394501686
train_h0_row_norms_min: 0.16007861495
train_h1_col_norms_max: 6.00099611282
train_h1_col_norms_mean: 3.87251186371
train_h1_col_norms_min: 1.72674548626
train_h1_row_norms_max: 8.85914611816
train_h1_row_norms_mean: 5.50324678421
train_h1_row_norms_min: 3.27453041077
train_objective: 0.00680599268526
train_y_col_norms_max: 6.33095264435
train_y_col_norms_mean: 5.83674097061
train_y_col_norms_min: 5.21046924591
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.99815505743
train_y_min_max_class: 0.87034368515
train_y_misclass: 0.00213999999687
train_y_nll: 0.00680599268526
train_y_row_norms_max: 1.75903534889
train_y_row_norms_mean: 0.549481749535
train_y_row_norms_min: 0.0208476502448
valid_h0_col_norms_max: 6.4121389389
valid_h0_col_norms_mean: 4.28374528885
valid_h0_col_norms_min: 2.2360560894
valid_h0_row_norms_max: 6.71324443817
valid_h0_row_norms_mean: 3.35392951965
valid_h0_row_norms_min: 0.1600792557
valid_h1_col_norms_max: 6.00099658966
valid_h1_col_norms_mean: 3.87249565125
valid_h1_col_norms_min: 1.72674298286
valid_h1_row_norms_max: 8.85911655426
valid_h1_row_norms_mean: 5.50325918198
valid_h1_row_norms_min: 3.27451777458
valid_objective: 0.157335549593
valid_y_col_norms_max: 6.33092308044
valid_y_col_norms_mean: 5.83676052094
valid_y_col_norms_min: 5.21046447754
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.994030356407
valid_y_min_max_class: 0.726344525814
valid_y_misclass: 0.0226999893785
valid_y_nll: 0.157335549593
valid_y_row_norms_max: 1.7590252161
valid_y_row_norms_mean: 0.54948079586
valid_y_row_norms_min: 0.020847639069
Time this epoch: 3.208633 seconds
Monitoring step:
Epochs seen: 27
Batches seen: 13500
Examples seen: 1350000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 6.41813564301
test_h0_col_norms_mean: 4.28969669342
test_h0_col_norms_min: 2.2360560894
test_h0_row_norms_max: 6.7286157608
test_h0_row_norms_mean: 3.35873889923
test_h0_row_norms_min: 0.160087496042
test_h1_col_norms_max: 6.00020074844
test_h1_col_norms_mean: 3.87404108047
test_h1_col_norms_min: 1.72669911385
test_h1_row_norms_max: 8.87103843689
test_h1_row_norms_mean: 5.50552749634
test_h1_row_norms_min: 3.27386808395
test_objective: 0.143524944782
test_y_col_norms_max: 6.35547590256
test_y_col_norms_mean: 5.87758922577
test_y_col_norms_min: 5.21483325958
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.994994282722
test_y_min_max_class: 0.740391731262
test_y_misclass: 0.0209999959916
test_y_nll: 0.143524944782
test_y_row_norms_max: 1.73408651352
test_y_row_norms_mean: 0.5533670187
test_y_row_norms_min: 0.0205177664757
train_h0_col_norms_max: 6.41816806793
train_h0_col_norms_mean: 4.28971195221
train_h0_col_norms_min: 2.23605871201
train_h0_row_norms_max: 6.72864484787
train_h0_row_norms_mean: 3.35872411728
train_h0_row_norms_min: 0.160087764263
train_h1_col_norms_max: 6.00021934509
train_h1_col_norms_mean: 3.87405753136
train_h1_col_norms_min: 1.72669124603
train_h1_row_norms_max: 8.87106800079
train_h1_row_norms_mean: 5.50554513931
train_h1_row_norms_min: 3.27385210991
train_objective: 0.00366839556955
train_y_col_norms_max: 6.3555059433
train_y_col_norms_mean: 5.87757110596
train_y_col_norms_min: 5.21484279633
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.998865902424
train_y_min_max_class: 0.908865869045
train_y_misclass: 0.00126000004821
train_y_nll: 0.00366839556955
train_y_row_norms_max: 1.73407900333
train_y_row_norms_mean: 0.553368866444
train_y_row_norms_min: 0.0205178782344
valid_h0_col_norms_max: 6.41813564301
valid_h0_col_norms_mean: 4.28969669342
valid_h0_col_norms_min: 2.2360560894
valid_h0_row_norms_max: 6.7286157608
valid_h0_row_norms_mean: 3.35873889923
valid_h0_row_norms_min: 0.160087496042
valid_h1_col_norms_max: 6.00020074844
valid_h1_col_norms_mean: 3.87404108047
valid_h1_col_norms_min: 1.72669911385
valid_h1_row_norms_max: 8.87103843689
valid_h1_row_norms_mean: 5.50552749634
valid_h1_row_norms_min: 3.27386808395
valid_objective: 0.155297890306
valid_y_col_norms_max: 6.35547590256
valid_y_col_norms_mean: 5.87758922577
valid_y_col_norms_min: 5.21483325958
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.994454801083
valid_y_min_max_class: 0.73979562521
valid_y_misclass: 0.0205999910831
valid_y_nll: 0.155297890306
valid_y_row_norms_max: 1.73408651352
valid_y_row_norms_mean: 0.5533670187
valid_y_row_norms_min: 0.0205177664757
Time this epoch: 3.239587 seconds
Monitoring step:
Epochs seen: 28
Batches seen: 14000
Examples seen: 1400000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 6.42320108414
test_h0_col_norms_mean: 4.29595088959
test_h0_col_norms_min: 2.2360560894
test_h0_row_norms_max: 6.73588323593
test_h0_row_norms_mean: 3.36365532875
test_h0_row_norms_min: 0.160095050931
test_h1_col_norms_max: 6.00109481812
test_h1_col_norms_mean: 3.87561798096
test_h1_col_norms_min: 1.72673380375
test_h1_row_norms_max: 8.90102100372
test_h1_row_norms_mean: 5.50783443451
test_h1_row_norms_min: 3.27579259872
test_objective: 0.176090538502
test_y_col_norms_max: 6.37317848206
test_y_col_norms_mean: 5.91372203827
test_y_col_norms_min: 5.26935434341
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.994395077229
test_y_min_max_class: 0.743200361729
test_y_misclass: 0.0221999939531
test_y_nll: 0.176090538502
test_y_row_norms_max: 1.72095572948
test_y_row_norms_mean: 0.556442499161
test_y_row_norms_min: 0.0208181608468
train_h0_col_norms_max: 6.4232301712
train_h0_col_norms_mean: 4.29595375061
train_h0_col_norms_min: 2.23605871201
train_h0_row_norms_max: 6.73585557938
train_h0_row_norms_mean: 3.36363792419
train_h0_row_norms_min: 0.160095304251
train_h1_col_norms_max: 6.00107383728
train_h1_col_norms_mean: 3.87561368942
train_h1_col_norms_min: 1.72674226761
train_h1_row_norms_max: 8.90106678009
train_h1_row_norms_mean: 5.50785970688
train_h1_row_norms_min: 3.27577996254
train_objective: 0.00485403602943
train_y_col_norms_max: 6.37316846848
train_y_col_norms_mean: 5.91373300552
train_y_col_norms_min: 5.26935815811
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.998713254929
train_y_min_max_class: 0.896820962429
train_y_misclass: 0.00136000022758
train_y_nll: 0.00485403602943
train_y_row_norms_max: 1.72096157074
train_y_row_norms_mean: 0.556439995766
train_y_row_norms_min: 0.0208181329072
valid_h0_col_norms_max: 6.42320108414
valid_h0_col_norms_mean: 4.29595088959
valid_h0_col_norms_min: 2.2360560894
valid_h0_row_norms_max: 6.73588323593
valid_h0_row_norms_mean: 3.36365532875
valid_h0_row_norms_min: 0.160095050931
valid_h1_col_norms_max: 6.00109481812
valid_h1_col_norms_mean: 3.87561798096
valid_h1_col_norms_min: 1.72673380375
valid_h1_row_norms_max: 8.90102100372
valid_h1_row_norms_mean: 5.50783443451
valid_h1_row_norms_min: 3.27579259872
valid_objective: 0.183195546269
valid_y_col_norms_max: 6.37317848206
valid_y_col_norms_mean: 5.91372203827
valid_y_col_norms_min: 5.26935434341
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.994852602482
valid_y_min_max_class: 0.74536216259
valid_y_misclass: 0.0237999893725
valid_y_nll: 0.183195546269
valid_y_row_norms_max: 1.72095572948
valid_y_row_norms_mean: 0.556442499161
valid_y_row_norms_min: 0.0208181608468
Time this epoch: 3.306142 seconds
Monitoring step:
Epochs seen: 29
Batches seen: 14500
Examples seen: 1450000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 6.45381164551
test_h0_col_norms_mean: 4.30269384384
test_h0_col_norms_min: 2.2360560894
test_h0_row_norms_max: 6.74906110764
test_h0_row_norms_mean: 3.36890244484
test_h0_row_norms_min: 0.159244820476
test_h1_col_norms_max: 6.00183820724
test_h1_col_norms_mean: 3.87737250328
test_h1_col_norms_min: 1.7269256115
test_h1_row_norms_max: 8.89922237396
test_h1_row_norms_mean: 5.51038217545
test_h1_row_norms_min: 3.27727627754
test_objective: 0.158995479345
test_y_col_norms_max: 6.38246154785
test_y_col_norms_mean: 5.95248889923
test_y_col_norms_min: 5.29096841812
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.994712769985
test_y_min_max_class: 0.747330605984
test_y_misclass: 0.0207999963313
test_y_nll: 0.158995479345
test_y_row_norms_max: 1.74560809135
test_y_row_norms_mean: 0.559956371784
test_y_row_norms_min: 0.0206812545657
train_h0_col_norms_max: 6.45380783081
train_h0_col_norms_mean: 4.3027176857
train_h0_col_norms_min: 2.23605871201
train_h0_row_norms_max: 6.74909591675
train_h0_row_norms_mean: 3.36888813972
train_h0_row_norms_min: 0.159244179726
train_h1_col_norms_max: 6.00187015533
train_h1_col_norms_mean: 3.87737822533
train_h1_col_norms_min: 1.72692549229
train_h1_row_norms_max: 8.89921569824
train_h1_row_norms_mean: 5.51035165787
train_h1_row_norms_min: 3.27729272842
train_objective: 0.00499874725938
train_y_col_norms_max: 6.38246393204
train_y_col_norms_mean: 5.9525179863
train_y_col_norms_min: 5.29098033905
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.998843669891
train_y_min_max_class: 0.907702028751
train_y_misclass: 0.00154000031762
train_y_nll: 0.00499874725938
train_y_row_norms_max: 1.7455984354
train_y_row_norms_mean: 0.559955894947
train_y_row_norms_min: 0.0206812303513
valid_h0_col_norms_max: 6.45381164551
valid_h0_col_norms_mean: 4.30269384384
valid_h0_col_norms_min: 2.2360560894
valid_h0_row_norms_max: 6.74906110764
valid_h0_row_norms_mean: 3.36890244484
valid_h0_row_norms_min: 0.159244820476
valid_h1_col_norms_max: 6.00183820724
valid_h1_col_norms_mean: 3.87737250328
valid_h1_col_norms_min: 1.7269256115
valid_h1_row_norms_max: 8.89922237396
valid_h1_row_norms_mean: 5.51038217545
valid_h1_row_norms_min: 3.27727627754
valid_objective: 0.161353841424
valid_y_col_norms_max: 6.38246154785
valid_y_col_norms_mean: 5.95248889923
valid_y_col_norms_min: 5.29096841812
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.995362341404
valid_y_min_max_class: 0.764035582542
valid_y_misclass: 0.0211999919266
valid_y_nll: 0.161353841424
valid_y_row_norms_max: 1.74560809135
valid_y_row_norms_mean: 0.559956371784
valid_y_row_norms_min: 0.0206812545657
Time this epoch: 3.264931 seconds
Monitoring step:
Epochs seen: 30
Batches seen: 15000
Examples seen: 1500000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 6.45126152039
test_h0_col_norms_mean: 4.30855321884
test_h0_col_norms_min: 2.2360560894
test_h0_row_norms_max: 6.77185153961
test_h0_row_norms_mean: 3.37364006042
test_h0_row_norms_min: 0.159440949559
test_h1_col_norms_max: 6.00142860413
test_h1_col_norms_mean: 3.8789036274
test_h1_col_norms_min: 1.72696387768
test_h1_row_norms_max: 8.92525005341
test_h1_row_norms_mean: 5.5125746727
test_h1_row_norms_min: 3.27923321724
test_objective: 0.159945309162
test_y_col_norms_max: 6.50855636597
test_y_col_norms_mean: 5.9870095253
test_y_col_norms_min: 5.30891561508
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.995383441448
test_y_min_max_class: 0.755910158157
test_y_misclass: 0.0218999926001
test_y_nll: 0.159945309162
test_y_row_norms_max: 1.7809484005
test_y_row_norms_mean: 0.563234627247
test_y_row_norms_min: 0.0199234094471
train_h0_col_norms_max: 6.45129537582
train_h0_col_norms_mean: 4.30855226517
train_h0_col_norms_min: 2.23605871201
train_h0_row_norms_max: 6.77182006836
train_h0_row_norms_mean: 3.37362527847
train_h0_row_norms_min: 0.159441739321
train_h1_col_norms_max: 6.00145721436
train_h1_col_norms_mean: 3.87892222404
train_h1_col_norms_min: 1.72697114944
train_h1_row_norms_max: 8.92523765564
train_h1_row_norms_mean: 5.5125579834
train_h1_row_norms_min: 3.27921772003
train_objective: 0.0052194846794
train_y_col_norms_max: 6.50858449936
train_y_col_norms_mean: 5.98699235916
train_y_col_norms_min: 5.3088889122
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.998774111271
train_y_min_max_class: 0.904954195023
train_y_misclass: 0.00160000030883
train_y_nll: 0.0052194846794
train_y_row_norms_max: 1.78094053268
train_y_row_norms_mean: 0.56323415041
train_y_row_norms_min: 0.0199234373868
valid_h0_col_norms_max: 6.45126152039
valid_h0_col_norms_mean: 4.30855321884
valid_h0_col_norms_min: 2.2360560894
valid_h0_row_norms_max: 6.77185153961
valid_h0_row_norms_mean: 3.37364006042
valid_h0_row_norms_min: 0.159440949559
valid_h1_col_norms_max: 6.00142860413
valid_h1_col_norms_mean: 3.8789036274
valid_h1_col_norms_min: 1.72696387768
valid_h1_row_norms_max: 8.92525005341
valid_h1_row_norms_mean: 5.5125746727
valid_h1_row_norms_min: 3.27923321724
valid_objective: 0.172797784209
valid_y_col_norms_max: 6.50855636597
valid_y_col_norms_mean: 5.9870095253
valid_y_col_norms_min: 5.30891561508
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.995368361473
valid_y_min_max_class: 0.741488099098
valid_y_misclass: 0.0201999936253
valid_y_nll: 0.172797784209
valid_y_row_norms_max: 1.7809484005
valid_y_row_norms_mean: 0.563234627247
valid_y_row_norms_min: 0.0199234094471
Time this epoch: 3.279603 seconds
Monitoring step:
Epochs seen: 31
Batches seen: 15500
Examples seen: 1550000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 6.45977544785
test_h0_col_norms_mean: 4.31496477127
test_h0_col_norms_min: 2.23605561256
test_h0_row_norms_max: 6.77787017822
test_h0_row_norms_mean: 3.37872552872
test_h0_row_norms_min: 0.167061835527
test_h1_col_norms_max: 6.00070905685
test_h1_col_norms_mean: 3.88056731224
test_h1_col_norms_min: 1.7269756794
test_h1_row_norms_max: 8.94437408447
test_h1_row_norms_mean: 5.51490449905
test_h1_row_norms_min: 3.27992272377
test_objective: 0.131766811013
test_y_col_norms_max: 6.49069547653
test_y_col_norms_mean: 6.01968860626
test_y_col_norms_min: 5.32379293442
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.99406349659
test_y_min_max_class: 0.709186255932
test_y_misclass: 0.0214999895543
test_y_nll: 0.131766811013
test_y_row_norms_max: 1.75881135464
test_y_row_norms_mean: 0.566387176514
test_y_row_norms_min: 0.0195109490305
train_h0_col_norms_max: 6.45976924896
train_h0_col_norms_mean: 4.31498289108
train_h0_col_norms_min: 2.23605871201
train_h0_row_norms_max: 6.77783346176
train_h0_row_norms_mean: 3.37874174118
train_h0_row_norms_min: 0.167062133551
train_h1_col_norms_max: 6.00073814392
train_h1_col_norms_mean: 3.88058972359
train_h1_col_norms_min: 1.72698163986
train_h1_row_norms_max: 8.94434833527
train_h1_row_norms_mean: 5.51487779617
train_h1_row_norms_min: 3.27992391586
train_objective: 0.00692026689649
train_y_col_norms_max: 6.49070358276
train_y_col_norms_mean: 6.01966762543
train_y_col_norms_min: 5.323802948
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.99833124876
train_y_min_max_class: 0.877075016499
train_y_misclass: 0.00206000055186
train_y_nll: 0.00692026689649
train_y_row_norms_max: 1.7588135004
train_y_row_norms_mean: 0.566390037537
train_y_row_norms_min: 0.0195109229535
valid_h0_col_norms_max: 6.45977544785
valid_h0_col_norms_mean: 4.31496477127
valid_h0_col_norms_min: 2.23605561256
valid_h0_row_norms_max: 6.77787017822
valid_h0_row_norms_mean: 3.37872552872
valid_h0_row_norms_min: 0.167061835527
valid_h1_col_norms_max: 6.00070905685
valid_h1_col_norms_mean: 3.88056731224
valid_h1_col_norms_min: 1.7269756794
valid_h1_row_norms_max: 8.94437408447
valid_h1_row_norms_mean: 5.51490449905
valid_h1_row_norms_min: 3.27992272377
valid_objective: 0.161748409271
valid_y_col_norms_max: 6.49069547653
valid_y_col_norms_mean: 6.01968860626
valid_y_col_norms_min: 5.32379293442
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.994541585445
valid_y_min_max_class: 0.741445958614
valid_y_misclass: 0.0221999902278
valid_y_nll: 0.161748409271
valid_y_row_norms_max: 1.75881135464
valid_y_row_norms_mean: 0.566387176514
valid_y_row_norms_min: 0.0195109490305
Time this epoch: 3.251266 seconds
Monitoring step:
Epochs seen: 32
Batches seen: 16000
Examples seen: 1600000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 6.45842170715
test_h0_col_norms_mean: 4.32045173645
test_h0_col_norms_min: 2.23605561256
test_h0_row_norms_max: 6.78466415405
test_h0_row_norms_mean: 3.38315415382
test_h0_row_norms_min: 0.16744081676
test_h1_col_norms_max: 6.00018596649
test_h1_col_norms_mean: 3.88205099106
test_h1_col_norms_min: 1.72608160973
test_h1_row_norms_max: 8.9562330246
test_h1_row_norms_mean: 5.51705217361
test_h1_row_norms_min: 3.28056788445
test_objective: 0.156137660146
test_y_col_norms_max: 6.55750894547
test_y_col_norms_mean: 6.04845666885
test_y_col_norms_min: 5.33018064499
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.995250225067
test_y_min_max_class: 0.769113063812
test_y_misclass: 0.0210999939591
test_y_nll: 0.156137660146
test_y_row_norms_max: 1.7797113657
test_y_row_norms_mean: 0.568675458431
test_y_row_norms_min: 0.0223224461079
train_h0_col_norms_max: 6.45844841003
train_h0_col_norms_mean: 4.32046604156
train_h0_col_norms_min: 2.23605871201
train_h0_row_norms_max: 6.7846736908
train_h0_row_norms_mean: 3.3831589222
train_h0_row_norms_min: 0.167441576719
train_h1_col_norms_max: 6.00020599365
train_h1_col_norms_mean: 3.88205075264
train_h1_col_norms_min: 1.7260876894
train_h1_row_norms_max: 8.9562292099
train_h1_row_norms_mean: 5.51702356339
train_h1_row_norms_min: 3.280554533
train_objective: 0.00448899809271
train_y_col_norms_max: 6.55747938156
train_y_col_norms_mean: 6.0484457016
train_y_col_norms_min: 5.33017015457
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.998964965343
train_y_min_max_class: 0.915260314941
train_y_misclass: 0.0013400001917
train_y_nll: 0.00448899809271
train_y_row_norms_max: 1.77971935272
train_y_row_norms_mean: 0.568675458431
train_y_row_norms_min: 0.0223225466907
valid_h0_col_norms_max: 6.45842170715
valid_h0_col_norms_mean: 4.32045173645
valid_h0_col_norms_min: 2.23605561256
valid_h0_row_norms_max: 6.78466415405
valid_h0_row_norms_mean: 3.38315415382
valid_h0_row_norms_min: 0.16744081676
valid_h1_col_norms_max: 6.00018596649
valid_h1_col_norms_mean: 3.88205099106
valid_h1_col_norms_min: 1.72608160973
valid_h1_row_norms_max: 8.9562330246
valid_h1_row_norms_mean: 5.51705217361
valid_h1_row_norms_min: 3.28056788445
valid_objective: 0.185146003962
valid_y_col_norms_max: 6.55750894547
valid_y_col_norms_mean: 6.04845666885
valid_y_col_norms_min: 5.33018064499
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.995594918728
valid_y_min_max_class: 0.771956503391
valid_y_misclass: 0.0222999881953
valid_y_nll: 0.185146003962
valid_y_row_norms_max: 1.7797113657
valid_y_row_norms_mean: 0.568675458431
valid_y_row_norms_min: 0.0223224461079
Time this epoch: 3.265816 seconds
Monitoring step:
Epochs seen: 33
Batches seen: 16500
Examples seen: 1650000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 6.48042154312
test_h0_col_norms_mean: 4.32581949234
test_h0_col_norms_min: 2.23605561256
test_h0_row_norms_max: 6.79249668121
test_h0_row_norms_mean: 3.38737988472
test_h0_row_norms_min: 0.167504921556
test_h1_col_norms_max: 6.0035238266
test_h1_col_norms_mean: 3.88333916664
test_h1_col_norms_min: 1.72610199451
test_h1_row_norms_max: 8.94651126862
test_h1_row_norms_mean: 5.51890897751
test_h1_row_norms_min: 3.28360319138
test_objective: 0.142962425947
test_y_col_norms_max: 6.59494447708
test_y_col_norms_mean: 6.06826543808
test_y_col_norms_min: 5.36811923981
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.995299935341
test_y_min_max_class: 0.757121562958
test_y_misclass: 0.0198999904096
test_y_nll: 0.142962425947
test_y_row_norms_max: 1.8589527607
test_y_row_norms_mean: 0.570496380329
test_y_row_norms_min: 0.0232647489756
train_h0_col_norms_max: 6.48045063019
train_h0_col_norms_mean: 4.32584047318
train_h0_col_norms_min: 2.23605871201
train_h0_row_norms_max: 6.79252815247
train_h0_row_norms_mean: 3.38736534119
train_h0_row_norms_min: 0.167504131794
train_h1_col_norms_max: 6.00354385376
train_h1_col_norms_mean: 3.88335561752
train_h1_col_norms_min: 1.72609436512
train_h1_row_norms_max: 8.94646167755
train_h1_row_norms_mean: 5.51891183853
train_h1_row_norms_min: 3.28361749649
train_objective: 0.00277355127037
train_y_col_norms_max: 6.59491348267
train_y_col_norms_mean: 6.06824493408
train_y_col_norms_min: 5.36814403534
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.999049842358
train_y_min_max_class: 0.921933472157
train_y_misclass: 0.000999999581836
train_y_nll: 0.00277355127037
train_y_row_norms_max: 1.85895049572
train_y_row_norms_mean: 0.570495426655
train_y_row_norms_min: 0.0232646763325
valid_h0_col_norms_max: 6.48042154312
valid_h0_col_norms_mean: 4.32581949234
valid_h0_col_norms_min: 2.23605561256
valid_h0_row_norms_max: 6.79249668121
valid_h0_row_norms_mean: 3.38737988472
valid_h0_row_norms_min: 0.167504921556
valid_h1_col_norms_max: 6.0035238266
valid_h1_col_norms_mean: 3.88333916664
valid_h1_col_norms_min: 1.72610199451
valid_h1_row_norms_max: 8.94651126862
valid_h1_row_norms_mean: 5.51890897751
valid_h1_row_norms_min: 3.28360319138
valid_objective: 0.179574415088
valid_y_col_norms_max: 6.59494447708
valid_y_col_norms_mean: 6.06826543808
valid_y_col_norms_min: 5.36811923981
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.995453417301
valid_y_min_max_class: 0.75123167038
valid_y_misclass: 0.0197999905795
valid_y_nll: 0.179574415088
valid_y_row_norms_max: 1.8589527607
valid_y_row_norms_mean: 0.570496380329
valid_y_row_norms_min: 0.0232647489756
Time this epoch: 3.231476 seconds
Monitoring step:
Epochs seen: 34
Batches seen: 17000
Examples seen: 1700000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 6.4917049408
test_h0_col_norms_mean: 4.33141994476
test_h0_col_norms_min: 2.23605656624
test_h0_row_norms_max: 6.79732465744
test_h0_row_norms_mean: 3.39186024666
test_h0_row_norms_min: 0.171120882034
test_h1_col_norms_max: 6.00534772873
test_h1_col_norms_mean: 3.88460206985
test_h1_col_norms_min: 1.72610270977
test_h1_row_norms_max: 8.96625423431
test_h1_row_norms_mean: 5.52066421509
test_h1_row_norms_min: 3.28276824951
test_objective: 0.141110450029
test_y_col_norms_max: 6.61644887924
test_y_col_norms_mean: 6.09203910828
test_y_col_norms_min: 5.40572547913
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.994820356369
test_y_min_max_class: 0.726503133774
test_y_misclass: 0.0200999956578
test_y_nll: 0.141110450029
test_y_row_norms_max: 1.85092616081
test_y_row_norms_mean: 0.572713196278
test_y_row_norms_min: 0.0240506455302
train_h0_col_norms_max: 6.49167537689
train_h0_col_norms_mean: 4.33143472672
train_h0_col_norms_min: 2.23605895042
train_h0_row_norms_max: 6.79731702805
train_h0_row_norms_mean: 3.39186143875
train_h0_row_norms_min: 0.171120166779
train_h1_col_norms_max: 6.00534725189
train_h1_col_norms_mean: 3.88458299637
train_h1_col_norms_min: 1.72609496117
train_h1_row_norms_max: 8.9662437439
train_h1_row_norms_mean: 5.52065134048
train_h1_row_norms_min: 3.28278303146
train_objective: 0.00290546845645
train_y_col_norms_max: 6.61642169952
train_y_col_norms_mean: 6.09206676483
train_y_col_norms_min: 5.40573072433
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.999073982239
train_y_min_max_class: 0.924475312233
train_y_misclass: 0.000939999590628
train_y_nll: 0.00290546845645
train_y_row_norms_max: 1.85091614723
train_y_row_norms_mean: 0.572711467743
train_y_row_norms_min: 0.0240505319089
valid_h0_col_norms_max: 6.4917049408
valid_h0_col_norms_mean: 4.33141994476
valid_h0_col_norms_min: 2.23605656624
valid_h0_row_norms_max: 6.79732465744
valid_h0_row_norms_mean: 3.39186024666
valid_h0_row_norms_min: 0.171120882034
valid_h1_col_norms_max: 6.00534772873
valid_h1_col_norms_mean: 3.88460206985
valid_h1_col_norms_min: 1.72610270977
valid_h1_row_norms_max: 8.96625423431
valid_h1_row_norms_mean: 5.52066421509
valid_h1_row_norms_min: 3.28276824951
valid_objective: 0.162981122732
valid_y_col_norms_max: 6.61644887924
valid_y_col_norms_mean: 6.09203910828
valid_y_col_norms_min: 5.40572547913
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.995312690735
valid_y_min_max_class: 0.743762373924
valid_y_misclass: 0.0194999910891
valid_y_nll: 0.162981122732
valid_y_row_norms_max: 1.85092616081
valid_y_row_norms_mean: 0.572713196278
valid_y_row_norms_min: 0.0240506455302
Time this epoch: 3.214131 seconds
Monitoring step:
Epochs seen: 35
Batches seen: 17500
Examples seen: 1750000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 6.49574804306
test_h0_col_norms_mean: 4.3364033699
test_h0_col_norms_min: 2.23605656624
test_h0_row_norms_max: 6.8160161972
test_h0_row_norms_mean: 3.39588427544
test_h0_row_norms_min: 0.171171665192
test_h1_col_norms_max: 6.00441598892
test_h1_col_norms_mean: 3.88574457169
test_h1_col_norms_min: 1.72610199451
test_h1_row_norms_max: 8.98808574677
test_h1_row_norms_mean: 5.52225542068
test_h1_row_norms_min: 3.28273797035
test_objective: 0.170048907399
test_y_col_norms_max: 6.62913417816
test_y_col_norms_mean: 6.11489725113
test_y_col_norms_min: 5.41416931152
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.994616866112
test_y_min_max_class: 0.73312073946
test_y_misclass: 0.0217999909073
test_y_nll: 0.170048907399
test_y_row_norms_max: 1.85863983631
test_y_row_norms_mean: 0.574832618237
test_y_row_norms_min: 0.0238261986524
train_h0_col_norms_max: 6.49571895599
train_h0_col_norms_mean: 4.33637952805
train_h0_col_norms_min: 2.23605918884
train_h0_row_norms_max: 6.81597948074
train_h0_row_norms_mean: 3.39588832855
train_h0_row_norms_min: 0.171171709895
train_h1_col_norms_max: 6.00439691544
train_h1_col_norms_mean: 3.88574552536
train_h1_col_norms_min: 1.72609436512
train_h1_row_norms_max: 8.98807621002
train_h1_row_norms_mean: 5.52225255966
train_h1_row_norms_min: 3.28275132179
train_objective: 0.00725457724184
train_y_col_norms_max: 6.62916135788
train_y_col_norms_mean: 6.11490011215
train_y_col_norms_min: 5.41417980194
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.998784661293
train_y_min_max_class: 0.90457379818
train_y_misclass: 0.00184000050649
train_y_nll: 0.00725457724184
train_y_row_norms_max: 1.85864841938
train_y_row_norms_mean: 0.574829816818
train_y_row_norms_min: 0.0238260868937
valid_h0_col_norms_max: 6.49574804306
valid_h0_col_norms_mean: 4.3364033699
valid_h0_col_norms_min: 2.23605656624
valid_h0_row_norms_max: 6.8160161972
valid_h0_row_norms_mean: 3.39588427544
valid_h0_row_norms_min: 0.171171665192
valid_h1_col_norms_max: 6.00441598892
valid_h1_col_norms_mean: 3.88574457169
valid_h1_col_norms_min: 1.72610199451
valid_h1_row_norms_max: 8.98808574677
valid_h1_row_norms_mean: 5.52225542068
valid_h1_row_norms_min: 3.28273797035
valid_objective: 0.188135892153
valid_y_col_norms_max: 6.62913417816
valid_y_col_norms_mean: 6.11489725113
valid_y_col_norms_min: 5.41416931152
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.99530762434
valid_y_min_max_class: 0.754189014435
valid_y_misclass: 0.0216999892145
valid_y_nll: 0.188135892153
valid_y_row_norms_max: 1.85863983631
valid_y_row_norms_mean: 0.574832618237
valid_y_row_norms_min: 0.0238261986524
Time this epoch: 3.284179 seconds
Monitoring step:
Epochs seen: 36
Batches seen: 18000
Examples seen: 1800000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 6.50911712646
test_h0_col_norms_mean: 4.34344434738
test_h0_col_norms_min: 2.23605656624
test_h0_row_norms_max: 6.84686803818
test_h0_row_norms_mean: 3.40156388283
test_h0_row_norms_min: 0.171174243093
test_h1_col_norms_max: 6.00547456741
test_h1_col_norms_mean: 3.88733744621
test_h1_col_norms_min: 1.72609961033
test_h1_row_norms_max: 9.016705513
test_h1_row_norms_mean: 5.5244436264
test_h1_row_norms_min: 3.28328037262
test_objective: 0.147451668978
test_y_col_norms_max: 6.66465806961
test_y_col_norms_mean: 6.14104557037
test_y_col_norms_min: 5.43022489548
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.994985222816
test_y_min_max_class: 0.730050563812
test_y_misclass: 0.0206999927759
test_y_nll: 0.147451668978
test_y_row_norms_max: 1.78328752518
test_y_row_norms_mean: 0.577396690845
test_y_row_norms_min: 0.025094171986
train_h0_col_norms_max: 6.50908374786
train_h0_col_norms_mean: 4.34342718124
train_h0_col_norms_min: 2.23605918884
train_h0_row_norms_max: 6.8468914032
train_h0_row_norms_mean: 3.40154623985
train_h0_row_norms_min: 0.171174883842
train_h1_col_norms_max: 6.00550603867
train_h1_col_norms_mean: 3.8873193264
train_h1_col_norms_min: 1.72609198093
train_h1_row_norms_max: 9.0167131424
train_h1_row_norms_mean: 5.52441453934
train_h1_row_norms_min: 3.28326916695
train_objective: 0.00539966486394
train_y_col_norms_max: 6.66468572617
train_y_col_norms_mean: 6.14102125168
train_y_col_norms_min: 5.43022203445
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.99889010191
train_y_min_max_class: 0.915294647217
train_y_misclass: 0.00152000051457
train_y_nll: 0.00539966486394
train_y_row_norms_max: 1.78329563141
train_y_row_norms_mean: 0.57739341259
train_y_row_norms_min: 0.0250942651182
valid_h0_col_norms_max: 6.50911712646
valid_h0_col_norms_mean: 4.34344434738
valid_h0_col_norms_min: 2.23605656624
valid_h0_row_norms_max: 6.84686803818
valid_h0_row_norms_mean: 3.40156388283
valid_h0_row_norms_min: 0.171174243093
valid_h1_col_norms_max: 6.00547456741
valid_h1_col_norms_mean: 3.88733744621
valid_h1_col_norms_min: 1.72609961033
valid_h1_row_norms_max: 9.016705513
valid_h1_row_norms_mean: 5.5244436264
valid_h1_row_norms_min: 3.28328037262
valid_objective: 0.161581993103
valid_y_col_norms_max: 6.66465806961
valid_y_col_norms_mean: 6.14104557037
valid_y_col_norms_min: 5.43022489548
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.995217263699
valid_y_min_max_class: 0.752208411694
valid_y_misclass: 0.0202999934554
valid_y_nll: 0.161581993103
valid_y_row_norms_max: 1.78328752518
valid_y_row_norms_mean: 0.577396690845
valid_y_row_norms_min: 0.025094171986
Time this epoch: 3.277391 seconds
Monitoring step:
Epochs seen: 37
Batches seen: 18500
Examples seen: 1850000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 6.51882982254
test_h0_col_norms_mean: 4.34898805618
test_h0_col_norms_min: 2.23605656624
test_h0_row_norms_max: 6.86316585541
test_h0_row_norms_mean: 3.40600013733
test_h0_row_norms_min: 0.171176031232
test_h1_col_norms_max: 6.00360631943
test_h1_col_norms_mean: 3.88884663582
test_h1_col_norms_min: 1.72619795799
test_h1_row_norms_max: 9.0371131897
test_h1_row_norms_mean: 5.526512146
test_h1_row_norms_min: 3.28363656998
test_objective: 0.174357533455
test_y_col_norms_max: 6.70250511169
test_y_col_norms_mean: 6.17451667786
test_y_col_norms_min: 5.43355512619
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.995144307613
test_y_min_max_class: 0.754079401493
test_y_misclass: 0.0214999951422
test_y_nll: 0.174357533455
test_y_row_norms_max: 1.83495354652
test_y_row_norms_mean: 0.580399692059
test_y_row_norms_min: 0.0246269144118
train_h0_col_norms_max: 6.51883935928
train_h0_col_norms_mean: 4.34899139404
train_h0_col_norms_min: 2.23605895042
train_h0_row_norms_max: 6.86313438416
train_h0_row_norms_mean: 3.40601491928
train_h0_row_norms_min: 0.171176567674
train_h1_col_norms_max: 6.00361680984
train_h1_col_norms_mean: 3.88884592056
train_h1_col_norms_min: 1.72620582581
train_h1_row_norms_max: 9.03706741333
train_h1_row_norms_mean: 5.52653741837
train_h1_row_norms_min: 3.28362202644
train_objective: 0.00331209623255
train_y_col_norms_max: 6.70247983932
train_y_col_norms_mean: 6.17454624176
train_y_col_norms_min: 5.43355798721
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.999139487743
train_y_min_max_class: 0.931698381901
train_y_misclass: 0.000979999545962
train_y_nll: 0.00331209623255
train_y_row_norms_max: 1.83494448662
train_y_row_norms_mean: 0.580400049686
train_y_row_norms_min: 0.0246269479394
valid_h0_col_norms_max: 6.51882982254
valid_h0_col_norms_mean: 4.34898805618
valid_h0_col_norms_min: 2.23605656624
valid_h0_row_norms_max: 6.86316585541
valid_h0_row_norms_mean: 3.40600013733
valid_h0_row_norms_min: 0.171176031232
valid_h1_col_norms_max: 6.00360631943
valid_h1_col_norms_mean: 3.88884663582
valid_h1_col_norms_min: 1.72619795799
valid_h1_row_norms_max: 9.0371131897
valid_h1_row_norms_mean: 5.526512146
valid_h1_row_norms_min: 3.28363656998
valid_objective: 0.164556577802
valid_y_col_norms_max: 6.70250511169
valid_y_col_norms_mean: 6.17451667786
valid_y_col_norms_min: 5.43355512619
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.995738983154
valid_y_min_max_class: 0.76286149025
valid_y_misclass: 0.0205999910831
valid_y_nll: 0.164556577802
valid_y_row_norms_max: 1.83495354652
valid_y_row_norms_mean: 0.580399692059
valid_y_row_norms_min: 0.0246269144118
Time this epoch: 3.300500 seconds
Monitoring step:
Epochs seen: 38
Batches seen: 19000
Examples seen: 1900000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 6.52381372452
test_h0_col_norms_mean: 4.35216140747
test_h0_col_norms_min: 2.23605656624
test_h0_row_norms_max: 6.87646770477
test_h0_row_norms_mean: 3.40848636627
test_h0_row_norms_min: 0.171177119017
test_h1_col_norms_max: 6.00470304489
test_h1_col_norms_mean: 3.88970422745
test_h1_col_norms_min: 1.72622287273
test_h1_row_norms_max: 9.0545091629
test_h1_row_norms_mean: 5.52772140503
test_h1_row_norms_min: 3.28486537933
test_objective: 0.16956473887
test_y_col_norms_max: 6.70925521851
test_y_col_norms_mean: 6.2000246048
test_y_col_norms_min: 5.47072219849
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.996227920055
test_y_min_max_class: 0.774923741817
test_y_misclass: 0.0192999932915
test_y_nll: 0.16956473887
test_y_row_norms_max: 1.87937033176
test_y_row_norms_mean: 0.582399070263
test_y_row_norms_min: 0.0244527608156
train_h0_col_norms_max: 6.52384281158
train_h0_col_norms_mean: 4.35217618942
train_h0_col_norms_min: 2.23605895042
train_h0_row_norms_max: 6.87646818161
train_h0_row_norms_mean: 3.40846681595
train_h0_row_norms_min: 0.171177104115
train_h1_col_norms_max: 6.00473213196
train_h1_col_norms_mean: 3.88968753815
train_h1_col_norms_min: 1.72621440887
train_h1_row_norms_max: 9.05449295044
train_h1_row_norms_mean: 5.52773332596
train_h1_row_norms_min: 3.28484797478
train_objective: 0.0016456496669
train_y_col_norms_max: 6.70928049088
train_y_col_norms_mean: 6.20005607605
train_y_col_norms_min: 5.47074699402
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.999532461166
train_y_min_max_class: 0.958807349205
train_y_misclass: 0.000520000001416
train_y_nll: 0.0016456496669
train_y_row_norms_max: 1.87937915325
train_y_row_norms_mean: 0.582397639751
train_y_row_norms_min: 0.0244527999312
valid_h0_col_norms_max: 6.52381372452
valid_h0_col_norms_mean: 4.35216140747
valid_h0_col_norms_min: 2.23605656624
valid_h0_row_norms_max: 6.87646770477
valid_h0_row_norms_mean: 3.40848636627
valid_h0_row_norms_min: 0.171177119017
valid_h1_col_norms_max: 6.00470304489
valid_h1_col_norms_mean: 3.88970422745
valid_h1_col_norms_min: 1.72622287273
valid_h1_row_norms_max: 9.0545091629
valid_h1_row_norms_mean: 5.52772140503
valid_h1_row_norms_min: 3.28486537933
valid_objective: 0.174608826637
valid_y_col_norms_max: 6.70925521851
valid_y_col_norms_mean: 6.2000246048
valid_y_col_norms_min: 5.47072219849
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.996570110321
valid_y_min_max_class: 0.792669534683
valid_y_misclass: 0.0185999963433
valid_y_nll: 0.174608826637
valid_y_row_norms_max: 1.87937033176
valid_y_row_norms_mean: 0.582399070263
valid_y_row_norms_min: 0.0244527608156
Time this epoch: 3.301847 seconds
Monitoring step:
Epochs seen: 39
Batches seen: 19500
Examples seen: 1950000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 6.52631568909
test_h0_col_norms_mean: 4.35376691818
test_h0_col_norms_min: 2.23605656624
test_h0_row_norms_max: 6.87409830093
test_h0_row_norms_mean: 3.40977239609
test_h0_row_norms_min: 0.171177133918
test_h1_col_norms_max: 6.00363349915
test_h1_col_norms_mean: 3.89011406898
test_h1_col_norms_min: 1.72623074055
test_h1_row_norms_max: 9.06535053253
test_h1_row_norms_mean: 5.52831077576
test_h1_row_norms_min: 3.28474617004
test_objective: 0.158702552319
test_y_col_norms_max: 6.72936153412
test_y_col_norms_mean: 6.2109913826
test_y_col_norms_min: 5.48157644272
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.995932340622
test_y_min_max_class: 0.764656722546
test_y_misclass: 0.0200999919325
test_y_nll: 0.158702552319
test_y_row_norms_max: 1.87921774387
test_y_row_norms_mean: 0.583407759666
test_y_row_norms_min: 0.024447273463
train_h0_col_norms_max: 6.52629041672
train_h0_col_norms_mean: 4.35376310349
train_h0_col_norms_min: 2.23605895042
train_h0_row_norms_max: 6.87408828735
train_h0_row_norms_mean: 3.40976953506
train_h0_row_norms_min: 0.171177104115
train_h1_col_norms_max: 6.00362253189
train_h1_col_norms_mean: 3.8901321888
train_h1_col_norms_min: 1.72622382641
train_h1_row_norms_max: 9.06535148621
train_h1_row_norms_mean: 5.52829360962
train_h1_row_norms_min: 3.28472876549
train_objective: 0.00152394291945
train_y_col_norms_max: 6.7293639183
train_y_col_norms_mean: 6.21102333069
train_y_col_norms_min: 5.48156309128
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.999768614769
train_y_min_max_class: 0.979155957699
train_y_misclass: 0.000379999983124
train_y_nll: 0.00152394291945
train_y_row_norms_max: 1.87921559811
train_y_row_norms_mean: 0.583409488201
train_y_row_norms_min: 0.0244472324848
valid_h0_col_norms_max: 6.52631568909
valid_h0_col_norms_mean: 4.35376691818
valid_h0_col_norms_min: 2.23605656624
valid_h0_row_norms_max: 6.87409830093
valid_h0_row_norms_mean: 3.40977239609
valid_h0_row_norms_min: 0.171177133918
valid_h1_col_norms_max: 6.00363349915
valid_h1_col_norms_mean: 3.89011406898
valid_h1_col_norms_min: 1.72623074055
valid_h1_row_norms_max: 9.06535053253
valid_h1_row_norms_mean: 5.52831077576
valid_h1_row_norms_min: 3.28474617004
valid_objective: 0.17522443831
valid_y_col_norms_max: 6.72936153412
valid_y_col_norms_mean: 6.2109913826
valid_y_col_norms_min: 5.48157644272
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.996479153633
valid_y_min_max_class: 0.788241684437
valid_y_misclass: 0.0187999941409
valid_y_nll: 0.17522443831
valid_y_row_norms_max: 1.87921774387
valid_y_row_norms_mean: 0.583407759666
valid_y_row_norms_min: 0.024447273463
Time this epoch: 3.268098 seconds
Monitoring step:
Epochs seen: 40
Batches seen: 20000
Examples seen: 2000000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 6.53570699692
test_h0_col_norms_mean: 4.35643339157
test_h0_col_norms_min: 2.23605656624
test_h0_row_norms_max: 6.86570596695
test_h0_row_norms_mean: 3.41193628311
test_h0_row_norms_min: 0.171177208424
test_h1_col_norms_max: 6.00472784042
test_h1_col_norms_mean: 3.89065885544
test_h1_col_norms_min: 1.72635400295
test_h1_row_norms_max: 9.0626745224
test_h1_row_norms_mean: 5.52905321121
test_h1_row_norms_min: 3.28488898277
test_objective: 0.16143476963
test_y_col_norms_max: 6.73923158646
test_y_col_norms_mean: 6.22264146805
test_y_col_norms_min: 5.52369451523
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.995911836624
test_y_min_max_class: 0.785954415798
test_y_misclass: 0.019199995324
test_y_nll: 0.16143476963
test_y_row_norms_max: 1.85353505611
test_y_row_norms_mean: 0.584432959557
test_y_row_norms_min: 0.0243270788342
train_h0_col_norms_max: 6.5357131958
train_h0_col_norms_mean: 4.35641145706
train_h0_col_norms_min: 2.23605895042
train_h0_row_norms_max: 6.86573553085
train_h0_row_norms_mean: 3.411921978
train_h0_row_norms_min: 0.171177133918
train_h1_col_norms_max: 6.00474691391
train_h1_col_norms_mean: 3.89064121246
train_h1_col_norms_min: 1.72634625435
train_h1_row_norms_max: 9.06272411346
train_h1_row_norms_mean: 5.52907943726
train_h1_row_norms_min: 3.28490185738
train_objective: 0.00306967948563
train_y_col_norms_max: 6.73919677734
train_y_col_norms_mean: 6.22266340256
train_y_col_norms_min: 5.52368307114
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.999223351479
train_y_min_max_class: 0.938300907612
train_y_misclass: 0.00105999980588
train_y_nll: 0.00306967948563
train_y_row_norms_max: 1.85352873802
train_y_row_norms_mean: 0.584433317184
train_y_row_norms_min: 0.0243270788342
valid_h0_col_norms_max: 6.53570699692
valid_h0_col_norms_mean: 4.35643339157
valid_h0_col_norms_min: 2.23605656624
valid_h0_row_norms_max: 6.86570596695
valid_h0_row_norms_mean: 3.41193628311
valid_h0_row_norms_min: 0.171177208424
valid_h1_col_norms_max: 6.00472784042
valid_h1_col_norms_mean: 3.89065885544
valid_h1_col_norms_min: 1.72635400295
valid_h1_row_norms_max: 9.0626745224
valid_h1_row_norms_mean: 5.52905321121
valid_h1_row_norms_min: 3.28488898277
valid_objective: 0.182417109609
valid_y_col_norms_max: 6.73923158646
valid_y_col_norms_mean: 6.22264146805
valid_y_col_norms_min: 5.52369451523
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.996435403824
valid_y_min_max_class: 0.793238520622
valid_y_misclass: 0.0203999951482
valid_y_nll: 0.182417109609
valid_y_row_norms_max: 1.85353505611
valid_y_row_norms_mean: 0.584432959557
valid_y_row_norms_min: 0.0243270788342
Time this epoch: 3.294892 seconds
Monitoring step:
Epochs seen: 41
Batches seen: 20500
Examples seen: 2050000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 6.5425863266
test_h0_col_norms_mean: 4.35914468765
test_h0_col_norms_min: 2.23605656624
test_h0_row_norms_max: 6.87823629379
test_h0_row_norms_mean: 3.41422724724
test_h0_row_norms_min: 0.171178132296
test_h1_col_norms_max: 6.00586032867
test_h1_col_norms_mean: 3.89139056206
test_h1_col_norms_min: 1.72638916969
test_h1_row_norms_max: 9.06592273712
test_h1_row_norms_mean: 5.53010177612
test_h1_row_norms_min: 3.28573608398
test_objective: 0.158061608672
test_y_col_norms_max: 6.74868965149
test_y_col_norms_mean: 6.23669672012
test_y_col_norms_min: 5.50828027725
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.995399415493
test_y_min_max_class: 0.739345610142
test_y_misclass: 0.0194999948144
test_y_nll: 0.158061608672
test_y_row_norms_max: 1.86322903633
test_y_row_norms_mean: 0.585780024529
test_y_row_norms_min: 0.0242832899094
train_h0_col_norms_max: 6.54261350632
train_h0_col_norms_mean: 4.3591375351
train_h0_col_norms_min: 2.23605895042
train_h0_row_norms_max: 6.87820577621
train_h0_row_norms_mean: 3.41424489021
train_h0_row_norms_min: 0.171177610755
train_h1_col_norms_max: 6.00583934784
train_h1_col_norms_mean: 3.89137220383
train_h1_col_norms_min: 1.72638905048
train_h1_row_norms_max: 9.06593418121
train_h1_row_norms_mean: 5.53011369705
train_h1_row_norms_min: 3.28573846817
train_objective: 0.00130198767874
train_y_col_norms_max: 6.74868011475
train_y_col_norms_mean: 6.23671960831
train_y_col_norms_min: 5.50826644897
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.999629914761
train_y_min_max_class: 0.966445803642
train_y_misclass: 0.000419999967562
train_y_nll: 0.00130198767874
train_y_row_norms_max: 1.86323726177
train_y_row_norms_mean: 0.585781753063
train_y_row_norms_min: 0.0242832992226
valid_h0_col_norms_max: 6.5425863266
valid_h0_col_norms_mean: 4.35914468765
valid_h0_col_norms_min: 2.23605656624
valid_h0_row_norms_max: 6.87823629379
valid_h0_row_norms_mean: 3.41422724724
valid_h0_row_norms_min: 0.171178132296
valid_h1_col_norms_max: 6.00586032867
valid_h1_col_norms_mean: 3.89139056206
valid_h1_col_norms_min: 1.72638916969
valid_h1_row_norms_max: 9.06592273712
valid_h1_row_norms_mean: 5.53010177612
valid_h1_row_norms_min: 3.28573608398
valid_objective: 0.168345704675
valid_y_col_norms_max: 6.74868965149
valid_y_col_norms_mean: 6.23669672012
valid_y_col_norms_min: 5.50828027725
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.995861887932
valid_y_min_max_class: 0.767153561115
valid_y_misclass: 0.0193999931216
valid_y_nll: 0.168345704675
valid_y_row_norms_max: 1.86322903633
valid_y_row_norms_mean: 0.585780024529
valid_y_row_norms_min: 0.0242832899094
Time this epoch: 3.283051 seconds
Monitoring step:
Epochs seen: 42
Batches seen: 21000
Examples seen: 2100000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 6.54826259613
test_h0_col_norms_mean: 4.36148118973
test_h0_col_norms_min: 2.23605656624
test_h0_row_norms_max: 6.87971019745
test_h0_row_norms_mean: 3.41604399681
test_h0_row_norms_min: 0.171194016933
test_h1_col_norms_max: 6.00196123123
test_h1_col_norms_mean: 3.89196276665
test_h1_col_norms_min: 1.72636771202
test_h1_row_norms_max: 9.06931400299
test_h1_row_norms_mean: 5.53089809418
test_h1_row_norms_min: 3.28621292114
test_objective: 0.152915328741
test_y_col_norms_max: 6.76382827759
test_y_col_norms_mean: 6.25065279007
test_y_col_norms_min: 5.53469228745
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.995481073856
test_y_min_max_class: 0.755885243416
test_y_misclass: 0.0184999946505
test_y_nll: 0.152915328741
test_y_row_norms_max: 1.87736725807
test_y_row_norms_mean: 0.586914539337
test_y_row_norms_min: 0.0246897321194
train_h0_col_norms_max: 6.54823541641
train_h0_col_norms_mean: 4.36147928238
train_h0_col_norms_min: 2.23605895042
train_h0_row_norms_max: 6.87974071503
train_h0_row_norms_mean: 3.4160592556
train_h0_row_norms_min: 0.171194061637
train_h1_col_norms_max: 6.00195074081
train_h1_col_norms_mean: 3.8919467926
train_h1_col_norms_min: 1.72637498379
train_h1_row_norms_max: 9.06928443909
train_h1_row_norms_mean: 5.53089904785
train_h1_row_norms_min: 3.28621530533
train_objective: 0.00141110678669
train_y_col_norms_max: 6.76386547089
train_y_col_norms_mean: 6.25062465668
train_y_col_norms_min: 5.5346660614
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.999635100365
train_y_min_max_class: 0.967330634594
train_y_misclass: 0.000319999962812
train_y_nll: 0.00141110678669
train_y_row_norms_max: 1.87736737728
train_y_row_norms_mean: 0.58691483736
train_y_row_norms_min: 0.0246896371245
valid_h0_col_norms_max: 6.54826259613
valid_h0_col_norms_mean: 4.36148118973
valid_h0_col_norms_min: 2.23605656624
valid_h0_row_norms_max: 6.87971019745
valid_h0_row_norms_mean: 3.41604399681
valid_h0_row_norms_min: 0.171194016933
valid_h1_col_norms_max: 6.00196123123
valid_h1_col_norms_mean: 3.89196276665
valid_h1_col_norms_min: 1.72636771202
valid_h1_row_norms_max: 9.06931400299
valid_h1_row_norms_mean: 5.53089809418
valid_h1_row_norms_min: 3.28621292114
valid_objective: 0.164742320776
valid_y_col_norms_max: 6.76382827759
valid_y_col_norms_mean: 6.25065279007
valid_y_col_norms_min: 5.53469228745
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.996081888676
valid_y_min_max_class: 0.769794583321
valid_y_misclass: 0.0189999956638
valid_y_nll: 0.164742320776
valid_y_row_norms_max: 1.87736725807
valid_y_row_norms_mean: 0.586914539337
valid_y_row_norms_min: 0.0246897321194
Time this epoch: 3.293110 seconds
Monitoring step:
Epochs seen: 43
Batches seen: 21500
Examples seen: 2150000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 6.55259990692
test_h0_col_norms_mean: 4.36336374283
test_h0_col_norms_min: 2.23605656624
test_h0_row_norms_max: 6.8740735054
test_h0_row_norms_mean: 3.41756176949
test_h0_row_norms_min: 0.17119500041
test_h1_col_norms_max: 6.0039639473
test_h1_col_norms_mean: 3.89240264893
test_h1_col_norms_min: 1.72636425495
test_h1_row_norms_max: 9.07901191711
test_h1_row_norms_mean: 5.53150510788
test_h1_row_norms_min: 3.28636312485
test_objective: 0.136897221208
test_y_col_norms_max: 6.77247095108
test_y_col_norms_mean: 6.26096439362
test_y_col_norms_min: 5.51252508163
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.995941281319
test_y_min_max_class: 0.776527881622
test_y_misclass: 0.0178999938071
test_y_nll: 0.136897221208
test_y_row_norms_max: 1.87920343876
test_y_row_norms_mean: 0.587858736515
test_y_row_norms_min: 0.0247891973704
train_h0_col_norms_max: 6.55262708664
train_h0_col_norms_mean: 4.36338043213
train_h0_col_norms_min: 2.23605895042
train_h0_row_norms_max: 6.8740811348
train_h0_row_norms_mean: 3.41757678986
train_h0_row_norms_min: 0.171194568276
train_h1_col_norms_max: 6.00393533707
train_h1_col_norms_mean: 3.89241600037
train_h1_col_norms_min: 1.72637200356
train_h1_row_norms_max: 9.07898330688
train_h1_row_norms_mean: 5.53153181076
train_h1_row_norms_min: 3.28636193275
train_objective: 0.00148291292135
train_y_col_norms_max: 6.77250146866
train_y_col_norms_mean: 6.26093387604
train_y_col_norms_min: 5.51249742508
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.999666690826
train_y_min_max_class: 0.971082031727
train_y_misclass: 0.000460000039311
train_y_nll: 0.00148291292135
train_y_row_norms_max: 1.87921154499
train_y_row_norms_mean: 0.58786034584
train_y_row_norms_min: 0.0247890818864
valid_h0_col_norms_max: 6.55259990692
valid_h0_col_norms_mean: 4.36336374283
valid_h0_col_norms_min: 2.23605656624
valid_h0_row_norms_max: 6.8740735054
valid_h0_row_norms_mean: 3.41756176949
valid_h0_row_norms_min: 0.17119500041
valid_h1_col_norms_max: 6.0039639473
valid_h1_col_norms_mean: 3.89240264893
valid_h1_col_norms_min: 1.72636425495
valid_h1_row_norms_max: 9.07901191711
valid_h1_row_norms_mean: 5.53150510788
valid_h1_row_norms_min: 3.28636312485
valid_objective: 0.161794766784
valid_y_col_norms_max: 6.77247095108
valid_y_col_norms_mean: 6.26096439362
valid_y_col_norms_min: 5.51252508163
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.995917260647
valid_y_min_max_class: 0.753068387508
valid_y_misclass: 0.0201999936253
valid_y_nll: 0.161794766784
valid_y_row_norms_max: 1.87920343876
valid_y_row_norms_mean: 0.587858736515
valid_y_row_norms_min: 0.0247891973704
Time this epoch: 3.359274 seconds
Monitoring step:
Epochs seen: 44
Batches seen: 22000
Examples seen: 2200000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 6.55098342896
test_h0_col_norms_mean: 4.36544847488
test_h0_col_norms_min: 2.23605656624
test_h0_row_norms_max: 6.87497997284
test_h0_row_norms_mean: 3.41930341721
test_h0_row_norms_min: 0.171195015311
test_h1_col_norms_max: 6.00462388992
test_h1_col_norms_mean: 3.89291667938
test_h1_col_norms_min: 1.72640001774
test_h1_row_norms_max: 9.07387065887
test_h1_row_norms_mean: 5.53226518631
test_h1_row_norms_min: 3.28615379333
test_objective: 0.140558704734
test_y_col_norms_max: 6.78662919998
test_y_col_norms_mean: 6.26840209961
test_y_col_norms_min: 5.52506113052
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.996500074863
test_y_min_max_class: 0.794377505779
test_y_misclass: 0.0174999963492
test_y_nll: 0.140558704734
test_y_row_norms_max: 1.89163661003
test_y_row_norms_mean: 0.58866494894
test_y_row_norms_min: 0.0248291995376
train_h0_col_norms_max: 6.5510134697
train_h0_col_norms_mean: 4.36544466019
train_h0_col_norms_min: 2.23605895042
train_h0_row_norms_max: 6.87498617172
train_h0_row_norms_mean: 3.41928720474
train_h0_row_norms_min: 0.171194568276
train_h1_col_norms_max: 6.00459194183
train_h1_col_norms_mean: 3.89290046692
train_h1_col_norms_min: 1.72639226913
train_h1_row_norms_max: 9.07389450073
train_h1_row_norms_mean: 5.53226518631
train_h1_row_norms_min: 3.28615093231
train_objective: 0.000438805494923
train_y_col_norms_max: 6.7865986824
train_y_col_norms_mean: 6.2684264183
train_y_col_norms_min: 5.52509069443
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.999836444855
train_y_min_max_class: 0.985349237919
train_y_misclass: 0.000159999981406
train_y_nll: 0.000438805494923
train_y_row_norms_max: 1.89162778854
train_y_row_norms_mean: 0.588665127754
train_y_row_norms_min: 0.0248291157186
valid_h0_col_norms_max: 6.55098342896
valid_h0_col_norms_mean: 4.36544847488
valid_h0_col_norms_min: 2.23605656624
valid_h0_row_norms_max: 6.87497997284
valid_h0_row_norms_mean: 3.41930341721
valid_h0_row_norms_min: 0.171195015311
valid_h1_col_norms_max: 6.00462388992
valid_h1_col_norms_mean: 3.89291667938
valid_h1_col_norms_min: 1.72640001774
valid_h1_row_norms_max: 9.07387065887
valid_h1_row_norms_mean: 5.53226518631
valid_h1_row_norms_min: 3.28615379333
valid_objective: 0.157897502184
valid_y_col_norms_max: 6.78662919998
valid_y_col_norms_mean: 6.26840209961
valid_y_col_norms_min: 5.52506113052
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.995646238327
valid_y_min_max_class: 0.742088675499
valid_y_misclass: 0.0179999954998
valid_y_nll: 0.157897502184
valid_y_row_norms_max: 1.89163661003
valid_y_row_norms_mean: 0.58866494894
valid_y_row_norms_min: 0.0248291995376
Time this epoch: 3.258919 seconds
Monitoring step:
Epochs seen: 45
Batches seen: 22500
Examples seen: 2250000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 6.55773639679
test_h0_col_norms_mean: 4.36757230759
test_h0_col_norms_min: 2.23605656624
test_h0_row_norms_max: 6.88162136078
test_h0_row_norms_mean: 3.42107534409
test_h0_row_norms_min: 0.171194553375
test_h1_col_norms_max: 6.00479459763
test_h1_col_norms_mean: 3.89360809326
test_h1_col_norms_min: 1.72638893127
test_h1_row_norms_max: 9.08829307556
test_h1_row_norms_mean: 5.53330039978
test_h1_row_norms_min: 3.28643465042
test_objective: 0.172753751278
test_y_col_norms_max: 6.79764652252
test_y_col_norms_mean: 6.28485965729
test_y_col_norms_min: 5.55204916
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.99615073204
test_y_min_max_class: 0.77610886097
test_y_misclass: 0.020499991253
test_y_nll: 0.172753751278
test_y_row_norms_max: 1.87029504776
test_y_row_norms_mean: 0.590070128441
test_y_row_norms_min: 0.0248381886631
train_h0_col_norms_max: 6.55770730972
train_h0_col_norms_mean: 4.36758470535
train_h0_col_norms_min: 2.23605895042
train_h0_row_norms_max: 6.8816576004
train_h0_row_norms_mean: 3.4210703373
train_h0_row_norms_min: 0.171194195747
train_h1_col_norms_max: 6.00480556488
train_h1_col_norms_mean: 3.89361214638
train_h1_col_norms_min: 1.72638893127
train_h1_row_norms_max: 9.08830928802
train_h1_row_norms_mean: 5.53328752518
train_h1_row_norms_min: 3.28645133972
train_objective: 0.00231568375602
train_y_col_norms_max: 6.79761791229
train_y_col_norms_mean: 6.2848906517
train_y_col_norms_min: 5.55205202103
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.999481022358
train_y_min_max_class: 0.955595433712
train_y_misclass: 0.000539999979082
train_y_nll: 0.00231568375602
train_y_row_norms_max: 1.87028670311
train_y_row_norms_mean: 0.590068638325
train_y_row_norms_min: 0.0248383041471
valid_h0_col_norms_max: 6.55773639679
valid_h0_col_norms_mean: 4.36757230759
valid_h0_col_norms_min: 2.23605656624
valid_h0_row_norms_max: 6.88162136078
valid_h0_row_norms_mean: 3.42107534409
valid_h0_row_norms_min: 0.171194553375
valid_h1_col_norms_max: 6.00479459763
valid_h1_col_norms_mean: 3.89360809326
valid_h1_col_norms_min: 1.72638893127
valid_h1_row_norms_max: 9.08829307556
valid_h1_row_norms_mean: 5.53330039978
valid_h1_row_norms_min: 3.28643465042
valid_objective: 0.17547737062
valid_y_col_norms_max: 6.79764652252
valid_y_col_norms_mean: 6.28485965729
valid_y_col_norms_min: 5.55204916
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.995465695858
valid_y_min_max_class: 0.7330275774
valid_y_misclass: 0.020799998194
valid_y_nll: 0.17547737062
valid_y_row_norms_max: 1.87029504776
valid_y_row_norms_mean: 0.590070128441
valid_y_row_norms_min: 0.0248381886631
Time this epoch: 3.263050 seconds
Monitoring step:
Epochs seen: 46
Batches seen: 23000
Examples seen: 2300000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 6.59422492981
test_h0_col_norms_mean: 4.37052488327
test_h0_col_norms_min: 2.23605632782
test_h0_row_norms_max: 6.8726644516
test_h0_row_norms_mean: 3.42359375954
test_h0_row_norms_min: 0.171194955707
test_h1_col_norms_max: 6.00406217575
test_h1_col_norms_mean: 3.89443039894
test_h1_col_norms_min: 1.72642493248
test_h1_row_norms_max: 9.08111953735
test_h1_row_norms_mean: 5.53444480896
test_h1_row_norms_min: 3.28672647476
test_objective: 0.176214575768
test_y_col_norms_max: 6.79807567596
test_y_col_norms_mean: 6.29843473434
test_y_col_norms_min: 5.56106996536
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.996127128601
test_y_min_max_class: 0.767001569271
test_y_misclass: 0.0188999976963
test_y_nll: 0.176214575768
test_y_row_norms_max: 1.88375401497
test_y_row_norms_mean: 0.591480791569
test_y_row_norms_min: 0.0244950912893
train_h0_col_norms_max: 6.59419536591
train_h0_col_norms_mean: 4.37053442001
train_h0_col_norms_min: 2.23605895042
train_h0_row_norms_max: 6.87265539169
train_h0_row_norms_mean: 3.42357826233
train_h0_row_norms_min: 0.171194553375
train_h1_col_norms_max: 6.00408983231
train_h1_col_norms_mean: 3.89444637299
train_h1_col_norms_min: 1.72643446922
train_h1_row_norms_max: 9.08109664917
train_h1_row_norms_mean: 5.53442716599
train_h1_row_norms_min: 3.28672146797
train_objective: 0.00163910887204
train_y_col_norms_max: 6.79804325104
train_y_col_norms_mean: 6.29846715927
train_y_col_norms_min: 5.56109666824
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.999451816082
train_y_min_max_class: 0.952225148678
train_y_misclass: 0.00061999988975
train_y_nll: 0.00163910887204
train_y_row_norms_max: 1.88374614716
train_y_row_norms_mean: 0.591483712196
train_y_row_norms_min: 0.0244949962944
valid_h0_col_norms_max: 6.59422492981
valid_h0_col_norms_mean: 4.37052488327
valid_h0_col_norms_min: 2.23605632782
valid_h0_row_norms_max: 6.8726644516
valid_h0_row_norms_mean: 3.42359375954
valid_h0_row_norms_min: 0.171194955707
valid_h1_col_norms_max: 6.00406217575
valid_h1_col_norms_mean: 3.89443039894
valid_h1_col_norms_min: 1.72642493248
valid_h1_row_norms_max: 9.08111953735
valid_h1_row_norms_mean: 5.53444480896
valid_h1_row_norms_min: 3.28672647476
valid_objective: 0.186354964972
valid_y_col_norms_max: 6.79807567596
valid_y_col_norms_mean: 6.29843473434
valid_y_col_norms_min: 5.56106996536
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.99598556757
valid_y_min_max_class: 0.759403705597
valid_y_misclass: 0.0206999909133
valid_y_nll: 0.186354964972
valid_y_row_norms_max: 1.88375401497
valid_y_row_norms_mean: 0.591480791569
valid_y_row_norms_min: 0.0244950912893
Time this epoch: 3.263870 seconds
Monitoring step:
Epochs seen: 47
Batches seen: 23500
Examples seen: 2350000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 6.61253595352
test_h0_col_norms_mean: 4.37271356583
test_h0_col_norms_min: 2.23605632782
test_h0_row_norms_max: 6.87648153305
test_h0_row_norms_mean: 3.425365448
test_h0_row_norms_min: 0.17119538784
test_h1_col_norms_max: 6.00418663025
test_h1_col_norms_mean: 3.8950676918
test_h1_col_norms_min: 1.72642803192
test_h1_row_norms_max: 9.10107326508
test_h1_row_norms_mean: 5.53528594971
test_h1_row_norms_min: 3.28674340248
test_objective: 0.160995185375
test_y_col_norms_max: 6.79669380188
test_y_col_norms_mean: 6.31103897095
test_y_col_norms_min: 5.58734273911
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.995669007301
test_y_min_max_class: 0.77006238699
test_y_misclass: 0.0187999941409
test_y_nll: 0.160995185375
test_y_row_norms_max: 1.89035248756
test_y_row_norms_mean: 0.592762053013
test_y_row_norms_min: 0.0258602239192
train_h0_col_norms_max: 6.61253833771
train_h0_col_norms_mean: 4.37270545959
train_h0_col_norms_min: 2.23605895042
train_h0_row_norms_max: 6.87647247314
train_h0_row_norms_mean: 3.42536139488
train_h0_row_norms_min: 0.171194672585
train_h1_col_norms_max: 6.00415945053
train_h1_col_norms_mean: 3.89505052567
train_h1_col_norms_min: 1.72643530369
train_h1_row_norms_max: 9.10108375549
train_h1_row_norms_mean: 5.53529548645
train_h1_row_norms_min: 3.28672790527
train_objective: 0.0021845579613
train_y_col_norms_max: 6.79666471481
train_y_col_norms_mean: 6.31101417542
train_y_col_norms_min: 5.58734083176
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.999446511269
train_y_min_max_class: 0.954655766487
train_y_misclass: 0.000679999880958
train_y_nll: 0.0021845579613
train_y_row_norms_max: 1.89036035538
train_y_row_norms_mean: 0.59276509285
train_y_row_norms_min: 0.0258601009846
valid_h0_col_norms_max: 6.61253595352
valid_h0_col_norms_mean: 4.37271356583
valid_h0_col_norms_min: 2.23605632782
valid_h0_row_norms_max: 6.87648153305
valid_h0_row_norms_mean: 3.425365448
valid_h0_row_norms_min: 0.17119538784
valid_h1_col_norms_max: 6.00418663025
valid_h1_col_norms_mean: 3.8950676918
valid_h1_col_norms_min: 1.72642803192
valid_h1_row_norms_max: 9.10107326508
valid_h1_row_norms_mean: 5.53528594971
valid_h1_row_norms_min: 3.28674340248
valid_objective: 0.158408492804
valid_y_col_norms_max: 6.79669380188
valid_y_col_norms_mean: 6.31103897095
valid_y_col_norms_min: 5.58734273911
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.995150506496
valid_y_min_max_class: 0.737409770489
valid_y_misclass: 0.0196999944746
valid_y_nll: 0.158408492804
valid_y_row_norms_max: 1.89035248756
valid_y_row_norms_mean: 0.592762053013
valid_y_row_norms_min: 0.0258602239192
Time this epoch: 3.246123 seconds
Monitoring step:
Epochs seen: 48
Batches seen: 24000
Examples seen: 2400000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 6.60942840576
test_h0_col_norms_mean: 4.37416505814
test_h0_col_norms_min: 2.23605632782
test_h0_row_norms_max: 6.88673448563
test_h0_row_norms_mean: 3.42650437355
test_h0_row_norms_min: 0.171197414398
test_h1_col_norms_max: 6.00638771057
test_h1_col_norms_mean: 3.89544963837
test_h1_col_norms_min: 1.72642791271
test_h1_row_norms_max: 9.09871959686
test_h1_row_norms_mean: 5.53591918945
test_h1_row_norms_min: 3.2867603302
test_objective: 0.175691723824
test_y_col_norms_max: 6.78237819672
test_y_col_norms_mean: 6.3183298111
test_y_col_norms_min: 5.59972047806
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.996128737926
test_y_min_max_class: 0.775415062904
test_y_misclass: 0.0199999921024
test_y_nll: 0.175691723824
test_y_row_norms_max: 1.88972866535
test_y_row_norms_mean: 0.593335032463
test_y_row_norms_min: 0.0257790517062
train_h0_col_norms_max: 6.60943603516
train_h0_col_norms_mean: 4.37415838242
train_h0_col_norms_min: 2.23605895042
train_h0_row_norms_max: 6.88672494888
train_h0_row_norms_mean: 3.4265191555
train_h0_row_norms_min: 0.171197369695
train_h1_col_norms_max: 6.00641536713
train_h1_col_norms_mean: 3.8954308033
train_h1_col_norms_min: 1.72643482685
train_h1_row_norms_max: 9.09872722626
train_h1_row_norms_mean: 5.53590679169
train_h1_row_norms_min: 3.28674817085
train_objective: 0.000762883864809
train_y_col_norms_max: 6.78235006332
train_y_col_norms_mean: 6.31833028793
train_y_col_norms_min: 5.59973287582
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.99974834919
train_y_min_max_class: 0.978080093861
train_y_misclass: 0.000239999950281
train_y_nll: 0.000762883864809
train_y_row_norms_max: 1.88971841335
train_y_row_norms_mean: 0.593334615231
train_y_row_norms_min: 0.0257790144533
valid_h0_col_norms_max: 6.60942840576
valid_h0_col_norms_mean: 4.37416505814
valid_h0_col_norms_min: 2.23605632782
valid_h0_row_norms_max: 6.88673448563
valid_h0_row_norms_mean: 3.42650437355
valid_h0_row_norms_min: 0.171197414398
valid_h1_col_norms_max: 6.00638771057
valid_h1_col_norms_mean: 3.89544963837
valid_h1_col_norms_min: 1.72642791271
valid_h1_row_norms_max: 9.09871959686
valid_h1_row_norms_mean: 5.53591918945
valid_h1_row_norms_min: 3.2867603302
valid_objective: 0.178655579686
valid_y_col_norms_max: 6.78237819672
valid_y_col_norms_mean: 6.3183298111
valid_y_col_norms_min: 5.59972047806
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.996117174625
valid_y_min_max_class: 0.773514211178
valid_y_misclass: 0.0190999954939
valid_y_nll: 0.178655579686
valid_y_row_norms_max: 1.88972866535
valid_y_row_norms_mean: 0.593335032463
valid_y_row_norms_min: 0.0257790517062
Time this epoch: 3.274107 seconds
Monitoring step:
Epochs seen: 49
Batches seen: 24500
Examples seen: 2450000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 6.61158514023
test_h0_col_norms_mean: 4.37596178055
test_h0_col_norms_min: 2.23605632782
test_h0_row_norms_max: 6.89095163345
test_h0_row_norms_mean: 3.42802858353
test_h0_row_norms_min: 0.171208888292
test_h1_col_norms_max: 6.00729322433
test_h1_col_norms_mean: 3.89590859413
test_h1_col_norms_min: 1.72634100914
test_h1_row_norms_max: 9.11584568024
test_h1_row_norms_mean: 5.53655290604
test_h1_row_norms_min: 3.28674292564
test_objective: 0.173922881484
test_y_col_norms_max: 6.80417919159
test_y_col_norms_mean: 6.32538461685
test_y_col_norms_min: 5.59382343292
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.995864152908
test_y_min_max_class: 0.774653494358
test_y_misclass: 0.0195999965072
test_y_nll: 0.173922881484
test_y_row_norms_max: 1.90011572838
test_y_row_norms_mean: 0.593958258629
test_y_row_norms_min: 0.0259114392102
train_h0_col_norms_max: 6.61158514023
train_h0_col_norms_mean: 4.37594175339
train_h0_col_norms_min: 2.23605895042
train_h0_row_norms_max: 6.89095973969
train_h0_row_norms_mean: 3.42801046371
train_h0_row_norms_min: 0.171208947897
train_h1_col_norms_max: 6.00726985931
train_h1_col_norms_mean: 3.89590215683
train_h1_col_norms_min: 1.7263327837
train_h1_row_norms_max: 9.11585617065
train_h1_row_norms_mean: 5.53655576706
train_h1_row_norms_min: 3.28672790527
train_objective: 0.00186892366037
train_y_col_norms_max: 6.80421066284
train_y_col_norms_mean: 6.32541131973
train_y_col_norms_min: 5.59379386902
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.999438583851
train_y_min_max_class: 0.950516223907
train_y_misclass: 0.000539999979082
train_y_nll: 0.00186892366037
train_y_row_norms_max: 1.90012443066
train_y_row_norms_mean: 0.59395968914
train_y_row_norms_min: 0.025911314413
valid_h0_col_norms_max: 6.61158514023
valid_h0_col_norms_mean: 4.37596178055
valid_h0_col_norms_min: 2.23605632782
valid_h0_row_norms_max: 6.89095163345
valid_h0_row_norms_mean: 3.42802858353
valid_h0_row_norms_min: 0.171208888292
valid_h1_col_norms_max: 6.00729322433
valid_h1_col_norms_mean: 3.89590859413
valid_h1_col_norms_min: 1.72634100914
valid_h1_row_norms_max: 9.11584568024
valid_h1_row_norms_mean: 5.53655290604
valid_h1_row_norms_min: 3.28674292564
valid_objective: 0.167324125767
valid_y_col_norms_max: 6.80417919159
valid_y_col_norms_mean: 6.32538461685
valid_y_col_norms_min: 5.59382343292
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.995694279671
valid_y_min_max_class: 0.751230061054
valid_y_misclass: 0.0211999900639
valid_y_nll: 0.167324125767
valid_y_row_norms_max: 1.90011572838
valid_y_row_norms_mean: 0.593958258629
valid_y_row_norms_min: 0.0259114392102
Time this epoch: 3.276921 seconds
Monitoring step:
Epochs seen: 50
Batches seen: 25000
Examples seen: 2500000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 6.61482906342
test_h0_col_norms_mean: 4.37731599808
test_h0_col_norms_min: 2.23605632782
test_h0_row_norms_max: 6.90527057648
test_h0_row_norms_mean: 3.42910242081
test_h0_row_norms_min: 0.171211406589
test_h1_col_norms_max: 6.01254796982
test_h1_col_norms_mean: 3.89633321762
test_h1_col_norms_min: 1.72635316849
test_h1_row_norms_max: 9.1068277359
test_h1_row_norms_mean: 5.53717756271
test_h1_row_norms_min: 3.28689336777
test_objective: 0.178737580776
test_y_col_norms_max: 6.81128787994
test_y_col_norms_mean: 6.33507156372
test_y_col_norms_min: 5.58309650421
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.99630522728
test_y_min_max_class: 0.788845181465
test_y_misclass: 0.0197999924421
test_y_nll: 0.178737580776
test_y_row_norms_max: 1.93474268913
test_y_row_norms_mean: 0.594771564007
test_y_row_norms_min: 0.0260054916143
train_h0_col_norms_max: 6.6148557663
train_h0_col_norms_mean: 4.37733840942
train_h0_col_norms_min: 2.23605895042
train_h0_row_norms_max: 6.90530347824
train_h0_row_norms_mean: 3.42908787727
train_h0_row_norms_min: 0.171211794019
train_h1_col_norms_max: 6.01251840591
train_h1_col_norms_mean: 3.89634943008
train_h1_col_norms_min: 1.72634553909
train_h1_row_norms_max: 9.10683345795
train_h1_row_norms_mean: 5.53719091415
train_h1_row_norms_min: 3.28687477112
train_objective: 0.00155572697986
train_y_col_norms_max: 6.81132364273
train_y_col_norms_mean: 6.33503913879
train_y_col_norms_min: 5.58306837082
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.999542534351
train_y_min_max_class: 0.959059894085
train_y_misclass: 0.000539999920875
train_y_nll: 0.00155572697986
train_y_row_norms_max: 1.93475210667
train_y_row_norms_mean: 0.594768106937
train_y_row_norms_min: 0.0260053742677
valid_h0_col_norms_max: 6.61482906342
valid_h0_col_norms_mean: 4.37731599808
valid_h0_col_norms_min: 2.23605632782
valid_h0_row_norms_max: 6.90527057648
valid_h0_row_norms_mean: 3.42910242081
valid_h0_row_norms_min: 0.171211406589
valid_h1_col_norms_max: 6.01254796982
valid_h1_col_norms_mean: 3.89633321762
valid_h1_col_norms_min: 1.72635316849
valid_h1_row_norms_max: 9.1068277359
valid_h1_row_norms_mean: 5.53717756271
valid_h1_row_norms_min: 3.28689336777
valid_objective: 0.168507456779
valid_y_col_norms_max: 6.81128787994
valid_y_col_norms_mean: 6.33507156372
valid_y_col_norms_min: 5.58309650421
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.996147751808
valid_y_min_max_class: 0.789148688316
valid_y_misclass: 0.0197999924421
valid_y_nll: 0.168507456779
valid_y_row_norms_max: 1.93474268913
valid_y_row_norms_mean: 0.594771564007
valid_y_row_norms_min: 0.0260054916143
Time this epoch: 3.200028 seconds
Monitoring step:
Epochs seen: 51
Batches seen: 25500
Examples seen: 2550000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 6.6147813797
test_h0_col_norms_mean: 4.37828779221
test_h0_col_norms_min: 2.23605632782
test_h0_row_norms_max: 6.90187358856
test_h0_row_norms_mean: 3.42986750603
test_h0_row_norms_min: 0.171211406589
test_h1_col_norms_max: 6.0115852356
test_h1_col_norms_mean: 3.89659976959
test_h1_col_norms_min: 1.72635293007
test_h1_row_norms_max: 9.1057882309
test_h1_row_norms_mean: 5.53757143021
test_h1_row_norms_min: 3.28781795502
test_objective: 0.172010108829
test_y_col_norms_max: 6.82110786438
test_y_col_norms_mean: 6.33999061584
test_y_col_norms_min: 5.59692811966
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.996686458588
test_y_min_max_class: 0.808368086815
test_y_misclass: 0.0198999941349
test_y_nll: 0.172010108829
test_y_row_norms_max: 1.93597054482
test_y_row_norms_mean: 0.595229923725
test_y_row_norms_min: 0.0260351337492
train_h0_col_norms_max: 6.61475372314
train_h0_col_norms_mean: 4.37829828262
train_h0_col_norms_min: 2.23605895042
train_h0_row_norms_max: 6.9019112587
train_h0_row_norms_mean: 3.42988300323
train_h0_row_norms_min: 0.171211794019
train_h1_col_norms_max: 6.01156425476
train_h1_col_norms_mean: 3.89659571648
train_h1_col_norms_min: 1.72634553909
train_h1_row_norms_max: 9.10573482513
train_h1_row_norms_mean: 5.53757476807
train_h1_row_norms_min: 3.28780126572
train_objective: 0.00100987718906
train_y_col_norms_max: 6.8211388588
train_y_col_norms_mean: 6.34001255035
train_y_col_norms_min: 5.59689760208
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.999795496464
train_y_min_max_class: 0.98121035099
train_y_misclass: 0.000219999958063
train_y_nll: 0.00100987718906
train_y_row_norms_max: 1.93596208096
train_y_row_norms_mean: 0.59522998333
train_y_row_norms_min: 0.0260351262987
valid_h0_col_norms_max: 6.6147813797
valid_h0_col_norms_mean: 4.37828779221
valid_h0_col_norms_min: 2.23605632782
valid_h0_row_norms_max: 6.90187358856
valid_h0_row_norms_mean: 3.42986750603
valid_h0_row_norms_min: 0.171211406589
valid_h1_col_norms_max: 6.0115852356
valid_h1_col_norms_mean: 3.89659976959
valid_h1_col_norms_min: 1.72635293007
valid_h1_row_norms_max: 9.1057882309
valid_h1_row_norms_mean: 5.53757143021
valid_h1_row_norms_min: 3.28781795502
valid_objective: 0.175070494413
valid_y_col_norms_max: 6.82110786438
valid_y_col_norms_mean: 6.33999061584
valid_y_col_norms_min: 5.59692811966
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.996282696724
valid_y_min_max_class: 0.780044913292
valid_y_misclass: 0.0196999944746
valid_y_nll: 0.175070494413
valid_y_row_norms_max: 1.93597054482
valid_y_row_norms_mean: 0.595229923725
valid_y_row_norms_min: 0.0260351337492
Time this epoch: 3.239240 seconds
Monitoring step:
Epochs seen: 52
Batches seen: 26000
Examples seen: 2600000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 6.61600589752
test_h0_col_norms_mean: 4.37898588181
test_h0_col_norms_min: 2.23605632782
test_h0_row_norms_max: 6.90484142303
test_h0_row_norms_mean: 3.43044447899
test_h0_row_norms_min: 0.171211466193
test_h1_col_norms_max: 6.01123189926
test_h1_col_norms_mean: 3.8968091011
test_h1_col_norms_min: 1.72635233402
test_h1_row_norms_max: 9.11070537567
test_h1_row_norms_mean: 5.53788709641
test_h1_row_norms_min: 3.28822088242
test_objective: 0.160425424576
test_y_col_norms_max: 6.80656099319
test_y_col_norms_mean: 6.34523868561
test_y_col_norms_min: 5.59447908401
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.996790707111
test_y_min_max_class: 0.826726913452
test_y_misclass: 0.0184999965131
test_y_nll: 0.160425424576
test_y_row_norms_max: 1.94036662579
test_y_row_norms_mean: 0.595700562
test_y_row_norms_min: 0.0263105537742
train_h0_col_norms_max: 6.61604356766
train_h0_col_norms_mean: 4.37900781631
train_h0_col_norms_min: 2.23605895042
train_h0_row_norms_max: 6.9048409462
train_h0_row_norms_mean: 3.43045806885
train_h0_row_norms_min: 0.171212136745
train_h1_col_norms_max: 6.01124334335
train_h1_col_norms_mean: 3.89682626724
train_h1_col_norms_min: 1.72634339333
train_h1_row_norms_max: 9.11072158813
train_h1_row_norms_mean: 5.53790187836
train_h1_row_norms_min: 3.28823471069
train_objective: 0.000158851937158
train_y_col_norms_max: 6.8065943718
train_y_col_norms_mean: 6.34526729584
train_y_col_norms_min: 5.59449052811
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.999920845032
train_y_min_max_class: 0.992827177048
train_y_misclass: 7.9999997979e-05
train_y_nll: 0.000158851937158
train_y_row_norms_max: 1.94036877155
train_y_row_norms_mean: 0.595700562
train_y_row_norms_min: 0.0263105835766
valid_h0_col_norms_max: 6.61600589752
valid_h0_col_norms_mean: 4.37898588181
valid_h0_col_norms_min: 2.23605632782
valid_h0_row_norms_max: 6.90484142303
valid_h0_row_norms_mean: 3.43044447899
valid_h0_row_norms_min: 0.171211466193
valid_h1_col_norms_max: 6.01123189926
valid_h1_col_norms_mean: 3.8968091011
valid_h1_col_norms_min: 1.72635233402
valid_h1_row_norms_max: 9.11070537567
valid_h1_row_norms_mean: 5.53788709641
valid_h1_row_norms_min: 3.28822088242
valid_objective: 0.169489264488
valid_y_col_norms_max: 6.80656099319
valid_y_col_norms_mean: 6.34523868561
valid_y_col_norms_min: 5.59447908401
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.996821761131
valid_y_min_max_class: 0.813496589661
valid_y_misclass: 0.0197999961674
valid_y_nll: 0.169489264488
valid_y_row_norms_max: 1.94036662579
valid_y_row_norms_mean: 0.595700562
valid_y_row_norms_min: 0.0263105537742
Time this epoch: 3.259741 seconds
Monitoring step:
Epochs seen: 53
Batches seen: 26500
Examples seen: 2650000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 6.6160197258
test_h0_col_norms_mean: 4.3792681694
test_h0_col_norms_min: 2.23605632782
test_h0_row_norms_max: 6.90263414383
test_h0_row_norms_mean: 3.43065404892
test_h0_row_norms_min: 0.171211466193
test_h1_col_norms_max: 6.01123523712
test_h1_col_norms_mean: 3.89689803123
test_h1_col_norms_min: 1.72635245323
test_h1_row_norms_max: 9.11182498932
test_h1_row_norms_mean: 5.53798723221
test_h1_row_norms_min: 3.2883348465
test_objective: 0.151598215103
test_y_col_norms_max: 6.82387685776
test_y_col_norms_mean: 6.34804153442
test_y_col_norms_min: 5.58777189255
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.99653673172
test_y_min_max_class: 0.809294104576
test_y_misclass: 0.0181999951601
test_y_nll: 0.151598215103
test_y_row_norms_max: 1.94308698177
test_y_row_norms_mean: 0.595957636833
test_y_row_norms_min: 0.0262990482152
train_h0_col_norms_max: 6.61604738235
train_h0_col_norms_mean: 4.3792719841
train_h0_col_norms_min: 2.23605895042
train_h0_row_norms_max: 6.9026427269
train_h0_row_norms_mean: 3.43063807487
train_h0_row_norms_min: 0.171212136745
train_h1_col_norms_max: 6.01124382019
train_h1_col_norms_mean: 3.89692115784
train_h1_col_norms_min: 1.72634339333
train_h1_row_norms_max: 9.11186790466
train_h1_row_norms_mean: 5.53798723221
train_h1_row_norms_min: 3.28835225105
train_objective: 0.000210020065424
train_y_col_norms_max: 6.82384681702
train_y_col_norms_mean: 6.348072052
train_y_col_norms_min: 5.58779907227
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.999953627586
train_y_min_max_class: 0.996000170708
train_y_misclass: 5.99999984843e-05
train_y_nll: 0.000210020065424
train_y_row_norms_max: 1.94309675694
train_y_row_norms_mean: 0.595957219601
train_y_row_norms_min: 0.0262989122421
valid_h0_col_norms_max: 6.6160197258
valid_h0_col_norms_mean: 4.3792681694
valid_h0_col_norms_min: 2.23605632782
valid_h0_row_norms_max: 6.90263414383
valid_h0_row_norms_mean: 3.43065404892
valid_h0_row_norms_min: 0.171211466193
valid_h1_col_norms_max: 6.01123523712
valid_h1_col_norms_mean: 3.89689803123
valid_h1_col_norms_min: 1.72635245323
valid_h1_row_norms_max: 9.11182498932
valid_h1_row_norms_mean: 5.53798723221
valid_h1_row_norms_min: 3.2883348465
valid_objective: 0.163225889206
valid_y_col_norms_max: 6.82387685776
valid_y_col_norms_mean: 6.34804153442
valid_y_col_norms_min: 5.58777189255
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.996884763241
valid_y_min_max_class: 0.818299531937
valid_y_misclass: 0.0186999943107
valid_y_nll: 0.163225889206
valid_y_row_norms_max: 1.94308698177
valid_y_row_norms_mean: 0.595957636833
valid_y_row_norms_min: 0.0262990482152
Time this epoch: 3.224364 seconds
Monitoring step:
Epochs seen: 54
Batches seen: 27000
Examples seen: 2700000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 6.61593580246
test_h0_col_norms_mean: 4.37931966782
test_h0_col_norms_min: 2.23605632782
test_h0_row_norms_max: 6.90164899826
test_h0_row_norms_mean: 3.43070554733
test_h0_row_norms_min: 0.171211466193
test_h1_col_norms_max: 6.01123189926
test_h1_col_norms_mean: 3.89690995216
test_h1_col_norms_min: 1.72635293007
test_h1_row_norms_max: 9.10886287689
test_h1_row_norms_mean: 5.53801727295
test_h1_row_norms_min: 3.28835773468
test_objective: 0.156616300344
test_y_col_norms_max: 6.82711935043
test_y_col_norms_mean: 6.34913825989
test_y_col_norms_min: 5.58710432053
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.996679246426
test_y_min_max_class: 0.821564376354
test_y_misclass: 0.0183999948204
test_y_nll: 0.156616300344
test_y_row_norms_max: 1.94326412678
test_y_row_norms_mean: 0.596061944962
test_y_row_norms_min: 0.0262778773904
train_h0_col_norms_max: 6.61593151093
train_h0_col_norms_mean: 4.37929821014
train_h0_col_norms_min: 2.23605895042
train_h0_row_norms_max: 6.90168523788
train_h0_row_norms_mean: 3.43071746826
train_h0_row_norms_min: 0.171212136745
train_h1_col_norms_max: 6.01124286652
train_h1_col_norms_mean: 3.89692831039
train_h1_col_norms_min: 1.72634553909
train_h1_row_norms_max: 9.10885238647
train_h1_row_norms_mean: 5.53799200058
train_h1_row_norms_min: 3.28836083412
train_objective: 1.18671387099e-05
train_y_col_norms_max: 6.82711696625
train_y_col_norms_mean: 6.34910583496
train_y_col_norms_min: 5.5871014595
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.999985456467
train_y_min_max_class: 0.999015629292
train_y_misclass: 0.0
train_y_nll: 1.18671387099e-05
train_y_row_norms_max: 1.94327509403
train_y_row_norms_mean: 0.596063792706
train_y_row_norms_min: 0.0262779761106
valid_h0_col_norms_max: 6.61593580246
valid_h0_col_norms_mean: 4.37931966782
valid_h0_col_norms_min: 2.23605632782
valid_h0_row_norms_max: 6.90164899826
valid_h0_row_norms_mean: 3.43070554733
valid_h0_row_norms_min: 0.171211466193
valid_h1_col_norms_max: 6.01123189926
valid_h1_col_norms_mean: 3.89690995216
valid_h1_col_norms_min: 1.72635293007
valid_h1_row_norms_max: 9.10886287689
valid_h1_row_norms_mean: 5.53801727295
valid_h1_row_norms_min: 3.28835773468
valid_objective: 0.168142601848
valid_y_col_norms_max: 6.82711935043
valid_y_col_norms_mean: 6.34913825989
valid_y_col_norms_min: 5.58710432053
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.997302770615
valid_y_min_max_class: 0.829375386238
valid_y_misclass: 0.0184999927878
valid_y_nll: 0.168142601848
valid_y_row_norms_max: 1.94326412678
valid_y_row_norms_mean: 0.596061944962
valid_y_row_norms_min: 0.0262778773904
In [8]:
!print_monitor.py mlp_2_best.pkl | grep test_y_misclass
Using gpu device 2: GeForce GTX 285
/u/goodfeli/pylearn2/models/mlp.py:36: UserWarning: MLP changing the recursion limit.
warnings.warn("MLP changing the recursion limit.")
test_y_misclass : 0.0174999963492
Using the deeper architecture, rectifier units, and SGD brought the test error rate down from 1.94% to 1.75%.
In softmax_regression.ipynb, we discussed the problem of overfitting, and how early stopping guided by validation set performance can result in better test set performance. Another way to prevent overfitting is to explicitly change the cost function to discourage overfitting.
The best way to prevent overfitting is to use Bayesian inference to predict labels on the new data. Suppose we have been given a dataset $\mathcal{D}$, and we want to classify a new point $x'$. Call its uknown label $y'$. Suppose that we also have a probability distribution over all possible model parameters, and that we call the set of all parameters $\theta$. Then
$$p(y' \mid x', \mathcal{D} ) = \int p(y', \theta \mid x', \mathcal{D}) d \theta $$$$ = \int p( y' \mid x' , \theta ) p( \theta \mid \mathcal{D} ) d \theta $$$$ \propto \int p( y' \mid x' , \theta ) p( \mathcal{D} \mid \theta ) p(\theta) d \theta $$(On the last line, we only worry about computing the distribution over $y'$ up to a constant, because we can easily find this constant by summing over the $k$ possible values of $y'$)
In other words, the right thing to do is to have all of the infinitely many possible values of $\theta$ vote on how to classify $x'$, with each value of $\theta$'s vote weighted by $p(\theta) p(\mathcal{D} \mid \theta)$.
Unfortunately, while conceptually straight forward, there is not an obvious way to evaluate this integral for a large multilayer perceptron. Instead, we assume that the distribution $p(\theta) p(\mathcal{D} \mid \theta)$ is very peaked, so that we can get a good prediction by using the single most likely value of $\theta$.
This suggests that we should maximize $p(\theta) p(\mathcal{D} \mid \theta)$, rather than maximizing $p(\mathcal{D} \mid \theta)$ as we have so far. Note that in log space, this is $\log p(\theta) + \log p( \mathcal{D} \mid \theta)$. We can thus add regularization to our training procedure by adding a term for $\log p(\theta)$ to our objective function.
This is very easy to do in pylearn2 using the SumOfCosts class. The following YAML string sets up the same experiment as before, but using SumOfCosts to add a regularization term. Before, we did not specify the "cost" argument to the training algorithm. The model provided the training algorithm with a default cost. Now, we specify that the cost should be the sum of two different costs. The first is the Default cost, which just asks the output layer what cost to use. This is the same cost we have implicitly been using all along, because models.mlp.MLP.get_default_cost() returns costs.mlp.Default(). The second term of our new cost function is called WeightDecay, and it implements a prior on our model parameters $\theta$.
In [9]:
import os
import pylearn2
path = os.path.join(pylearn2.__path__[0], 'scripts', 'tutorials', 'multilayer_perceptron', 'mlp_tutorial_part_4.yaml')
with open(path, 'r') as f:
train_3 = f.read()
hyper_params = {'train_stop' : 50000,
'valid_stop' : 60000,
'dim_h0' : 500,
'dim_h1' : 1000,
'sparse_init_h1' : 15,
'max_epochs' : 10000,
'save_path' : '.'}
train_3 = train_3 % (hyper_params)
print train_3
!obj:pylearn2.train.Train {
dataset: &train !obj:pylearn2.datasets.mnist.MNIST {
which_set: 'train',
start: 0,
stop: 50000
},
model: !obj:pylearn2.models.mlp.MLP {
layers: [ !obj:pylearn2.models.mlp.RectifiedLinear {
layer_name: 'h0',
dim: 500,
sparse_init: 15
}, !obj:pylearn2.models.mlp.RectifiedLinear {
layer_name: 'h1',
dim: 500,
sparse_init: 15
}, !obj:pylearn2.models.mlp.Softmax {
layer_name: 'y',
n_classes: 10,
irange: 0.
}
],
nvis: 784,
},
algorithm: !obj:pylearn2.training_algorithms.sgd.SGD {
batch_size: 100,
learning_rate: .01,
monitoring_dataset:
{
'train' : *train,
'valid' : !obj:pylearn2.datasets.mnist.MNIST {
which_set: 'train',
start: 50000,
stop: 60000
},
'test' : !obj:pylearn2.datasets.mnist.MNIST {
which_set: 'test',
}
},
cost: !obj:pylearn2.costs.cost.SumOfCosts { costs: [
!obj:pylearn2.costs.mlp.Default {
}, !obj:pylearn2.costs.mlp.WeightDecay {
coeffs: [ .00005, .00005, .00005 ]
}
]
},
learning_rule: !obj:pylearn2.training_algorithms.learning_rule.Momentum {
init_momentum: .5
},
termination_criterion: !obj:pylearn2.termination_criteria.And {
criteria: [
!obj:pylearn2.termination_criteria.MonitorBased {
channel_name: "valid_y_misclass",
prop_decrease: 0.,
N: 10
},
!obj:pylearn2.termination_criteria.EpochCounter {
max_epochs: 10000
}
]
}
},
extensions: [
!obj:pylearn2.train_extensions.best_params.MonitorBasedSaveBest {
channel_name: 'valid_y_misclass',
save_path: "mlp_3_best.pkl"
}, !obj:pylearn2.training_algorithms.learning_rule.MomentumAdjustor {
start: 1,
saturate: 10,
final_momentum: .99
}
]
}
The WeightDecay class adds a cost based on the sum of the squares of the elements of $W$ for the different layers, multiplying each by a different coefficient. This corresponds to $p(\theta)$ being Gaussian distribution on $W$, with a diagonal covariance matrix. (We don't regularize $b$, which is a bit of a hack, but can be thought of as putting extremely high variance on $b$ in the prior) In other words, our prior belief about $\theta$ is that the weights should be small. This basically says that, all else being equal, the different units in our network shouldn't interact with each other. Compared to the unregularized network, a network trained with weight decay wants to see more evidence that two units should interact before it allows them to do so.
Note that the SumOfCosts class doesn't explicitly have anything to do with the MLP. There is no requirement that the cost function be closely tied to the code for a particular model in pylearn2. This gives you great flexibility in the kind of experiments pylearn2 can run. The SumOfCosts class allows you to combine several pre-existing building blocks in pylearn2. By implementing your own cost classes, you can get even greater flexibility.
Of course, some costs are tightly integrated with a specific kind of model. The costs.mlp.Default cost expects to be able to ask a model for its last layer, and ask that layer what kind of cost to apply to the target values $y$ and an estimate of them produced by calling the model's fprop method. This implies that the cost can really only be used with MLP subclasses. Likewise, the WeightDecay cost depends on the assumption that the model is organized into layers and each layer has a single weight matrix. This means that it can only be used with an MLP,and even then only with layers that are governed by a weight matrix. It's OK to make a Cost that is this tightly integrated with a specific kind of model. Doing so is inevitable. Usually in pylearn2 we put the costs for a specific model family in their own submodule of pylearn2 so it's easy to tell what models they can be used with.
We now show what happens when you train the regularized MLP:
In [10]:
from pylearn2.config import yaml_parse
train_3 = yaml_parse.load(train_3)
train_3.main_loop()
Parameter and initial learning rate summary:
h0_W: 0.00999999977648
h0_b: 0.00999999977648
h1_W: 0.00999999977648
h1_b: 0.00999999977648
softmax_b: 0.00999999977648
softmax_W: 0.00999999977648
Compiling sgd_update...
Compiling sgd_update done. Time elapsed: 2.973035 seconds
compiling begin_record_entry...
compiling begin_record_entry done. Time elapsed: 0.457965 seconds
Monitored channels:
learning_rate
momentum
test_h0_col_norms_max
test_h0_col_norms_mean
test_h0_col_norms_min
test_h0_row_norms_max
test_h0_row_norms_mean
test_h0_row_norms_min
test_h1_col_norms_max
test_h1_col_norms_mean
test_h1_col_norms_min
test_h1_row_norms_max
test_h1_row_norms_mean
test_h1_row_norms_min
test_objective
test_term_0
test_term_1_weight_decay
test_y_col_norms_max
test_y_col_norms_mean
test_y_col_norms_min
test_y_max_max_class
test_y_mean_max_class
test_y_min_max_class
test_y_misclass
test_y_nll
test_y_row_norms_max
test_y_row_norms_mean
test_y_row_norms_min
train_h0_col_norms_max
train_h0_col_norms_mean
train_h0_col_norms_min
train_h0_row_norms_max
train_h0_row_norms_mean
train_h0_row_norms_min
train_h1_col_norms_max
train_h1_col_norms_mean
train_h1_col_norms_min
train_h1_row_norms_max
train_h1_row_norms_mean
train_h1_row_norms_min
train_objective
train_term_0
train_term_1_weight_decay
train_y_col_norms_max
train_y_col_norms_mean
train_y_col_norms_min
train_y_max_max_class
train_y_mean_max_class
train_y_min_max_class
train_y_misclass
train_y_nll
train_y_row_norms_max
train_y_row_norms_mean
train_y_row_norms_min
valid_h0_col_norms_max
valid_h0_col_norms_mean
valid_h0_col_norms_min
valid_h0_row_norms_max
valid_h0_row_norms_mean
valid_h0_row_norms_min
valid_h1_col_norms_max
valid_h1_col_norms_mean
valid_h1_col_norms_min
valid_h1_row_norms_max
valid_h1_row_norms_mean
valid_h1_row_norms_min
valid_objective
valid_term_0
valid_term_1_weight_decay
valid_y_col_norms_max
valid_y_col_norms_mean
valid_y_col_norms_min
valid_y_max_max_class
valid_y_mean_max_class
valid_y_min_max_class
valid_y_misclass
valid_y_nll
valid_y_row_norms_max
valid_y_row_norms_mean
valid_y_row_norms_min
Compiling accum...
graph size: 171
graph size: 169
graph size: 169
Compiling accum done. Time elapsed: 13.418733 seconds
Monitoring step:
Epochs seen: 0
Batches seen: 0
Examples seen: 0
learning_rate: 0.00999999046326
momentum: 0.499999672174
test_h0_col_norms_max: 6.23503017426
test_h0_col_norms_mean: 3.82356023788
test_h0_col_norms_min: 2.06193947792
test_h0_row_norms_max: 5.89326524734
test_h0_row_norms_mean: 2.98549389839
test_h0_row_norms_min: 0.0
test_h1_col_norms_max: 5.99438333511
test_h1_col_norms_mean: 3.80721712112
test_h1_col_norms_min: 1.71524214745
test_h1_row_norms_max: 7.80886650085
test_h1_row_norms_mean: 5.40815734863
test_h1_row_norms_min: 2.97773504257
test_objective: 3.4297709465
test_term_0: 2.30258488655
test_term_1_weight_decay: 1.12718772888
test_y_col_norms_max: 0.0
test_y_col_norms_mean: 0.0
test_y_col_norms_min: 0.0
test_y_max_max_class: 0.100000023842
test_y_mean_max_class: 0.100000031292
test_y_min_max_class: 0.100000023842
test_y_misclass: 0.901999890804
test_y_nll: 2.30258488655
test_y_row_norms_max: 0.0
test_y_row_norms_mean: 0.0
test_y_row_norms_min: 0.0
train_h0_col_norms_max: 6.23505115509
train_h0_col_norms_mean: 3.82354259491
train_h0_col_norms_min: 2.0619494915
train_h0_row_norms_max: 5.89324569702
train_h0_row_norms_mean: 2.98548007011
train_h0_row_norms_min: 0.0
train_h1_col_norms_max: 5.99438095093
train_h1_col_norms_mean: 3.80721092224
train_h1_col_norms_min: 1.71524274349
train_h1_row_norms_max: 7.80887794495
train_h1_row_norms_mean: 5.40813541412
train_h1_row_norms_min: 2.97772955894
train_objective: 3.42977070808
train_term_0: 2.30257916451
train_term_1_weight_decay: 1.12718474865
train_y_col_norms_max: 0.0
train_y_col_norms_mean: 0.0
train_y_col_norms_min: 0.0
train_y_max_max_class: 0.100000545382
train_y_mean_max_class: 0.100000545382
train_y_min_max_class: 0.100000545382
train_y_misclass: 0.901360213757
train_y_nll: 2.30257916451
train_y_row_norms_max: 0.0
train_y_row_norms_mean: 0.0
train_y_row_norms_min: 0.0
valid_h0_col_norms_max: 6.23503017426
valid_h0_col_norms_mean: 3.82356023788
valid_h0_col_norms_min: 2.06193947792
valid_h0_row_norms_max: 5.89326524734
valid_h0_row_norms_mean: 2.98549389839
valid_h0_row_norms_min: 0.0
valid_h1_col_norms_max: 5.99438333511
valid_h1_col_norms_mean: 3.80721712112
valid_h1_col_norms_min: 1.71524214745
valid_h1_row_norms_max: 7.80886650085
valid_h1_row_norms_mean: 5.40815734863
valid_h1_row_norms_min: 2.97773504257
valid_objective: 3.4297709465
valid_term_0: 2.30258488655
valid_term_1_weight_decay: 1.12718772888
valid_y_col_norms_max: 0.0
valid_y_col_norms_mean: 0.0
valid_y_col_norms_min: 0.0
valid_y_max_max_class: 0.100000023842
valid_y_mean_max_class: 0.100000031292
valid_y_min_max_class: 0.100000023842
valid_y_misclass: 0.90089994669
valid_y_nll: 2.30258488655
valid_y_row_norms_max: 0.0
valid_y_row_norms_mean: 0.0
valid_y_row_norms_min: 0.0
Time this epoch: 3.310886 seconds
Monitoring step:
Epochs seen: 1
Batches seen: 500
Examples seen: 50000
learning_rate: 0.00999999046326
momentum: 0.499999672174
test_h0_col_norms_max: 6.22863864899
test_h0_col_norms_mean: 3.81978034973
test_h0_col_norms_min: 2.06060481071
test_h0_row_norms_max: 5.88668251038
test_h0_row_norms_mean: 2.98259210587
test_h0_row_norms_min: 0.00163801340386
test_h1_col_norms_max: 5.98888349533
test_h1_col_norms_mean: 3.80343770981
test_h1_col_norms_min: 1.71354997158
test_h1_row_norms_max: 7.80116271973
test_h1_row_norms_mean: 5.40278577805
test_h1_row_norms_min: 2.97481369972
test_objective: 1.39391481876
test_term_0: 0.268794178963
test_term_1_weight_decay: 1.12512099743
test_y_col_norms_max: 0.645387113094
test_y_col_norms_mean: 0.59630638361
test_y_col_norms_min: 0.520404875278
test_y_max_max_class: 0.999945759773
test_y_mean_max_class: 0.904323577881
test_y_min_max_class: 0.380515068769
test_y_misclass: 0.0813000127673
test_y_nll: 0.268794178963
test_y_row_norms_max: 0.179665878415
test_y_row_norms_mean: 0.0518467575312
test_y_row_norms_min: 0.000148977691424
train_h0_col_norms_max: 6.2286696434
train_h0_col_norms_mean: 3.81979823112
train_h0_col_norms_min: 2.06059765816
train_h0_row_norms_max: 5.88671255112
train_h0_row_norms_mean: 2.9826066494
train_h0_row_norms_min: 0.00163802062161
train_h1_col_norms_max: 5.9888548851
train_h1_col_norms_mean: 3.80346035957
train_h1_col_norms_min: 1.71355748177
train_h1_row_norms_max: 7.80111694336
train_h1_row_norms_mean: 5.40279817581
train_h1_row_norms_min: 2.97482800484
train_objective: 1.38994812965
train_term_0: 0.264828205109
train_term_1_weight_decay: 1.12512207031
train_y_col_norms_max: 0.645388245583
train_y_col_norms_mean: 0.596305251122
train_y_col_norms_min: 0.520407259464
train_y_max_max_class: 0.99996304512
train_y_mean_max_class: 0.898920297623
train_y_min_max_class: 0.361467987299
train_y_misclass: 0.0793600603938
train_y_nll: 0.264828205109
train_y_row_norms_max: 0.179665371776
train_y_row_norms_mean: 0.0518467389047
train_y_row_norms_min: 0.000148977618665
valid_h0_col_norms_max: 6.22863864899
valid_h0_col_norms_mean: 3.81978034973
valid_h0_col_norms_min: 2.06060481071
valid_h0_row_norms_max: 5.88668251038
valid_h0_row_norms_mean: 2.98259210587
valid_h0_row_norms_min: 0.00163801340386
valid_h1_col_norms_max: 5.98888349533
valid_h1_col_norms_mean: 3.80343770981
valid_h1_col_norms_min: 1.71354997158
valid_h1_row_norms_max: 7.80116271973
valid_h1_row_norms_mean: 5.40278577805
valid_h1_row_norms_min: 2.97481369972
valid_objective: 1.37731289864
valid_term_0: 0.252192467451
valid_term_1_weight_decay: 1.12512099743
valid_y_col_norms_max: 0.645387113094
valid_y_col_norms_mean: 0.59630638361
valid_y_col_norms_min: 0.520404875278
valid_y_max_max_class: 0.999964594841
valid_y_mean_max_class: 0.907153248787
valid_y_min_max_class: 0.362326830626
valid_y_misclass: 0.0756999999285
valid_y_nll: 0.252192467451
valid_y_row_norms_max: 0.179665878415
valid_y_row_norms_mean: 0.0518467575312
valid_y_row_norms_min: 0.000148977691424
Time this epoch: 3.343837 seconds
Monitoring step:
Epochs seen: 2
Batches seen: 1000
Examples seen: 100000
learning_rate: 0.00999999046326
momentum: 0.554444551468
test_h0_col_norms_max: 6.22144937515
test_h0_col_norms_mean: 3.81579256058
test_h0_col_norms_min: 2.05898046494
test_h0_row_norms_max: 5.88006973267
test_h0_row_norms_mean: 2.9794948101
test_h0_row_norms_min: 0.00336797139607
test_h1_col_norms_max: 5.98277664185
test_h1_col_norms_mean: 3.79929542542
test_h1_col_norms_min: 1.71166646481
test_h1_row_norms_max: 7.79234170914
test_h1_row_norms_mean: 5.3969039917
test_h1_row_norms_min: 2.97146487236
test_objective: 1.3320376873
test_term_0: 0.209235101938
test_term_1_weight_decay: 1.12280321121
test_y_col_norms_max: 0.849509298801
test_y_col_norms_mean: 0.752226889133
test_y_col_norms_min: 0.648749351501
test_y_max_max_class: 0.999980688095
test_y_mean_max_class: 0.928127348423
test_y_min_max_class: 0.417017698288
test_y_misclass: 0.0624000132084
test_y_nll: 0.209235101938
test_y_row_norms_max: 0.202931031585
test_y_row_norms_mean: 0.0667919442058
test_y_row_norms_min: 0.00027507453342
train_h0_col_norms_max: 6.22147130966
train_h0_col_norms_mean: 3.81577634811
train_h0_col_norms_min: 2.0589826107
train_h0_row_norms_max: 5.8800983429
train_h0_row_norms_mean: 2.9795088768
train_h0_row_norms_min: 0.00336798490025
train_h1_col_norms_max: 5.98279714584
train_h1_col_norms_mean: 3.7993118763
train_h1_col_norms_min: 1.71166646481
train_h1_row_norms_max: 7.79229545593
train_h1_row_norms_mean: 5.39690923691
train_h1_row_norms_min: 2.97145032883
train_objective: 1.31553328037
train_term_0: 0.192730411887
train_term_1_weight_decay: 1.12280583382
train_y_col_norms_max: 0.849513113499
train_y_col_norms_mean: 0.752230584621
train_y_col_norms_min: 0.648747861385
train_y_max_max_class: 0.999980807304
train_y_mean_max_class: 0.925747811794
train_y_min_max_class: 0.379059791565
train_y_misclass: 0.0572400614619
train_y_nll: 0.192730411887
train_y_row_norms_max: 0.202931344509
train_y_row_norms_mean: 0.0667921230197
train_y_row_norms_min: 0.00027507476625
valid_h0_col_norms_max: 6.22144937515
valid_h0_col_norms_mean: 3.81579256058
valid_h0_col_norms_min: 2.05898046494
valid_h0_row_norms_max: 5.88006973267
valid_h0_row_norms_mean: 2.9794948101
valid_h0_row_norms_min: 0.00336797139607
valid_h1_col_norms_max: 5.98277664185
valid_h1_col_norms_mean: 3.79929542542
valid_h1_col_norms_min: 1.71166646481
valid_h1_row_norms_max: 7.79234170914
valid_h1_row_norms_mean: 5.3969039917
valid_h1_row_norms_min: 2.97146487236
valid_objective: 1.32417428493
valid_term_0: 0.201371654868
valid_term_1_weight_decay: 1.12280321121
valid_y_col_norms_max: 0.849509298801
valid_y_col_norms_mean: 0.752226889133
valid_y_col_norms_min: 0.648749351501
valid_y_max_max_class: 0.999982237816
valid_y_mean_max_class: 0.931577861309
valid_y_min_max_class: 0.40255895257
valid_y_misclass: 0.0578999966383
valid_y_nll: 0.201371654868
valid_y_row_norms_max: 0.202931031585
valid_y_row_norms_mean: 0.0667919442058
valid_y_row_norms_min: 0.00027507453342
Time this epoch: 3.283221 seconds
Monitoring step:
Epochs seen: 3
Batches seen: 1500
Examples seen: 150000
learning_rate: 0.00999999046326
momentum: 0.608888924122
test_h0_col_norms_max: 6.21347379684
test_h0_col_norms_mean: 3.81121587753
test_h0_col_norms_min: 2.05705142021
test_h0_row_norms_max: 5.87235736847
test_h0_row_norms_mean: 2.97595834732
test_h0_row_norms_min: 0.00510276248679
test_h1_col_norms_max: 5.97572278976
test_h1_col_norms_mean: 3.79457330704
test_h1_col_norms_min: 1.70953249931
test_h1_row_norms_max: 7.78235435486
test_h1_row_norms_mean: 5.39019727707
test_h1_row_norms_min: 2.96771478653
test_objective: 1.30544030666
test_term_0: 0.185299769044
test_term_1_weight_decay: 1.12013947964
test_y_col_norms_max: 1.00650155544
test_y_col_norms_mean: 0.878560483456
test_y_col_norms_min: 0.748090326786
test_y_max_max_class: 0.999993503094
test_y_mean_max_class: 0.939459979534
test_y_min_max_class: 0.444366723299
test_y_misclass: 0.0547000169754
test_y_nll: 0.185299769044
test_y_row_norms_max: 0.217191457748
test_y_row_norms_mean: 0.0787876471877
test_y_row_norms_min: 0.000392778747482
train_h0_col_norms_max: 6.21344470978
train_h0_col_norms_mean: 3.81123256683
train_h0_col_norms_min: 2.05706167221
train_h0_row_norms_max: 5.87232971191
train_h0_row_norms_mean: 2.97594833374
train_h0_row_norms_min: 0.00510273734108
train_h1_col_norms_max: 5.97572278976
train_h1_col_norms_mean: 3.79455709457
train_h1_col_norms_min: 1.70952439308
train_h1_row_norms_max: 7.78239917755
train_h1_row_norms_mean: 5.39017248154
train_h1_row_norms_min: 2.96771502495
train_objective: 1.2823060751
train_term_0: 0.162165120244
train_term_1_weight_decay: 1.12014472485
train_y_col_norms_max: 1.00650632381
train_y_col_norms_mean: 0.878564417362
train_y_col_norms_min: 0.748090386391
train_y_max_max_class: 0.999991178513
train_y_mean_max_class: 0.93700414896
train_y_min_max_class: 0.404900848866
train_y_misclass: 0.0482200570405
train_y_nll: 0.162165120244
train_y_row_norms_max: 0.21719174087
train_y_row_norms_mean: 0.0787875503302
train_y_row_norms_min: 0.000392780813854
valid_h0_col_norms_max: 6.21347379684
valid_h0_col_norms_mean: 3.81121587753
valid_h0_col_norms_min: 2.05705142021
valid_h0_row_norms_max: 5.87235736847
valid_h0_row_norms_mean: 2.97595834732
valid_h0_row_norms_min: 0.00510276248679
valid_h1_col_norms_max: 5.97572278976
valid_h1_col_norms_mean: 3.79457330704
valid_h1_col_norms_min: 1.70953249931
valid_h1_row_norms_max: 7.78235435486
valid_h1_row_norms_mean: 5.39019727707
valid_h1_row_norms_min: 2.96771478653
valid_objective: 1.29470717907
valid_term_0: 0.174566537142
valid_term_1_weight_decay: 1.12013947964
valid_y_col_norms_max: 1.00650155544
valid_y_col_norms_mean: 0.878560483456
valid_y_col_norms_min: 0.748090326786
valid_y_max_max_class: 0.999994695187
valid_y_mean_max_class: 0.942149102688
valid_y_min_max_class: 0.417711257935
valid_y_misclass: 0.051200017333
valid_y_nll: 0.174566537142
valid_y_row_norms_max: 0.217191457748
valid_y_row_norms_mean: 0.0787876471877
valid_y_row_norms_min: 0.000392778747482
Time this epoch: 3.301401 seconds
Monitoring step:
Epochs seen: 4
Batches seen: 2000
Examples seen: 200000
learning_rate: 0.00999999046326
momentum: 0.663333714008
test_h0_col_norms_max: 6.20446586609
test_h0_col_norms_mean: 3.80589365959
test_h0_col_norms_min: 2.05491876602
test_h0_row_norms_max: 5.86368274689
test_h0_row_norms_mean: 2.97183966637
test_h0_row_norms_min: 0.00636858073995
test_h1_col_norms_max: 5.96751737595
test_h1_col_norms_mean: 3.78907322884
test_h1_col_norms_min: 1.70705342293
test_h1_row_norms_max: 7.77082681656
test_h1_row_norms_mean: 5.38239336014
test_h1_row_norms_min: 2.96349358559
test_objective: 1.28483641148
test_term_0: 0.167798668146
test_term_1_weight_decay: 1.11703836918
test_y_col_norms_max: 1.14337170124
test_y_col_norms_mean: 0.994192421436
test_y_col_norms_min: 0.840292572975
test_y_max_max_class: 0.999995589256
test_y_mean_max_class: 0.946651279926
test_y_min_max_class: 0.454940706491
test_y_misclass: 0.0549000278115
test_y_nll: 0.167798668146
test_y_row_norms_max: 0.231142029166
test_y_row_norms_mean: 0.089763648808
test_y_row_norms_min: 0.000477136200061
train_h0_col_norms_max: 6.20444250107
train_h0_col_norms_mean: 3.80587768555
train_h0_col_norms_min: 2.05492663383
train_h0_row_norms_max: 5.86367082596
train_h0_row_norms_mean: 2.97184991837
train_h0_row_norms_min: 0.00636860262603
train_h1_col_norms_max: 5.96753835678
train_h1_col_norms_mean: 3.7890689373
train_h1_col_norms_min: 1.70706069469
train_h1_row_norms_max: 7.77079200745
train_h1_row_norms_mean: 5.38238239288
train_h1_row_norms_min: 2.96350455284
train_objective: 1.25564575195
train_term_0: 0.138607770205
train_term_1_weight_decay: 1.11704432964
train_y_col_norms_max: 1.14337110519
train_y_col_norms_mean: 0.994198083878
train_y_col_norms_min: 0.840297460556
train_y_max_max_class: 0.999992132187
train_y_mean_max_class: 0.945581674576
train_y_min_max_class: 0.42304289341
train_y_misclass: 0.0431200563908
train_y_nll: 0.138607770205
train_y_row_norms_max: 0.231140971184
train_y_row_norms_mean: 0.0897636190057
train_y_row_norms_min: 0.000477139052236
valid_h0_col_norms_max: 6.20446586609
valid_h0_col_norms_mean: 3.80589365959
valid_h0_col_norms_min: 2.05491876602
valid_h0_row_norms_max: 5.86368274689
valid_h0_row_norms_mean: 2.97183966637
valid_h0_row_norms_min: 0.00636858073995
valid_h1_col_norms_max: 5.96751737595
valid_h1_col_norms_mean: 3.78907322884
valid_h1_col_norms_min: 1.70705342293
valid_h1_row_norms_max: 7.77082681656
valid_h1_row_norms_mean: 5.38239336014
valid_h1_row_norms_min: 2.96349358559
valid_objective: 1.27460837364
valid_term_0: 0.157571211457
valid_term_1_weight_decay: 1.11703836918
valid_y_col_norms_max: 1.14337170124
valid_y_col_norms_mean: 0.994192421436
valid_y_col_norms_min: 0.840292572975
valid_y_max_max_class: 0.999996304512
valid_y_mean_max_class: 0.949614882469
valid_y_min_max_class: 0.442067503929
valid_y_misclass: 0.0465000085533
valid_y_nll: 0.157571211457
valid_y_row_norms_max: 0.231142029166
valid_y_row_norms_mean: 0.089763648808
valid_y_row_norms_min: 0.000477136200061
Time this epoch: 3.266055 seconds
Monitoring step:
Epochs seen: 5
Batches seen: 2500
Examples seen: 250000
learning_rate: 0.00999999046326
momentum: 0.717777192593
test_h0_col_norms_max: 6.19388818741
test_h0_col_norms_mean: 3.79951477051
test_h0_col_norms_min: 2.05230784416
test_h0_row_norms_max: 5.85298204422
test_h0_row_norms_mean: 2.96690416336
test_h0_row_norms_min: 0.00795079302043
test_h1_col_norms_max: 5.95764780045
test_h1_col_norms_mean: 3.78251552582
test_h1_col_norms_min: 1.70412421227
test_h1_row_norms_max: 7.7571387291
test_h1_row_norms_mean: 5.37306642532
test_h1_row_norms_min: 2.95853662491
test_objective: 1.25132834911
test_term_0: 0.138001933694
test_term_1_weight_decay: 1.11332631111
test_y_col_norms_max: 1.26581287384
test_y_col_norms_mean: 1.10778701305
test_y_col_norms_min: 0.922472834587
test_y_max_max_class: 0.999994754791
test_y_mean_max_class: 0.953354179859
test_y_min_max_class: 0.460847198963
test_y_misclass: 0.0430000051856
test_y_nll: 0.138001933694
test_y_row_norms_max: 0.258754551411
test_y_row_norms_mean: 0.100538700819
test_y_row_norms_min: 0.000593058066443
train_h0_col_norms_max: 6.19387769699
train_h0_col_norms_mean: 3.79953241348
train_h0_col_norms_min: 2.05230736732
train_h0_row_norms_max: 5.85295391083
train_h0_row_norms_mean: 2.96689033508
train_h0_row_norms_min: 0.0079507548362
train_h1_col_norms_max: 5.95761966705
train_h1_col_norms_mean: 3.78251123428
train_h1_col_norms_min: 1.70413899422
train_h1_row_norms_max: 7.75717258453
train_h1_row_norms_mean: 5.37306880951
train_h1_row_norms_min: 2.95853638649
train_objective: 1.21803998947
train_term_0: 0.104714490473
train_term_1_weight_decay: 1.11332845688
train_y_col_norms_max: 1.26581907272
train_y_col_norms_mean: 1.1077862978
train_y_col_norms_min: 0.922471702099
train_y_max_max_class: 0.999992728233
train_y_mean_max_class: 0.954178750515
train_y_min_max_class: 0.440906405449
train_y_misclass: 0.0312400292605
train_y_nll: 0.104714490473
train_y_row_norms_max: 0.258753240108
train_y_row_norms_mean: 0.100538358092
train_y_row_norms_min: 0.000593057950027
valid_h0_col_norms_max: 6.19388818741
valid_h0_col_norms_mean: 3.79951477051
valid_h0_col_norms_min: 2.05230784416
valid_h0_row_norms_max: 5.85298204422
valid_h0_row_norms_mean: 2.96690416336
valid_h0_row_norms_min: 0.00795079302043
valid_h1_col_norms_max: 5.95764780045
valid_h1_col_norms_mean: 3.78251552582
valid_h1_col_norms_min: 1.70412421227
valid_h1_row_norms_max: 7.7571387291
valid_h1_row_norms_mean: 5.37306642532
valid_h1_row_norms_min: 2.95853662491
valid_objective: 1.24973428249
valid_term_0: 0.136407867074
valid_term_1_weight_decay: 1.11332631111
valid_y_col_norms_max: 1.26581287384
valid_y_col_norms_mean: 1.10778701305
valid_y_col_norms_min: 0.922472834587
valid_y_max_max_class: 0.999996542931
valid_y_mean_max_class: 0.955720424652
valid_y_min_max_class: 0.447657436132
valid_y_misclass: 0.0386000014842
valid_y_nll: 0.136407867074
valid_y_row_norms_max: 0.258754551411
valid_y_row_norms_mean: 0.100538700819
valid_y_row_norms_min: 0.000593058066443
Time this epoch: 3.281634 seconds
Monitoring step:
Epochs seen: 6
Batches seen: 3000
Examples seen: 300000
learning_rate: 0.00999999046326
momentum: 0.772221684456
test_h0_col_norms_max: 6.18053913116
test_h0_col_norms_mean: 3.79164195061
test_h0_col_norms_min: 2.04931807518
test_h0_row_norms_max: 5.84014606476
test_h0_row_norms_mean: 2.96080875397
test_h0_row_norms_min: 0.00960826966912
test_h1_col_norms_max: 5.94511365891
test_h1_col_norms_mean: 3.77440404892
test_h1_col_norms_min: 1.70048546791
test_h1_row_norms_max: 7.74020195007
test_h1_row_norms_mean: 5.3615436554
test_h1_row_norms_min: 2.95202755928
test_objective: 1.23484170437
test_term_0: 0.126101091504
test_term_1_weight_decay: 1.10874140263
test_y_col_norms_max: 1.39184403419
test_y_col_norms_mean: 1.23041391373
test_y_col_norms_min: 1.02565836906
test_y_max_max_class: 0.999998748302
test_y_mean_max_class: 0.961094081402
test_y_min_max_class: 0.502607226372
test_y_misclass: 0.0397000052035
test_y_nll: 0.126101091504
test_y_row_norms_max: 0.288574844599
test_y_row_norms_mean: 0.112107351422
test_y_row_norms_min: 0.000744926044717
train_h0_col_norms_max: 6.18052864075
train_h0_col_norms_mean: 3.79166030884
train_h0_col_norms_min: 2.04932594299
train_h0_row_norms_max: 5.84012889862
train_h0_row_norms_mean: 2.96080327034
train_h0_row_norms_min: 0.0096082771197
train_h1_col_norms_max: 5.94514036179
train_h1_col_norms_mean: 3.77440428734
train_h1_col_norms_min: 1.7004776001
train_h1_row_norms_max: 7.74021291733
train_h1_row_norms_mean: 5.36155557632
train_h1_row_norms_min: 2.9520175457
train_objective: 1.19061946869
train_term_0: 0.0818792134523
train_term_1_weight_decay: 1.10874009132
train_y_col_norms_max: 1.39184439182
train_y_col_norms_mean: 1.2304173708
train_y_col_norms_min: 1.02565360069
train_y_max_max_class: 0.999994039536
train_y_mean_max_class: 0.963193774223
train_y_min_max_class: 0.475303918123
train_y_misclass: 0.0230600107461
train_y_nll: 0.0818792134523
train_y_row_norms_max: 0.288575559855
train_y_row_norms_mean: 0.112107902765
train_y_row_norms_min: 0.000744922028389
valid_h0_col_norms_max: 6.18053913116
valid_h0_col_norms_mean: 3.79164195061
valid_h0_col_norms_min: 2.04931807518
valid_h0_row_norms_max: 5.84014606476
valid_h0_row_norms_mean: 2.96080875397
valid_h0_row_norms_min: 0.00960826966912
valid_h1_col_norms_max: 5.94511365891
valid_h1_col_norms_mean: 3.77440404892
valid_h1_col_norms_min: 1.70048546791
valid_h1_row_norms_max: 7.74020195007
valid_h1_row_norms_mean: 5.3615436554
valid_h1_row_norms_min: 2.95202755928
valid_objective: 1.23645818233
valid_term_0: 0.127717524767
valid_term_1_weight_decay: 1.10874140263
valid_y_col_norms_max: 1.39184403419
valid_y_col_norms_mean: 1.23041391373
valid_y_col_norms_min: 1.02565836906
valid_y_max_max_class: 0.999998986721
valid_y_mean_max_class: 0.963711440563
valid_y_min_max_class: 0.479158580303
valid_y_misclass: 0.0373999997973
valid_y_nll: 0.127717524767
valid_y_row_norms_max: 0.288574844599
valid_y_row_norms_mean: 0.112107351422
valid_y_row_norms_min: 0.000744926044717
Time this epoch: 3.285549 seconds
Monitoring step:
Epochs seen: 7
Batches seen: 3500
Examples seen: 350000
learning_rate: 0.00999999046326
momentum: 0.826667308807
test_h0_col_norms_max: 6.16351413727
test_h0_col_norms_mean: 3.78127264977
test_h0_col_norms_min: 2.04552721977
test_h0_row_norms_max: 5.82413673401
test_h0_row_norms_mean: 2.95279192924
test_h0_row_norms_min: 0.0109715117142
test_h1_col_norms_max: 5.92860794067
test_h1_col_norms_mean: 3.7637283802
test_h1_col_norms_min: 1.69574940205
test_h1_row_norms_max: 7.71776247025
test_h1_row_norms_mean: 5.34639310837
test_h1_row_norms_min: 2.94415974617
test_objective: 1.2293548584
test_term_0: 0.126640558243
test_term_1_weight_decay: 1.1027148962
test_y_col_norms_max: 1.53999233246
test_y_col_norms_mean: 1.36674308777
test_y_col_norms_min: 1.134085536
test_y_max_max_class: 0.999998986721
test_y_mean_max_class: 0.962450027466
test_y_min_max_class: 0.520037055016
test_y_misclass: 0.0400000177324
test_y_nll: 0.126640558243
test_y_row_norms_max: 0.323384702206
test_y_row_norms_mean: 0.124884955585
test_y_row_norms_min: 0.000862787244841
train_h0_col_norms_max: 6.1635351181
train_h0_col_norms_mean: 3.78129315376
train_h0_col_norms_min: 2.04552340508
train_h0_row_norms_max: 5.82410860062
train_h0_row_norms_mean: 2.95280575752
train_h0_row_norms_min: 0.0109714772552
train_h1_col_norms_max: 5.92858171463
train_h1_col_norms_mean: 3.76370692253
train_h1_col_norms_min: 1.69575130939
train_h1_row_norms_max: 7.71779823303
train_h1_row_norms_mean: 5.34638214111
train_h1_row_norms_min: 2.94414997101
train_objective: 1.18144452572
train_term_0: 0.0787304490805
train_term_1_weight_decay: 1.10271286964
train_y_col_norms_max: 1.54000031948
train_y_col_norms_mean: 1.36673867702
train_y_col_norms_min: 1.13409137726
train_y_max_max_class: 0.999994158745
train_y_mean_max_class: 0.964662730694
train_y_min_max_class: 0.485619604588
train_y_misclass: 0.0242600161582
train_y_nll: 0.0787304490805
train_y_row_norms_max: 0.323384910822
train_y_row_norms_mean: 0.124885700643
train_y_row_norms_min: 0.000862783577759
valid_h0_col_norms_max: 6.16351413727
valid_h0_col_norms_mean: 3.78127264977
valid_h0_col_norms_min: 2.04552721977
valid_h0_row_norms_max: 5.82413673401
valid_h0_row_norms_mean: 2.95279192924
valid_h0_row_norms_min: 0.0109715117142
valid_h1_col_norms_max: 5.92860794067
valid_h1_col_norms_mean: 3.7637283802
valid_h1_col_norms_min: 1.69574940205
valid_h1_row_norms_max: 7.71776247025
valid_h1_row_norms_mean: 5.34639310837
valid_h1_row_norms_min: 2.94415974617
valid_objective: 1.22817146778
valid_term_0: 0.125456944108
valid_term_1_weight_decay: 1.1027148962
valid_y_col_norms_max: 1.53999233246
valid_y_col_norms_mean: 1.36674308777
valid_y_col_norms_min: 1.134085536
valid_y_max_max_class: 0.99999910593
valid_y_mean_max_class: 0.965774953365
valid_y_min_max_class: 0.481605708599
valid_y_misclass: 0.0360999889672
valid_y_nll: 0.125456944108
valid_y_row_norms_max: 0.323384702206
valid_y_row_norms_mean: 0.124884955585
valid_y_row_norms_min: 0.000862787244841
Time this epoch: 3.275973 seconds
Monitoring step:
Epochs seen: 8
Batches seen: 4000
Examples seen: 400000
learning_rate: 0.00999999046326
momentum: 0.881111502647
test_h0_col_norms_max: 6.13874149323
test_h0_col_norms_mean: 3.76625037193
test_h0_col_norms_min: 2.03984022141
test_h0_row_norms_max: 5.79944992065
test_h0_row_norms_mean: 2.94116210938
test_h0_row_norms_min: 0.0121828410774
test_h1_col_norms_max: 5.90430831909
test_h1_col_norms_mean: 3.74820208549
test_h1_col_norms_min: 1.68876981735
test_h1_row_norms_max: 7.68556308746
test_h1_row_norms_mean: 5.32432985306
test_h1_row_norms_min: 2.93232631683
test_objective: 1.21413767338
test_term_0: 0.12014952302
test_term_1_weight_decay: 1.09398806095
test_y_col_norms_max: 1.73185801506
test_y_col_norms_mean: 1.54484415054
test_y_col_norms_min: 1.28760778904
test_y_max_max_class: 0.999999284744
test_y_mean_max_class: 0.969546198845
test_y_min_max_class: 0.53670758009
test_y_misclass: 0.0355999991298
test_y_nll: 0.12014952302
test_y_row_norms_max: 0.390541791916
test_y_row_norms_mean: 0.141607090831
test_y_row_norms_min: 0.00119230698328
train_h0_col_norms_max: 6.13874340057
train_h0_col_norms_mean: 3.76626849174
train_h0_col_norms_min: 2.03984594345
train_h0_row_norms_max: 5.79946660995
train_h0_row_norms_mean: 2.94116210938
train_h0_row_norms_min: 0.0121827786788
train_h1_col_norms_max: 5.90427827835
train_h1_col_norms_mean: 3.74818611145
train_h1_col_norms_min: 1.68877720833
train_h1_row_norms_max: 7.68560028076
train_h1_row_norms_mean: 5.32434654236
train_h1_row_norms_min: 2.93231272697
train_objective: 1.15510380268
train_term_0: 0.0611156411469
train_term_1_weight_decay: 1.09398496151
train_y_col_norms_max: 1.73185968399
train_y_col_norms_mean: 1.54483699799
train_y_col_norms_min: 1.28760266304
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.97226446867
train_y_min_max_class: 0.523442387581
train_y_misclass: 0.0181600283831
train_y_nll: 0.0611156411469
train_y_row_norms_max: 0.390543580055
train_y_row_norms_mean: 0.141606390476
train_y_row_norms_min: 0.00119230675045
valid_h0_col_norms_max: 6.13874149323
valid_h0_col_norms_mean: 3.76625037193
valid_h0_col_norms_min: 2.03984022141
valid_h0_row_norms_max: 5.79944992065
valid_h0_row_norms_mean: 2.94116210938
valid_h0_row_norms_min: 0.0121828410774
valid_h1_col_norms_max: 5.90430831909
valid_h1_col_norms_mean: 3.74820208549
valid_h1_col_norms_min: 1.68876981735
valid_h1_row_norms_max: 7.68556308746
valid_h1_row_norms_mean: 5.32432985306
valid_h1_row_norms_min: 2.93232631683
valid_objective: 1.2128187418
valid_term_0: 0.118830725551
valid_term_1_weight_decay: 1.09398806095
valid_y_col_norms_max: 1.73185801506
valid_y_col_norms_mean: 1.54484415054
valid_y_col_norms_min: 1.28760778904
valid_y_max_max_class: 0.999999284744
valid_y_mean_max_class: 0.971059143543
valid_y_min_max_class: 0.500100016594
valid_y_misclass: 0.0353999920189
valid_y_nll: 0.118830725551
valid_y_row_norms_max: 0.390541791916
valid_y_row_norms_mean: 0.141607090831
valid_y_row_norms_min: 0.00119230698328
Time this epoch: 3.273986 seconds
Monitoring step:
Epochs seen: 9
Batches seen: 4500
Examples seen: 450000
learning_rate: 0.00999999046326
momentum: 0.935554862022
test_h0_col_norms_max: 6.09445524216
test_h0_col_norms_mean: 3.73940348625
test_h0_col_norms_min: 2.03072142601
test_h0_row_norms_max: 5.75560235977
test_h0_row_norms_mean: 2.92046833038
test_h0_row_norms_min: 0.014029703103
test_h1_col_norms_max: 5.86166810989
test_h1_col_norms_mean: 3.71971082687
test_h1_col_norms_min: 1.67665565014
test_h1_row_norms_max: 7.62777662277
test_h1_row_norms_mean: 5.2838845253
test_h1_row_norms_min: 2.91292881966
test_objective: 1.20774161816
test_term_0: 0.129474073648
test_term_1_weight_decay: 1.0782674551
test_y_col_norms_max: 2.063549757
test_y_col_norms_mean: 1.8654705286
test_y_col_norms_min: 1.53516829014
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.971782028675
test_y_min_max_class: 0.541796386242
test_y_misclass: 0.0371000058949
test_y_nll: 0.129474073648
test_y_row_norms_max: 0.496850013733
test_y_row_norms_mean: 0.171486049891
test_y_row_norms_min: 0.00181403872557
train_h0_col_norms_max: 6.09445524216
train_h0_col_norms_mean: 3.73938298225
train_h0_col_norms_min: 2.03072929382
train_h0_row_norms_max: 5.75560045242
train_h0_row_norms_mean: 2.92047595978
train_h0_row_norms_min: 0.0140297813341
train_h1_col_norms_max: 5.86169338226
train_h1_col_norms_mean: 3.71969389915
train_h1_col_norms_min: 1.67666423321
train_h1_row_norms_max: 7.62774133682
train_h1_row_norms_mean: 5.2838549614
train_h1_row_norms_min: 2.91291928291
train_objective: 1.14386320114
train_term_0: 0.0655960813165
train_term_1_weight_decay: 1.07827007771
train_y_col_norms_max: 2.06355881691
train_y_col_norms_mean: 1.86546158791
train_y_col_norms_min: 1.53517353535
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.976020038128
train_y_min_max_class: 0.536927580833
train_y_misclass: 0.0207600202411
train_y_nll: 0.0655960813165
train_y_row_norms_max: 0.4968495965
train_y_row_norms_mean: 0.17148527503
train_y_row_norms_min: 0.00181404093746
valid_h0_col_norms_max: 6.09445524216
valid_h0_col_norms_mean: 3.73940348625
valid_h0_col_norms_min: 2.03072142601
valid_h0_row_norms_max: 5.75560235977
valid_h0_row_norms_mean: 2.92046833038
valid_h0_row_norms_min: 0.014029703103
valid_h1_col_norms_max: 5.86166810989
valid_h1_col_norms_mean: 3.71971082687
valid_h1_col_norms_min: 1.67665565014
valid_h1_row_norms_max: 7.62777662277
valid_h1_row_norms_mean: 5.2838845253
valid_h1_row_norms_min: 2.91292881966
valid_objective: 1.21526145935
valid_term_0: 0.136994019151
valid_term_1_weight_decay: 1.0782674551
valid_y_col_norms_max: 2.063549757
valid_y_col_norms_mean: 1.8654705286
valid_y_col_norms_min: 1.53516829014
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.974301934242
valid_y_min_max_class: 0.516560852528
valid_y_misclass: 0.0349999815226
valid_y_nll: 0.136994019151
valid_y_row_norms_max: 0.496850013733
valid_y_row_norms_mean: 0.171486049891
valid_y_row_norms_min: 0.00181403872557
Time this epoch: 3.317775 seconds
Monitoring step:
Epochs seen: 10
Batches seen: 5000
Examples seen: 500000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 5.92522621155
test_h0_col_norms_mean: 3.73818850517
test_h0_col_norms_min: 2.15961098671
test_h0_row_norms_max: 5.7353053093
test_h0_row_norms_mean: 2.92477583885
test_h0_row_norms_min: 0.0341217853129
test_h1_col_norms_max: 5.61352205276
test_h1_col_norms_mean: 3.57546806335
test_h1_col_norms_min: 1.61370325089
test_h1_row_norms_max: 7.31059169769
test_h1_row_norms_mean: 5.08152914047
test_h1_row_norms_min: 2.99987840652
test_objective: 1.26450061798
test_term_0: 0.236082434654
test_term_1_weight_decay: 1.02841842175
test_y_col_norms_max: 4.73058700562
test_y_col_norms_mean: 4.18089103699
test_y_col_norms_min: 3.5669798851
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.968040525913
test_y_min_max_class: 0.498749941587
test_y_misclass: 0.059400010854
test_y_nll: 0.236082434654
test_y_row_norms_max: 0.891591668129
test_y_row_norms_mean: 0.392109334469
test_y_row_norms_min: 0.0124359438196
train_h0_col_norms_max: 5.92519760132
train_h0_col_norms_mean: 3.73817253113
train_h0_col_norms_min: 2.15960621834
train_h0_row_norms_max: 5.73533010483
train_h0_row_norms_mean: 2.92479014397
train_h0_row_norms_min: 0.0341217927635
train_h1_col_norms_max: 5.61354923248
train_h1_col_norms_mean: 3.57545208931
train_h1_col_norms_min: 1.61369478703
train_h1_row_norms_max: 7.31061649323
train_h1_row_norms_mean: 5.08150863647
train_h1_row_norms_min: 2.99989366531
train_objective: 1.2140481472
train_term_0: 0.185629963875
train_term_1_weight_decay: 1.02841842175
train_y_col_norms_max: 4.73060131073
train_y_col_norms_mean: 4.18090629578
train_y_col_norms_min: 3.56698012352
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.968826234341
train_y_min_max_class: 0.484800755978
train_y_misclass: 0.0509400516748
train_y_nll: 0.185629963875
train_y_row_norms_max: 0.891595542431
train_y_row_norms_mean: 0.392109185457
train_y_row_norms_min: 0.0124359484762
valid_h0_col_norms_max: 5.92522621155
valid_h0_col_norms_mean: 3.73818850517
valid_h0_col_norms_min: 2.15961098671
valid_h0_row_norms_max: 5.7353053093
valid_h0_row_norms_mean: 2.92477583885
valid_h0_row_norms_min: 0.0341217853129
valid_h1_col_norms_max: 5.61352205276
valid_h1_col_norms_mean: 3.57546806335
valid_h1_col_norms_min: 1.61370325089
valid_h1_row_norms_max: 7.31059169769
valid_h1_row_norms_mean: 5.08152914047
valid_h1_row_norms_min: 2.99987840652
valid_objective: 1.27066576481
valid_term_0: 0.242247447371
valid_term_1_weight_decay: 1.02841842175
valid_y_col_norms_max: 4.73058700562
valid_y_col_norms_mean: 4.18089103699
valid_y_col_norms_min: 3.5669798851
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.969310641289
valid_y_min_max_class: 0.485083043575
valid_y_misclass: 0.0584000013769
valid_y_nll: 0.242247447371
valid_y_row_norms_max: 0.891591668129
valid_y_row_norms_mean: 0.392109334469
valid_y_row_norms_min: 0.0124359438196
Time this epoch: 3.378083 seconds
Monitoring step:
Epochs seen: 11
Batches seen: 5500
Examples seen: 550000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 5.70130395889
test_h0_col_norms_mean: 3.63394594193
test_h0_col_norms_min: 2.06413507462
test_h0_row_norms_max: 5.63436841965
test_h0_row_norms_mean: 2.84383249283
test_h0_row_norms_min: 0.0585759952664
test_h1_col_norms_max: 5.33032464981
test_h1_col_norms_mean: 3.41074442863
test_h1_col_norms_min: 1.54273200035
test_h1_row_norms_max: 6.95094776154
test_h1_row_norms_mean: 4.84889364243
test_h1_row_norms_min: 2.85255265236
test_objective: 1.09497404099
test_term_0: 0.145816907287
test_term_1_weight_decay: 0.94915664196
test_y_col_norms_max: 4.7894949913
test_y_col_norms_mean: 4.32798671722
test_y_col_norms_min: 3.85334467888
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.976011812687
test_y_min_max_class: 0.533329129219
test_y_misclass: 0.0402000099421
test_y_nll: 0.145816907287
test_y_row_norms_max: 1.16048634052
test_y_row_norms_mean: 0.407433569431
test_y_row_norms_min: 0.0134850135073
train_h0_col_norms_max: 5.70130395889
train_h0_col_norms_mean: 3.63395118713
train_h0_col_norms_min: 2.06412315369
train_h0_row_norms_max: 5.63437128067
train_h0_row_norms_mean: 2.84384655952
train_h0_row_norms_min: 0.0585760846734
train_h1_col_norms_max: 5.33032464981
train_h1_col_norms_mean: 3.4107298851
train_h1_col_norms_min: 1.54273247719
train_h1_row_norms_max: 6.95091104507
train_h1_row_norms_mean: 4.84888315201
train_h1_row_norms_min: 2.8525583744
train_objective: 1.04947304726
train_term_0: 0.10031542182
train_term_1_weight_decay: 0.949151813984
train_y_col_norms_max: 4.78949642181
train_y_col_norms_mean: 4.32800579071
train_y_col_norms_min: 3.85332846642
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.977987766266
train_y_min_max_class: 0.520123898983
train_y_misclass: 0.0307400058955
train_y_nll: 0.10031542182
train_y_row_norms_max: 1.16048312187
train_y_row_norms_mean: 0.407431900501
train_y_row_norms_min: 0.0134850600734
valid_h0_col_norms_max: 5.70130395889
valid_h0_col_norms_mean: 3.63394594193
valid_h0_col_norms_min: 2.06413507462
valid_h0_row_norms_max: 5.63436841965
valid_h0_row_norms_mean: 2.84383249283
valid_h0_row_norms_min: 0.0585759952664
valid_h1_col_norms_max: 5.33032464981
valid_h1_col_norms_mean: 3.41074442863
valid_h1_col_norms_min: 1.54273200035
valid_h1_row_norms_max: 6.95094776154
valid_h1_row_norms_mean: 4.84889364243
valid_h1_row_norms_min: 2.85255265236
valid_objective: 1.09732854366
valid_term_0: 0.148171290755
valid_term_1_weight_decay: 0.94915664196
valid_y_col_norms_max: 4.7894949913
valid_y_col_norms_mean: 4.32798671722
valid_y_col_norms_min: 3.85334467888
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.977250099182
valid_y_min_max_class: 0.51136559248
valid_y_misclass: 0.0399999879301
valid_y_nll: 0.148171290755
valid_y_row_norms_max: 1.16048634052
valid_y_row_norms_mean: 0.407433569431
valid_y_row_norms_min: 0.0134850135073
Time this epoch: 3.333940 seconds
Monitoring step:
Epochs seen: 12
Batches seen: 6000
Examples seen: 600000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 5.4267373085
test_h0_col_norms_mean: 3.48842096329
test_h0_col_norms_min: 1.96254551411
test_h0_row_norms_max: 5.41478538513
test_h0_row_norms_mean: 2.7299387455
test_h0_row_norms_min: 0.0784849375486
test_h1_col_norms_max: 5.06470775604
test_h1_col_norms_mean: 3.24842214584
test_h1_col_norms_min: 1.46678352356
test_h1_row_norms_max: 6.60853338242
test_h1_row_norms_mean: 4.6184220314
test_h1_row_norms_min: 2.71205830574
test_objective: 0.985752701759
test_term_0: 0.119499914348
test_term_1_weight_decay: 0.866252303123
test_y_col_norms_max: 4.76992559433
test_y_col_norms_mean: 4.27050018311
test_y_col_norms_min: 3.78093886375
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.979267477989
test_y_min_max_class: 0.543375730515
test_y_misclass: 0.0315999910235
test_y_nll: 0.119499914348
test_y_row_norms_max: 0.912945210934
test_y_row_norms_mean: 0.402993023396
test_y_row_norms_min: 0.0216930937022
train_h0_col_norms_max: 5.42672777176
train_h0_col_norms_mean: 3.48842120171
train_h0_col_norms_min: 1.96254348755
train_h0_row_norms_max: 5.41479063034
train_h0_row_norms_mean: 2.72992825508
train_h0_row_norms_min: 0.0784846991301
train_h1_col_norms_max: 5.06472110748
train_h1_col_norms_mean: 3.24842524529
train_h1_col_norms_min: 1.46679055691
train_h1_row_norms_max: 6.60850334167
train_h1_row_norms_mean: 4.61840820312
train_h1_row_norms_min: 2.71205258369
train_objective: 0.922969102859
train_term_0: 0.0567165091634
train_term_1_weight_decay: 0.866256058216
train_y_col_norms_max: 4.76994085312
train_y_col_norms_mean: 4.27052545547
train_y_col_norms_min: 3.7809548378
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.982750058174
train_y_min_max_class: 0.558061778545
train_y_misclass: 0.018240025267
train_y_nll: 0.0567165091634
train_y_row_norms_max: 0.912941157818
train_y_row_norms_mean: 0.402991384268
train_y_row_norms_min: 0.0216932129115
valid_h0_col_norms_max: 5.4267373085
valid_h0_col_norms_mean: 3.48842096329
valid_h0_col_norms_min: 1.96254551411
valid_h0_row_norms_max: 5.41478538513
valid_h0_row_norms_mean: 2.7299387455
valid_h0_row_norms_min: 0.0784849375486
valid_h1_col_norms_max: 5.06470775604
valid_h1_col_norms_mean: 3.24842214584
valid_h1_col_norms_min: 1.46678352356
valid_h1_row_norms_max: 6.60853338242
valid_h1_row_norms_mean: 4.6184220314
valid_h1_row_norms_min: 2.71205830574
valid_objective: 0.983159482479
valid_term_0: 0.11690659076
valid_term_1_weight_decay: 0.866252303123
valid_y_col_norms_max: 4.76992559433
valid_y_col_norms_mean: 4.27050018311
valid_y_col_norms_min: 3.78093886375
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.981662929058
valid_y_min_max_class: 0.533038794994
valid_y_misclass: 0.0296999812126
valid_y_nll: 0.11690659076
valid_y_row_norms_max: 0.912945210934
valid_y_row_norms_mean: 0.402993023396
valid_y_row_norms_min: 0.0216930937022
Time this epoch: 3.286931 seconds
Monitoring step:
Epochs seen: 13
Batches seen: 6500
Examples seen: 650000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 5.16044139862
test_h0_col_norms_mean: 3.34162855148
test_h0_col_norms_min: 1.86588740349
test_h0_row_norms_max: 5.20599794388
test_h0_row_norms_mean: 2.61515665054
test_h0_row_norms_min: 0.0764672607183
test_h1_col_norms_max: 4.8263502121
test_h1_col_norms_mean: 3.09343934059
test_h1_col_norms_min: 1.39710497856
test_h1_row_norms_max: 6.2830324173
test_h1_row_norms_mean: 4.39829969406
test_h1_row_norms_min: 2.57847547531
test_objective: 0.8922701478
test_term_0: 0.102732278407
test_term_1_weight_decay: 0.789536893368
test_y_col_norms_max: 4.68681240082
test_y_col_norms_mean: 4.23078680038
test_y_col_norms_min: 3.78408479691
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.983542442322
test_y_min_max_class: 0.593747019768
test_y_misclass: 0.0273999907076
test_y_nll: 0.102732278407
test_y_row_norms_max: 0.966835141182
test_y_row_norms_mean: 0.398707449436
test_y_row_norms_min: 0.0218474734575
train_h0_col_norms_max: 5.16042280197
train_h0_col_norms_mean: 3.34164571762
train_h0_col_norms_min: 1.86587870121
train_h0_row_norms_max: 5.2060174942
train_h0_row_norms_mean: 2.61516785622
train_h0_row_norms_min: 0.0764675214887
train_h1_col_norms_max: 4.82632637024
train_h1_col_norms_mean: 3.09345006943
train_h1_col_norms_min: 1.39711165428
train_h1_row_norms_max: 6.28304100037
train_h1_row_norms_mean: 4.39832401276
train_h1_row_norms_min: 2.5784881115
train_objective: 0.829474568367
train_term_0: 0.0399360619485
train_term_1_weight_decay: 0.789532542229
train_y_col_norms_max: 4.6868262291
train_y_col_norms_mean: 4.23076534271
train_y_col_norms_min: 3.78406834602
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.987834095955
train_y_min_max_class: 0.609752178192
train_y_misclass: 0.0125200273469
train_y_nll: 0.0399360619485
train_y_row_norms_max: 0.96683973074
train_y_row_norms_mean: 0.398709416389
train_y_row_norms_min: 0.0218475684524
valid_h0_col_norms_max: 5.16044139862
valid_h0_col_norms_mean: 3.34162855148
valid_h0_col_norms_min: 1.86588740349
valid_h0_row_norms_max: 5.20599794388
valid_h0_row_norms_mean: 2.61515665054
valid_h0_row_norms_min: 0.0764672607183
valid_h1_col_norms_max: 4.8263502121
valid_h1_col_norms_mean: 3.09343934059
valid_h1_col_norms_min: 1.39710497856
valid_h1_row_norms_max: 6.2830324173
valid_h1_row_norms_mean: 4.39829969406
valid_h1_row_norms_min: 2.57847547531
valid_objective: 0.903808116913
valid_term_0: 0.114270374179
valid_term_1_weight_decay: 0.789536893368
valid_y_col_norms_max: 4.68681240082
valid_y_col_norms_mean: 4.23078680038
valid_y_col_norms_min: 3.78408479691
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.984707713127
valid_y_min_max_class: 0.566586375237
valid_y_misclass: 0.028899980709
valid_y_nll: 0.114270374179
valid_y_row_norms_max: 0.966835141182
valid_y_row_norms_mean: 0.398707449436
valid_y_row_norms_min: 0.0218474734575
Time this epoch: 3.373106 seconds
Monitoring step:
Epochs seen: 14
Batches seen: 7000
Examples seen: 700000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 4.90614843369
test_h0_col_norms_mean: 3.19672346115
test_h0_col_norms_min: 1.77398645878
test_h0_row_norms_max: 4.96580123901
test_h0_row_norms_mean: 2.50189256668
test_h0_row_norms_min: 0.0782802626491
test_h1_col_norms_max: 4.59008312225
test_h1_col_norms_mean: 2.94533538818
test_h1_col_norms_min: 1.32907187939
test_h1_row_norms_max: 5.97338581085
test_h1_row_norms_mean: 4.18786859512
test_h1_row_norms_min: 2.45148181915
test_objective: 0.819695711136
test_term_0: 0.100928872824
test_term_1_weight_decay: 0.718766570091
test_y_col_norms_max: 4.5962023735
test_y_col_norms_mean: 4.16727113724
test_y_col_norms_min: 3.66778349876
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.987643420696
test_y_min_max_class: 0.622340202332
test_y_misclass: 0.0252999924123
test_y_nll: 0.100928872824
test_y_row_norms_max: 0.958882212639
test_y_row_norms_mean: 0.392418205738
test_y_row_norms_min: 0.0207168832421
train_h0_col_norms_max: 4.90617132187
train_h0_col_norms_mean: 3.19671821594
train_h0_col_norms_min: 1.77399635315
train_h0_row_norms_max: 4.96578741074
train_h0_row_norms_mean: 2.50188994408
train_h0_row_norms_min: 0.0782798752189
train_h1_col_norms_max: 4.5900592804
train_h1_col_norms_mean: 2.94533443451
train_h1_col_norms_min: 1.32906579971
train_h1_row_norms_max: 5.97335529327
train_h1_row_norms_mean: 4.18784570694
train_h1_row_norms_min: 2.45149064064
train_objective: 0.745359420776
train_term_0: 0.0265923049301
train_term_1_weight_decay: 0.718770325184
train_y_col_norms_max: 4.5962138176
train_y_col_norms_mean: 4.16729164124
train_y_col_norms_min: 3.66780090332
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.990943193436
train_y_min_max_class: 0.66484606266
train_y_misclass: 0.00918001402169
train_y_nll: 0.0265923049301
train_y_row_norms_max: 0.95888376236
train_y_row_norms_mean: 0.392418503761
train_y_row_norms_min: 0.0207169353962
valid_h0_col_norms_max: 4.90614843369
valid_h0_col_norms_mean: 3.19672346115
valid_h0_col_norms_min: 1.77398645878
valid_h0_row_norms_max: 4.96580123901
valid_h0_row_norms_mean: 2.50189256668
valid_h0_row_norms_min: 0.0782802626491
valid_h1_col_norms_max: 4.59008312225
valid_h1_col_norms_mean: 2.94533538818
valid_h1_col_norms_min: 1.32907187939
valid_h1_row_norms_max: 5.97338581085
valid_h1_row_norms_mean: 4.18786859512
valid_h1_row_norms_min: 2.45148181915
valid_objective: 0.827313005924
valid_term_0: 0.108545988798
valid_term_1_weight_decay: 0.718766570091
valid_y_col_norms_max: 4.5962023735
valid_y_col_norms_mean: 4.16727113724
valid_y_col_norms_min: 3.66778349876
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.987360954285
valid_y_min_max_class: 0.601630806923
valid_y_misclass: 0.0260999873281
valid_y_nll: 0.108545988798
valid_y_row_norms_max: 0.958882212639
valid_y_row_norms_mean: 0.392418205738
valid_y_row_norms_min: 0.0207168832421
Time this epoch: 3.270202 seconds
Monitoring step:
Epochs seen: 15
Batches seen: 7500
Examples seen: 750000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 4.66921758652
test_h0_col_norms_mean: 3.05542969704
test_h0_col_norms_min: 1.68660902977
test_h0_row_norms_max: 4.75306463242
test_h0_row_norms_mean: 2.39132237434
test_h0_row_norms_min: 0.0765107423067
test_h1_col_norms_max: 4.3710064888
test_h1_col_norms_mean: 2.8040626049
test_h1_col_norms_min: 1.26379609108
test_h1_row_norms_max: 5.67917490005
test_h1_row_norms_mean: 3.98712182045
test_h1_row_norms_min: 2.33073425293
test_objective: 0.737741053104
test_term_0: 0.083799123764
test_term_1_weight_decay: 0.653942167759
test_y_col_norms_max: 4.54380941391
test_y_col_norms_mean: 4.11158180237
test_y_col_norms_min: 3.66944622993
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.987535178661
test_y_min_max_class: 0.639883577824
test_y_misclass: 0.0226999949664
test_y_nll: 0.083799123764
test_y_row_norms_max: 0.992673635483
test_y_row_norms_mean: 0.386759877205
test_y_row_norms_min: 0.0214904490858
train_h0_col_norms_max: 4.66919612885
train_h0_col_norms_mean: 3.05544400215
train_h0_col_norms_min: 1.68661606312
train_h0_row_norms_max: 4.75308895111
train_h0_row_norms_mean: 2.39132881165
train_h0_row_norms_min: 0.0765103250742
train_h1_col_norms_max: 4.37101602554
train_h1_col_norms_mean: 2.80404901505
train_h1_col_norms_min: 1.26379692554
train_h1_row_norms_max: 5.67914772034
train_h1_row_norms_mean: 3.98710203171
train_h1_row_norms_min: 2.33073854446
train_objective: 0.66691416502
train_term_0: 0.0129722505808
train_term_1_weight_decay: 0.653943121433
train_y_col_norms_max: 4.54378795624
train_y_col_norms_mean: 4.11155748367
train_y_col_norms_min: 3.66946792603
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.993395149708
train_y_min_max_class: 0.715351760387
train_y_misclass: 0.00405999692157
train_y_nll: 0.0129722505808
train_y_row_norms_max: 0.992678523064
train_y_row_norms_mean: 0.386759728193
train_y_row_norms_min: 0.0214903373271
valid_h0_col_norms_max: 4.66921758652
valid_h0_col_norms_mean: 3.05542969704
valid_h0_col_norms_min: 1.68660902977
valid_h0_row_norms_max: 4.75306463242
valid_h0_row_norms_mean: 2.39132237434
valid_h0_row_norms_min: 0.0765107423067
valid_h1_col_norms_max: 4.3710064888
valid_h1_col_norms_mean: 2.8040626049
valid_h1_col_norms_min: 1.26379609108
valid_h1_row_norms_max: 5.67917490005
valid_h1_row_norms_mean: 3.98712182045
valid_h1_row_norms_min: 2.33073425293
valid_objective: 0.734254300594
valid_term_0: 0.0803121104836
valid_term_1_weight_decay: 0.653942167759
valid_y_col_norms_max: 4.54380941391
valid_y_col_norms_mean: 4.11158180237
valid_y_col_norms_min: 3.66944622993
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.987278044224
valid_y_min_max_class: 0.594924449921
valid_y_misclass: 0.0219999905676
valid_y_nll: 0.0803121104836
valid_y_row_norms_max: 0.992673635483
valid_y_row_norms_mean: 0.386759877205
valid_y_row_norms_min: 0.0214904490858
Time this epoch: 3.281498 seconds
Monitoring step:
Epochs seen: 16
Batches seen: 8000
Examples seen: 800000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 4.46374130249
test_h0_col_norms_mean: 2.9173309803
test_h0_col_norms_min: 1.60354030132
test_h0_row_norms_max: 4.55304861069
test_h0_row_norms_mean: 2.28333234787
test_h0_row_norms_min: 0.0760971903801
test_h1_col_norms_max: 4.15992879868
test_h1_col_norms_mean: 2.66938233376
test_h1_col_norms_min: 1.20336544514
test_h1_row_norms_max: 5.3994641304
test_h1_row_norms_mean: 3.79574894905
test_h1_row_norms_min: 2.21593642235
test_objective: 0.674262106419
test_term_0: 0.0797682702541
test_term_1_weight_decay: 0.594494223595
test_y_col_norms_max: 4.41636514664
test_y_col_norms_mean: 4.05042076111
test_y_col_norms_min: 3.58171629906
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.986650168896
test_y_min_max_class: 0.618766665459
test_y_misclass: 0.0221999883652
test_y_nll: 0.0797682702541
test_y_row_norms_max: 0.987005531788
test_y_row_norms_mean: 0.380280554295
test_y_row_norms_min: 0.0215586218983
train_h0_col_norms_max: 4.46375417709
train_h0_col_norms_mean: 2.91732239723
train_h0_col_norms_min: 1.60354280472
train_h0_row_norms_max: 4.55305957794
train_h0_row_norms_mean: 2.2833340168
train_h0_row_norms_min: 0.0760968104005
train_h1_col_norms_max: 4.159927845
train_h1_col_norms_mean: 2.66937685013
train_h1_col_norms_min: 1.20335972309
train_h1_row_norms_max: 5.39946508408
train_h1_row_norms_mean: 3.79574465752
train_h1_row_norms_min: 2.21593785286
train_objective: 0.604817152023
train_term_0: 0.0103233894333
train_term_1_weight_decay: 0.594495952129
train_y_col_norms_max: 4.41635942459
train_y_col_norms_mean: 4.05040311813
train_y_col_norms_min: 3.58173537254
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.993671536446
train_y_min_max_class: 0.732953190804
train_y_misclass: 0.00291999848559
train_y_nll: 0.0103233894333
train_y_row_norms_max: 0.987000524998
train_y_row_norms_mean: 0.380278617144
train_y_row_norms_min: 0.0215585716069
valid_h0_col_norms_max: 4.46374130249
valid_h0_col_norms_mean: 2.9173309803
valid_h0_col_norms_min: 1.60354030132
valid_h0_row_norms_max: 4.55304861069
valid_h0_row_norms_mean: 2.28333234787
valid_h0_row_norms_min: 0.0760971903801
valid_h1_col_norms_max: 4.15992879868
valid_h1_col_norms_mean: 2.66938233376
valid_h1_col_norms_min: 1.20336544514
valid_h1_row_norms_max: 5.3994641304
valid_h1_row_norms_mean: 3.79574894905
valid_h1_row_norms_min: 2.21593642235
valid_objective: 0.67914390564
valid_term_0: 0.0846498459578
valid_term_1_weight_decay: 0.594494223595
valid_y_col_norms_max: 4.41636514664
valid_y_col_norms_mean: 4.05042076111
valid_y_col_norms_min: 3.58171629906
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.987098455429
valid_y_min_max_class: 0.597771346569
valid_y_misclass: 0.0203999932855
valid_y_nll: 0.0846498459578
valid_y_row_norms_max: 0.987005531788
valid_y_row_norms_mean: 0.380280554295
valid_y_row_norms_min: 0.0215586218983
Time this epoch: 3.317685 seconds
Monitoring step:
Epochs seen: 17
Batches seen: 8500
Examples seen: 850000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 4.26407384872
test_h0_col_norms_mean: 2.78463411331
test_h0_col_norms_min: 1.52455866337
test_h0_row_norms_max: 4.34796571732
test_h0_row_norms_mean: 2.17960953712
test_h0_row_norms_min: 0.0764012187719
test_h1_col_norms_max: 3.95875430107
test_h1_col_norms_mean: 2.54128909111
test_h1_col_norms_min: 1.14473068714
test_h1_row_norms_max: 5.13351964951
test_h1_row_norms_mean: 3.61375975609
test_h1_row_norms_min: 2.1067969799
test_objective: 0.610892295837
test_term_0: 0.0704278945923
test_term_1_weight_decay: 0.540464937687
test_y_col_norms_max: 4.3217663765
test_y_col_norms_mean: 4.00428628922
test_y_col_norms_min: 3.53744649887
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.988494455814
test_y_min_max_class: 0.640034735203
test_y_misclass: 0.0186999924481
test_y_nll: 0.0704278945923
test_y_row_norms_max: 0.994636058807
test_y_row_norms_mean: 0.3746727705
test_y_row_norms_min: 0.0214648172259
train_h0_col_norms_max: 4.26409387589
train_h0_col_norms_mean: 2.78462028503
train_h0_col_norms_min: 1.52456390858
train_h0_row_norms_max: 4.34795570374
train_h0_row_norms_mean: 2.17961931229
train_h0_row_norms_min: 0.0764016136527
train_h1_col_norms_max: 3.95873188972
train_h1_col_norms_mean: 2.54130077362
train_h1_col_norms_min: 1.14473164082
train_h1_row_norms_max: 5.13353729248
train_h1_row_norms_mean: 3.61374282837
train_h1_row_norms_min: 2.10678911209
train_objective: 0.5477257967
train_term_0: 0.007261632476
train_term_1_weight_decay: 0.540465056896
train_y_col_norms_max: 4.32178735733
train_y_col_norms_mean: 4.00426435471
train_y_col_norms_min: 3.5374417305
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.995666265488
train_y_min_max_class: 0.789768993855
train_y_misclass: 0.00194000091869
train_y_nll: 0.007261632476
train_y_row_norms_max: 0.994630157948
train_y_row_norms_mean: 0.374674469233
train_y_row_norms_min: 0.0214648637921
valid_h0_col_norms_max: 4.26407384872
valid_h0_col_norms_mean: 2.78463411331
valid_h0_col_norms_min: 1.52455866337
valid_h0_row_norms_max: 4.34796571732
valid_h0_row_norms_mean: 2.17960953712
valid_h0_row_norms_min: 0.0764012187719
valid_h1_col_norms_max: 3.95875430107
valid_h1_col_norms_mean: 2.54128909111
valid_h1_col_norms_min: 1.14473068714
valid_h1_row_norms_max: 5.13351964951
valid_h1_row_norms_mean: 3.61375975609
valid_h1_row_norms_min: 2.1067969799
valid_objective: 0.617604732513
valid_term_0: 0.0771402940154
valid_term_1_weight_decay: 0.540464937687
valid_y_col_norms_max: 4.3217663765
valid_y_col_norms_mean: 4.00428628922
valid_y_col_norms_min: 3.53744649887
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.989329993725
valid_y_min_max_class: 0.605314671993
valid_y_misclass: 0.0208999905735
valid_y_nll: 0.0771402940154
valid_y_row_norms_max: 0.994636058807
valid_y_row_norms_mean: 0.3746727705
valid_y_row_norms_min: 0.0214648172259
Time this epoch: 3.269546 seconds
Monitoring step:
Epochs seen: 18
Batches seen: 9000
Examples seen: 900000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 4.07507371902
test_h0_col_norms_mean: 2.65655350685
test_h0_col_norms_min: 1.44946885109
test_h0_row_norms_max: 4.15038585663
test_h0_row_norms_mean: 2.07941555977
test_h0_row_norms_min: 0.0857979208231
test_h1_col_norms_max: 3.77103662491
test_h1_col_norms_mean: 2.41892313957
test_h1_col_norms_min: 1.08750927448
test_h1_row_norms_max: 4.88069534302
test_h1_row_norms_mean: 3.43979096413
test_h1_row_norms_min: 2.00302839279
test_objective: 0.562726557255
test_term_0: 0.0717355385423
test_term_1_weight_decay: 0.490990847349
test_y_col_norms_max: 4.28208780289
test_y_col_norms_mean: 3.93249392509
test_y_col_norms_min: 3.48496580124
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.990153551102
test_y_min_max_class: 0.659395575523
test_y_misclass: 0.0185999944806
test_y_nll: 0.0717355385423
test_y_row_norms_max: 0.942749202251
test_y_row_norms_mean: 0.367405802011
test_y_row_norms_min: 0.019349604845
train_h0_col_norms_max: 4.07505750656
train_h0_col_norms_mean: 2.65656781197
train_h0_col_norms_min: 1.44946610928
train_h0_row_norms_max: 4.15039014816
train_h0_row_norms_mean: 2.07942199707
train_h0_row_norms_min: 0.0857974886894
train_h1_col_norms_max: 3.77104258537
train_h1_col_norms_mean: 2.41892194748
train_h1_col_norms_min: 1.08750891685
train_h1_row_norms_max: 4.88067293167
train_h1_row_norms_mean: 3.43978619576
train_h1_row_norms_min: 2.00302529335
train_objective: 0.496095150709
train_term_0: 0.00510408030823
train_term_1_weight_decay: 0.490992516279
train_y_col_norms_max: 4.28208827972
train_y_col_norms_mean: 3.93247795105
train_y_col_norms_min: 3.48496460915
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.996478378773
train_y_min_max_class: 0.825236082077
train_y_misclass: 0.00105999980588
train_y_nll: 0.00510408030823
train_y_row_norms_max: 0.942744672298
train_y_row_norms_mean: 0.367404073477
train_y_row_norms_min: 0.0193495322019
valid_h0_col_norms_max: 4.07507371902
valid_h0_col_norms_mean: 2.65655350685
valid_h0_col_norms_min: 1.44946885109
valid_h0_row_norms_max: 4.15038585663
valid_h0_row_norms_mean: 2.07941555977
valid_h0_row_norms_min: 0.0857979208231
valid_h1_col_norms_max: 3.77103662491
valid_h1_col_norms_mean: 2.41892313957
valid_h1_col_norms_min: 1.08750927448
valid_h1_row_norms_max: 4.88069534302
valid_h1_row_norms_mean: 3.43979096413
valid_h1_row_norms_min: 2.00302839279
valid_objective: 0.568551659584
valid_term_0: 0.0775607377291
valid_term_1_weight_decay: 0.490990847349
valid_y_col_norms_max: 4.28208780289
valid_y_col_norms_mean: 3.93249392509
valid_y_col_norms_min: 3.48496580124
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.989968895912
valid_y_min_max_class: 0.620238602161
valid_y_misclass: 0.0208999887109
valid_y_nll: 0.0775607377291
valid_y_row_norms_max: 0.942749202251
valid_y_row_norms_mean: 0.367405802011
valid_y_row_norms_min: 0.019349604845
Time this epoch: 3.304628 seconds
Monitoring step:
Epochs seen: 19
Batches seen: 9500
Examples seen: 950000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 3.88326621056
test_h0_col_norms_mean: 2.53280711174
test_h0_col_norms_min: 1.37807917595
test_h0_row_norms_max: 3.96461653709
test_h0_row_norms_mean: 1.98260319233
test_h0_row_norms_min: 0.09099239856
test_h1_col_norms_max: 3.58734297752
test_h1_col_norms_mean: 2.30225491524
test_h1_col_norms_min: 1.03453934193
test_h1_row_norms_max: 4.64029741287
test_h1_row_norms_mean: 3.27396249771
test_h1_row_norms_min: 1.90437150002
test_objective: 0.510573804379
test_term_0: 0.0647685080767
test_term_1_weight_decay: 0.445804834366
test_y_col_norms_max: 4.17118215561
test_y_col_norms_mean: 3.85513329506
test_y_col_norms_min: 3.38714289665
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.989724636078
test_y_min_max_class: 0.680286705494
test_y_misclass: 0.0178999938071
test_y_nll: 0.0647685080767
test_y_row_norms_max: 0.922487914562
test_y_row_norms_mean: 0.359323531389
test_y_row_norms_min: 0.0180249232799
train_h0_col_norms_max: 3.883248806
train_h0_col_norms_mean: 2.53280425072
train_h0_col_norms_min: 1.37807655334
train_h0_row_norms_max: 3.96463823318
train_h0_row_norms_mean: 1.982614398
train_h0_row_norms_min: 0.090992718935
train_h1_col_norms_max: 3.58735847473
train_h1_col_norms_mean: 2.30224943161
train_h1_col_norms_min: 1.03453481197
train_h1_row_norms_max: 4.64032030106
train_h1_row_norms_mean: 3.2739636898
train_h1_row_norms_min: 1.90436935425
train_objective: 0.449831366539
train_term_0: 0.00402596499771
train_term_1_weight_decay: 0.445802211761
train_y_col_norms_max: 4.17116117477
train_y_col_norms_mean: 3.85511660576
train_y_col_norms_min: 3.3871281147
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.996860563755
train_y_min_max_class: 0.843070626259
train_y_misclass: 0.000819999666419
train_y_nll: 0.00402596499771
train_y_row_norms_max: 0.922493100166
train_y_row_norms_mean: 0.359325319529
train_y_row_norms_min: 0.0180248413235
valid_h0_col_norms_max: 3.88326621056
valid_h0_col_norms_mean: 2.53280711174
valid_h0_col_norms_min: 1.37807917595
valid_h0_row_norms_max: 3.96461653709
valid_h0_row_norms_mean: 1.98260319233
valid_h0_row_norms_min: 0.09099239856
valid_h1_col_norms_max: 3.58734297752
valid_h1_col_norms_mean: 2.30225491524
valid_h1_col_norms_min: 1.03453934193
valid_h1_row_norms_max: 4.64029741287
valid_h1_row_norms_mean: 3.27396249771
valid_h1_row_norms_min: 1.90437150002
valid_objective: 0.517447412014
valid_term_0: 0.0716420337558
valid_term_1_weight_decay: 0.445804834366
valid_y_col_norms_max: 4.17118215561
valid_y_col_norms_mean: 3.85513329506
valid_y_col_norms_min: 3.38714289665
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.990103065968
valid_y_min_max_class: 0.65864700079
valid_y_misclass: 0.019399991259
valid_y_nll: 0.0716420337558
valid_y_row_norms_max: 0.922487914562
valid_y_row_norms_mean: 0.359323531389
valid_y_row_norms_min: 0.0180249232799
Time this epoch: 3.352973 seconds
Monitoring step:
Epochs seen: 20
Batches seen: 10000
Examples seen: 1000000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 3.69830107689
test_h0_col_norms_mean: 2.41490340233
test_h0_col_norms_min: 1.31020438671
test_h0_row_norms_max: 3.78174734116
test_h0_row_norms_mean: 1.89037334919
test_h0_row_norms_min: 0.0872991830111
test_h1_col_norms_max: 3.41410470009
test_h1_col_norms_mean: 2.19139242172
test_h1_col_norms_min: 0.983708977699
test_h1_row_norms_max: 4.41174936295
test_h1_row_norms_mean: 3.11639881134
test_h1_row_norms_min: 1.81057536602
test_objective: 0.4696611166
test_term_0: 0.064779728651
test_term_1_weight_decay: 0.404881685972
test_y_col_norms_max: 4.09565019608
test_y_col_norms_mean: 3.78634428978
test_y_col_norms_min: 3.3164498806
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.98881983757
test_y_min_max_class: 0.653255581856
test_y_misclass: 0.017599998042
test_y_nll: 0.064779728651
test_y_row_norms_max: 0.925839364529
test_y_row_norms_mean: 0.351857930422
test_y_row_norms_min: 0.0175057649612
train_h0_col_norms_max: 3.69831848145
train_h0_col_norms_mean: 2.41490244865
train_h0_col_norms_min: 1.31020689011
train_h0_row_norms_max: 3.78176569939
train_h0_row_norms_mean: 1.89036512375
train_h0_row_norms_min: 0.0872987210751
train_h1_col_norms_max: 3.41411972046
train_h1_col_norms_mean: 2.19141077995
train_h1_col_norms_min: 0.983713150024
train_h1_row_norms_max: 4.41172409058
train_h1_row_norms_mean: 3.11639785767
train_h1_row_norms_min: 1.81056690216
train_objective: 0.409165471792
train_term_0: 0.00428416905925
train_term_1_weight_decay: 0.404880315065
train_y_col_norms_max: 4.09566497803
train_y_col_norms_mean: 3.78633141518
train_y_col_norms_min: 3.31643605232
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.996947467327
train_y_min_max_class: 0.855619430542
train_y_misclass: 0.000879999657627
train_y_nll: 0.00428416905925
train_y_row_norms_max: 0.925839066505
train_y_row_norms_mean: 0.351859807968
train_y_row_norms_min: 0.0175057388842
valid_h0_col_norms_max: 3.69830107689
valid_h0_col_norms_mean: 2.41490340233
valid_h0_col_norms_min: 1.31020438671
valid_h0_row_norms_max: 3.78174734116
valid_h0_row_norms_mean: 1.89037334919
valid_h0_row_norms_min: 0.0872991830111
valid_h1_col_norms_max: 3.41410470009
valid_h1_col_norms_mean: 2.19139242172
valid_h1_col_norms_min: 0.983708977699
valid_h1_row_norms_max: 4.41174936295
valid_h1_row_norms_mean: 3.11639881134
valid_h1_row_norms_min: 1.81057536602
valid_objective: 0.475686132908
valid_term_0: 0.0708047524095
valid_term_1_weight_decay: 0.404881685972
valid_y_col_norms_max: 4.09565019608
valid_y_col_norms_mean: 3.78634428978
valid_y_col_norms_min: 3.3164498806
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.989249825478
valid_y_min_max_class: 0.616850614548
valid_y_misclass: 0.0192999914289
valid_y_nll: 0.0708047524095
valid_y_row_norms_max: 0.925839364529
valid_y_row_norms_mean: 0.351857930422
valid_y_row_norms_min: 0.0175057649612
Time this epoch: 3.278321 seconds
Monitoring step:
Epochs seen: 21
Batches seen: 10500
Examples seen: 1050000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 3.52711653709
test_h0_col_norms_mean: 2.30246567726
test_h0_col_norms_min: 1.2456703186
test_h0_row_norms_max: 3.60426926613
test_h0_row_norms_mean: 1.80239653587
test_h0_row_norms_min: 0.0854785442352
test_h1_col_norms_max: 3.24995541573
test_h1_col_norms_mean: 2.08604121208
test_h1_col_norms_min: 0.935500979424
test_h1_row_norms_max: 4.19445514679
test_h1_row_norms_mean: 2.96661686897
test_h1_row_norms_min: 1.72139751911
test_objective: 0.435198038816
test_term_0: 0.0674102455378
test_term_1_weight_decay: 0.367787539959
test_y_col_norms_max: 4.01598834991
test_y_col_norms_mean: 3.72248363495
test_y_col_norms_min: 3.24742627144
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.988867938519
test_y_min_max_class: 0.670085370541
test_y_misclass: 0.0185999963433
test_y_nll: 0.0674102455378
test_y_row_norms_max: 0.902759611607
test_y_row_norms_mean: 0.344950795174
test_y_row_norms_min: 0.0167198460549
train_h0_col_norms_max: 3.52713274956
train_h0_col_norms_mean: 2.30245995522
train_h0_col_norms_min: 1.24567604065
train_h0_row_norms_max: 3.60428500175
train_h0_row_norms_mean: 1.80238819122
train_h0_row_norms_min: 0.0854781419039
train_h1_col_norms_max: 3.24996852875
train_h1_col_norms_mean: 2.08604311943
train_h1_col_norms_min: 0.935496866703
train_h1_row_norms_max: 4.19447517395
train_h1_row_norms_mean: 2.96663236618
train_h1_row_norms_min: 1.72139537334
train_objective: 0.372235387564
train_term_0: 0.00444769486785
train_term_1_weight_decay: 0.367789417505
train_y_col_norms_max: 4.01601171494
train_y_col_norms_mean: 3.7224612236
train_y_col_norms_min: 3.24741005898
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.996587693691
train_y_min_max_class: 0.846141993999
train_y_misclass: 0.000799999630544
train_y_nll: 0.00444769486785
train_y_row_norms_max: 0.902763843536
train_y_row_norms_mean: 0.344951033592
train_y_row_norms_min: 0.0167199298739
valid_h0_col_norms_max: 3.52711653709
valid_h0_col_norms_mean: 2.30246567726
valid_h0_col_norms_min: 1.2456703186
valid_h0_row_norms_max: 3.60426926613
valid_h0_row_norms_mean: 1.80239653587
valid_h0_row_norms_min: 0.0854785442352
valid_h1_col_norms_max: 3.24995541573
valid_h1_col_norms_mean: 2.08604121208
valid_h1_col_norms_min: 0.935500979424
valid_h1_row_norms_max: 4.19445514679
valid_h1_row_norms_mean: 2.96661686897
valid_h1_row_norms_min: 1.72139751911
valid_objective: 0.439748078585
valid_term_0: 0.0719603598118
valid_term_1_weight_decay: 0.367787539959
valid_y_col_norms_max: 4.01598834991
valid_y_col_norms_mean: 3.72248363495
valid_y_col_norms_min: 3.24742627144
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.989000380039
valid_y_min_max_class: 0.610602736473
valid_y_misclass: 0.0192999914289
valid_y_nll: 0.0719603598118
valid_y_row_norms_max: 0.902759611607
valid_y_row_norms_mean: 0.344950795174
valid_y_row_norms_min: 0.0167198460549
Time this epoch: 3.292022 seconds
Monitoring step:
Epochs seen: 22
Batches seen: 11000
Examples seen: 1100000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 3.36697125435
test_h0_col_norms_mean: 2.19646573067
test_h0_col_norms_min: 1.18431913853
test_h0_row_norms_max: 3.44316983223
test_h0_row_norms_mean: 1.71948647499
test_h0_row_norms_min: 0.0820252001286
test_h1_col_norms_max: 3.09565114975
test_h1_col_norms_mean: 1.98627471924
test_h1_col_norms_min: 0.889862000942
test_h1_row_norms_max: 3.98788499832
test_h1_row_norms_mean: 2.82480931282
test_h1_row_norms_min: 1.63661336899
test_objective: 0.394841223955
test_term_0: 0.0604076348245
test_term_1_weight_decay: 0.334433555603
test_y_col_norms_max: 4.01139307022
test_y_col_norms_mean: 3.67662405968
test_y_col_norms_min: 3.18460655212
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.989187180996
test_y_min_max_class: 0.643190681934
test_y_misclass: 0.0180999971926
test_y_nll: 0.0604076348245
test_y_row_norms_max: 0.901866018772
test_y_row_norms_mean: 0.339687138796
test_y_row_norms_min: 0.01714236103
train_h0_col_norms_max: 3.36695551872
train_h0_col_norms_mean: 2.19647264481
train_h0_col_norms_min: 1.18431949615
train_h0_row_norms_max: 3.44315481186
train_h0_row_norms_mean: 1.7194788456
train_h0_row_norms_min: 0.0820248499513
train_h1_col_norms_max: 3.09563612938
train_h1_col_norms_mean: 1.98626804352
train_h1_col_norms_min: 0.889865934849
train_h1_row_norms_max: 3.98790216446
train_h1_row_norms_mean: 2.82479405403
train_h1_row_norms_min: 1.63662087917
train_objective: 0.337020277977
train_term_0: 0.00258666928858
train_term_1_weight_decay: 0.334432244301
train_y_col_norms_max: 4.01139307022
train_y_col_norms_mean: 3.67662858963
train_y_col_norms_min: 3.18459391594
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.997638344765
train_y_min_max_class: 0.894991517067
train_y_misclass: 0.000179999973625
train_y_nll: 0.00258666928858
train_y_row_norms_max: 0.901870131493
train_y_row_norms_mean: 0.339686959982
train_y_row_norms_min: 0.0171423424035
valid_h0_col_norms_max: 3.36697125435
valid_h0_col_norms_mean: 2.19646573067
valid_h0_col_norms_min: 1.18431913853
valid_h0_row_norms_max: 3.44316983223
valid_h0_row_norms_mean: 1.71948647499
valid_h0_row_norms_min: 0.0820252001286
valid_h1_col_norms_max: 3.09565114975
valid_h1_col_norms_mean: 1.98627471924
valid_h1_col_norms_min: 0.889862000942
valid_h1_row_norms_max: 3.98788499832
valid_h1_row_norms_mean: 2.82480931282
valid_h1_row_norms_min: 1.63661336899
valid_objective: 0.399684429169
valid_term_0: 0.065250813961
valid_term_1_weight_decay: 0.334433555603
valid_y_col_norms_max: 4.01139307022
valid_y_col_norms_mean: 3.67662405968
valid_y_col_norms_min: 3.18460655212
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.989832878113
valid_y_min_max_class: 0.622855961323
valid_y_misclass: 0.0177999921143
valid_y_nll: 0.065250813961
valid_y_row_norms_max: 0.901866018772
valid_y_row_norms_mean: 0.339687138796
valid_y_row_norms_min: 0.01714236103
Time this epoch: 3.278771 seconds
Monitoring step:
Epochs seen: 23
Batches seen: 11500
Examples seen: 1150000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 3.2081348896
test_h0_col_norms_mean: 2.09370923042
test_h0_col_norms_min: 1.12598621845
test_h0_row_norms_max: 3.28446102142
test_h0_row_norms_mean: 1.63902020454
test_h0_row_norms_min: 0.0778670459986
test_h1_col_norms_max: 2.94689941406
test_h1_col_norms_mean: 1.89091038704
test_h1_col_norms_min: 0.846523106098
test_h1_row_norms_max: 3.79147481918
test_h1_row_norms_mean: 2.68924212456
test_h1_row_norms_min: 1.55600595474
test_objective: 0.359032511711
test_term_0: 0.0552162267268
test_term_1_weight_decay: 0.303816497326
test_y_col_norms_max: 3.92041349411
test_y_col_norms_mean: 3.61436057091
test_y_col_norms_min: 3.12086963654
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.989588081837
test_y_min_max_class: 0.669209182262
test_y_misclass: 0.016299996525
test_y_nll: 0.0552162267268
test_y_row_norms_max: 0.909698069096
test_y_row_norms_mean: 0.333013266325
test_y_row_norms_min: 0.0162505507469
train_h0_col_norms_max: 3.20815110207
train_h0_col_norms_mean: 2.09371638298
train_h0_col_norms_min: 1.12598991394
train_h0_row_norms_max: 3.28445625305
train_h0_row_norms_mean: 1.63901221752
train_h0_row_norms_min: 0.0778671503067
train_h1_col_norms_max: 2.94688630104
train_h1_col_norms_mean: 1.89090168476
train_h1_col_norms_min: 0.846526682377
train_h1_row_norms_max: 3.79145789146
train_h1_row_norms_mean: 2.68924236298
train_h1_row_norms_min: 1.55599808693
train_objective: 0.306422680616
train_term_0: 0.00260642380454
train_term_1_weight_decay: 0.303818255663
train_y_col_norms_max: 3.92042994499
train_y_col_norms_mean: 3.61433935165
train_y_col_norms_min: 3.12087059021
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.997653722763
train_y_min_max_class: 0.897151112556
train_y_misclass: 0.000119999996969
train_y_nll: 0.00260642380454
train_y_row_norms_max: 0.909693837166
train_y_row_norms_mean: 0.333012223244
train_y_row_norms_min: 0.0162506196648
valid_h0_col_norms_max: 3.2081348896
valid_h0_col_norms_mean: 2.09370923042
valid_h0_col_norms_min: 1.12598621845
valid_h0_row_norms_max: 3.28446102142
valid_h0_row_norms_mean: 1.63902020454
valid_h0_row_norms_min: 0.0778670459986
valid_h1_col_norms_max: 2.94689941406
valid_h1_col_norms_mean: 1.89091038704
valid_h1_col_norms_min: 0.846523106098
valid_h1_row_norms_max: 3.79147481918
valid_h1_row_norms_mean: 2.68924212456
valid_h1_row_norms_min: 1.55600595474
valid_objective: 0.370760649443
valid_term_0: 0.0669444948435
valid_term_1_weight_decay: 0.303816497326
valid_y_col_norms_max: 3.92041349411
valid_y_col_norms_mean: 3.61436057091
valid_y_col_norms_min: 3.12086963654
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.989882349968
valid_y_min_max_class: 0.639726042747
valid_y_misclass: 0.0178999956697
valid_y_nll: 0.0669444948435
valid_y_row_norms_max: 0.909698069096
valid_y_row_norms_mean: 0.333013266325
valid_y_row_norms_min: 0.0162505507469
Time this epoch: 3.283699 seconds
Monitoring step:
Epochs seen: 24
Batches seen: 12000
Examples seen: 1200000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 3.0553176403
test_h0_col_norms_mean: 1.99559020996
test_h0_col_norms_min: 1.07052779198
test_h0_row_norms_max: 3.1363966465
test_h0_row_norms_mean: 1.56222510338
test_h0_row_norms_min: 0.0742355883121
test_h1_col_norms_max: 2.80656790733
test_h1_col_norms_mean: 1.80023908615
test_h1_col_norms_min: 0.805699706078
test_h1_row_norms_max: 3.60472822189
test_h1_row_norms_mean: 2.5603313446
test_h1_row_norms_min: 1.47936725616
test_objective: 0.333813428879
test_term_0: 0.0577764734626
test_term_1_weight_decay: 0.276036947966
test_y_col_norms_max: 3.85618805885
test_y_col_norms_mean: 3.55425548553
test_y_col_norms_min: 3.04648113251
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.989487826824
test_y_min_max_class: 0.649445652962
test_y_misclass: 0.0160999950022
test_y_nll: 0.0577764734626
test_y_row_norms_max: 0.918738484383
test_y_row_norms_mean: 0.326478481293
test_y_row_norms_min: 0.0155664272606
train_h0_col_norms_max: 3.055331707
train_h0_col_norms_mean: 1.99557840824
train_h0_col_norms_min: 1.07052719593
train_h0_row_norms_max: 3.1363966465
train_h0_row_norms_mean: 1.56223297119
train_h0_row_norms_min: 0.0742354020476
train_h1_col_norms_max: 2.80655431747
train_h1_col_norms_mean: 1.80023896694
train_h1_col_norms_min: 0.805703580379
train_h1_row_norms_max: 3.60474324226
train_h1_row_norms_mean: 2.56033945084
train_h1_row_norms_min: 1.47937119007
train_objective: 0.278264194727
train_term_0: 0.00222728447989
train_term_1_weight_decay: 0.276037305593
train_y_col_norms_max: 3.85618400574
train_y_col_norms_mean: 3.55425071716
train_y_col_norms_min: 3.04648280144
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.997917592525
train_y_min_max_class: 0.919997572899
train_y_misclass: 7.9999997979e-05
train_y_nll: 0.00222728447989
train_y_row_norms_max: 0.918739795685
train_y_row_norms_mean: 0.326479077339
train_y_row_norms_min: 0.0155664980412
valid_h0_col_norms_max: 3.0553176403
valid_h0_col_norms_mean: 1.99559020996
valid_h0_col_norms_min: 1.07052779198
valid_h0_row_norms_max: 3.1363966465
valid_h0_row_norms_mean: 1.56222510338
valid_h0_row_norms_min: 0.0742355883121
valid_h1_col_norms_max: 2.80656790733
valid_h1_col_norms_mean: 1.80023908615
valid_h1_col_norms_min: 0.805699706078
valid_h1_row_norms_max: 3.60472822189
valid_h1_row_norms_mean: 2.5603313446
valid_h1_row_norms_min: 1.47936725616
valid_objective: 0.338611006737
valid_term_0: 0.062574096024
valid_term_1_weight_decay: 0.276036947966
valid_y_col_norms_max: 3.85618805885
valid_y_col_norms_mean: 3.55425548553
valid_y_col_norms_min: 3.04648113251
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.98962688446
valid_y_min_max_class: 0.629032611847
valid_y_misclass: 0.0176999941468
valid_y_nll: 0.062574096024
valid_y_row_norms_max: 0.918738484383
valid_y_row_norms_mean: 0.326478481293
valid_y_row_norms_min: 0.0155664272606
Time this epoch: 3.319413 seconds
Monitoring step:
Epochs seen: 25
Batches seen: 12500
Examples seen: 1250000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 2.91628909111
test_h0_col_norms_mean: 1.90174818039
test_h0_col_norms_min: 1.01780152321
test_h0_row_norms_max: 2.98749780655
test_h0_row_norms_mean: 1.48878085613
test_h0_row_norms_min: 0.0709456577897
test_h1_col_norms_max: 2.6732199192
test_h1_col_norms_mean: 1.71399199963
test_h1_col_norms_min: 0.766523241997
test_h1_row_norms_max: 3.42718911171
test_h1_row_norms_mean: 2.43771839142
test_h1_row_norms_min: 1.40650296211
test_objective: 0.305504858494
test_term_0: 0.0546990483999
test_term_1_weight_decay: 0.2508058846
test_y_col_norms_max: 3.78892588615
test_y_col_norms_mean: 3.49624156952
test_y_col_norms_min: 3.00044965744
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.989326953888
test_y_min_max_class: 0.664230465889
test_y_misclass: 0.0153999980539
test_y_nll: 0.0546990483999
test_y_row_norms_max: 0.914359211922
test_y_row_norms_mean: 0.320046216249
test_y_row_norms_min: 0.0148301701993
train_h0_col_norms_max: 2.9162979126
train_h0_col_norms_mean: 1.90174603462
train_h0_col_norms_min: 1.01780331135
train_h0_row_norms_max: 2.98750209808
train_h0_row_norms_mean: 1.48878598213
train_h0_row_norms_min: 0.0709457397461
train_h1_col_norms_max: 2.67322587967
train_h1_col_norms_mean: 1.71399140358
train_h1_col_norms_min: 0.766523063183
train_h1_row_norms_max: 3.42717552185
train_h1_row_norms_mean: 2.4377117157
train_h1_row_norms_min: 1.40649604797
train_objective: 0.252682715654
train_term_0: 0.00187695620116
train_term_1_weight_decay: 0.250807106495
train_y_col_norms_max: 3.78894209862
train_y_col_norms_mean: 3.49625706673
train_y_col_norms_min: 3.00046014786
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.998175680637
train_y_min_max_class: 0.935915350914
train_y_misclass: 0.0
train_y_nll: 0.00187695620116
train_y_row_norms_max: 0.914363384247
train_y_row_norms_mean: 0.320046842098
train_y_row_norms_min: 0.0148302586749
valid_h0_col_norms_max: 2.91628909111
valid_h0_col_norms_mean: 1.90174818039
valid_h0_col_norms_min: 1.01780152321
valid_h0_row_norms_max: 2.98749780655
valid_h0_row_norms_mean: 1.48878085613
valid_h0_row_norms_min: 0.0709456577897
valid_h1_col_norms_max: 2.6732199192
valid_h1_col_norms_mean: 1.71399199963
valid_h1_col_norms_min: 0.766523241997
valid_h1_row_norms_max: 3.42718911171
valid_h1_row_norms_mean: 2.43771839142
valid_h1_row_norms_min: 1.40650296211
valid_objective: 0.310465872288
valid_term_0: 0.0596601851285
valid_term_1_weight_decay: 0.2508058846
valid_y_col_norms_max: 3.78892588615
valid_y_col_norms_mean: 3.49624156952
valid_y_col_norms_min: 3.00044965744
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.989892303944
valid_y_min_max_class: 0.631849765778
valid_y_misclass: 0.0163999944925
valid_y_nll: 0.0596601851285
valid_y_row_norms_max: 0.914359211922
valid_y_row_norms_mean: 0.320046216249
valid_y_row_norms_min: 0.0148301701993
Time this epoch: 3.332601 seconds
Monitoring step:
Epochs seen: 26
Batches seen: 13000
Examples seen: 1300000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 2.78018331528
test_h0_col_norms_mean: 1.81279492378
test_h0_col_norms_min: 0.967669785023
test_h0_row_norms_max: 2.84347510338
test_h0_row_norms_mean: 1.41918671131
test_h0_row_norms_min: 0.0677683353424
test_h1_col_norms_max: 2.54604840279
test_h1_col_norms_mean: 1.63219892979
test_h1_col_norms_min: 0.7294241786
test_h1_row_norms_max: 3.26415896416
test_h1_row_norms_mean: 2.32143187523
test_h1_row_norms_min: 1.33722984791
test_objective: 0.282132327557
test_term_0: 0.0540966317058
test_term_1_weight_decay: 0.228035539389
test_y_col_norms_max: 3.75175428391
test_y_col_norms_mean: 3.44715094566
test_y_col_norms_min: 2.93773794174
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.989186286926
test_y_min_max_class: 0.657333433628
test_y_misclass: 0.0153999971226
test_y_nll: 0.0540966317058
test_y_row_norms_max: 0.922168970108
test_y_row_norms_mean: 0.314452946186
test_y_row_norms_min: 0.0141052464023
train_h0_col_norms_max: 2.7801964283
train_h0_col_norms_mean: 1.81280255318
train_h0_col_norms_min: 0.967675149441
train_h0_row_norms_max: 2.84348917007
train_h0_row_norms_mean: 1.41918671131
train_h0_row_norms_min: 0.0677684471011
train_h1_col_norms_max: 2.54606103897
train_h1_col_norms_mean: 1.63219988346
train_h1_col_norms_min: 0.729421555996
train_h1_row_norms_max: 3.26416134834
train_h1_row_norms_mean: 2.3214328289
train_h1_row_norms_min: 1.33723008633
train_objective: 0.230251327157
train_term_0: 0.00221558846533
train_term_1_weight_decay: 0.228034421802
train_y_col_norms_max: 3.75175452232
train_y_col_norms_mean: 3.44716668129
train_y_col_norms_min: 2.93775320053
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.997844338417
train_y_min_max_class: 0.929183900356
train_y_misclass: 0.0
train_y_nll: 0.00221558846533
train_y_row_norms_max: 0.922172784805
train_y_row_norms_mean: 0.314453363419
train_y_row_norms_min: 0.0141053134575
valid_h0_col_norms_max: 2.78018331528
valid_h0_col_norms_mean: 1.81279492378
valid_h0_col_norms_min: 0.967669785023
valid_h0_row_norms_max: 2.84347510338
valid_h0_row_norms_mean: 1.41918671131
valid_h0_row_norms_min: 0.0677683353424
valid_h1_col_norms_max: 2.54604840279
valid_h1_col_norms_mean: 1.63219892979
valid_h1_col_norms_min: 0.7294241786
valid_h1_row_norms_max: 3.26415896416
valid_h1_row_norms_mean: 2.32143187523
valid_h1_row_norms_min: 1.33722984791
valid_objective: 0.287775874138
valid_term_0: 0.0597402378917
valid_term_1_weight_decay: 0.228035539389
valid_y_col_norms_max: 3.75175428391
valid_y_col_norms_mean: 3.44715094566
valid_y_col_norms_min: 2.93773794174
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.989176094532
valid_y_min_max_class: 0.624788343906
valid_y_misclass: 0.0166999921203
valid_y_nll: 0.0597402378917
valid_y_row_norms_max: 0.922168970108
valid_y_row_norms_mean: 0.314452946186
valid_y_row_norms_min: 0.0141052464023
Time this epoch: 3.284030 seconds
Monitoring step:
Epochs seen: 27
Batches seen: 13500
Examples seen: 1350000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 2.65619587898
test_h0_col_norms_mean: 1.72903525829
test_h0_col_norms_min: 0.920009553432
test_h0_row_norms_max: 2.71527957916
test_h0_row_norms_mean: 1.3536605835
test_h0_row_norms_min: 0.0648870319128
test_h1_col_norms_max: 2.42550611496
test_h1_col_norms_mean: 1.55485773087
test_h1_col_norms_min: 0.694191396236
test_h1_row_norms_max: 3.11506414413
test_h1_row_norms_mean: 2.21147465706
test_h1_row_norms_min: 1.27136671543
test_objective: 0.261537849903
test_term_0: 0.0539344884455
test_term_1_weight_decay: 0.207603096962
test_y_col_norms_max: 3.72367501259
test_y_col_norms_mean: 3.41610884666
test_y_col_norms_min: 2.90950012207
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.98896753788
test_y_min_max_class: 0.650016546249
test_y_misclass: 0.0154999988154
test_y_nll: 0.0539344884455
test_y_row_norms_max: 0.928149938583
test_y_row_norms_mean: 0.310383200645
test_y_row_norms_min: 0.0134189818054
train_h0_col_norms_max: 2.6562101841
train_h0_col_norms_mean: 1.72902774811
train_h0_col_norms_min: 0.920005261898
train_h0_row_norms_max: 2.71527171135
train_h0_row_norms_mean: 1.35366332531
train_h0_row_norms_min: 0.0648870840669
train_h1_col_norms_max: 2.42551159859
train_h1_col_norms_mean: 1.5548504591
train_h1_col_norms_min: 0.694188058376
train_h1_row_norms_max: 3.1150598526
train_h1_row_norms_mean: 2.21146249771
train_h1_row_norms_min: 1.27136695385
train_objective: 0.20999661088
train_term_0: 0.002393146744
train_term_1_weight_decay: 0.207603022456
train_y_col_norms_max: 3.7236533165
train_y_col_norms_mean: 3.41609406471
train_y_col_norms_min: 2.90950846672
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.997686505318
train_y_min_max_class: 0.923012793064
train_y_misclass: 1.99999994948e-05
train_y_nll: 0.002393146744
train_y_row_norms_max: 0.928141772747
train_y_row_norms_mean: 0.310382217169
train_y_row_norms_min: 0.0134189641103
valid_h0_col_norms_max: 2.65619587898
valid_h0_col_norms_mean: 1.72903525829
valid_h0_col_norms_min: 0.920009553432
valid_h0_row_norms_max: 2.71527957916
valid_h0_row_norms_mean: 1.3536605835
valid_h0_row_norms_min: 0.0648870319128
valid_h1_col_norms_max: 2.42550611496
valid_h1_col_norms_mean: 1.55485773087
valid_h1_col_norms_min: 0.694191396236
valid_h1_row_norms_max: 3.11506414413
valid_h1_row_norms_mean: 2.21147465706
valid_h1_row_norms_min: 1.27136671543
valid_objective: 0.268219769001
valid_term_0: 0.0606164671481
valid_term_1_weight_decay: 0.207603096962
valid_y_col_norms_max: 3.72367501259
valid_y_col_norms_mean: 3.41610884666
valid_y_col_norms_min: 2.90950012207
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.989122092724
valid_y_min_max_class: 0.619553089142
valid_y_misclass: 0.0165999922901
valid_y_nll: 0.0606164671481
valid_y_row_norms_max: 0.928149938583
valid_y_row_norms_mean: 0.310383200645
valid_y_row_norms_min: 0.0134189818054
Time this epoch: 3.261071 seconds
Monitoring step:
Epochs seen: 28
Batches seen: 14000
Examples seen: 1400000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 2.53800392151
test_h0_col_norms_mean: 1.65007030964
test_h0_col_norms_min: 0.874696552753
test_h0_row_norms_max: 2.59223008156
test_h0_row_norms_mean: 1.29187560081
test_h0_row_norms_min: 0.0620772130787
test_h1_col_norms_max: 2.31092762947
test_h1_col_norms_mean: 1.48166322708
test_h1_col_norms_min: 0.660871863365
test_h1_row_norms_max: 2.96952915192
test_h1_row_norms_mean: 2.10743236542
test_h1_row_norms_min: 1.20874655247
test_objective: 0.244267836213
test_term_0: 0.0550341755152
test_term_1_weight_decay: 0.189233824611
test_y_col_norms_max: 3.7156329155
test_y_col_norms_mean: 3.3953909874
test_y_col_norms_min: 2.88775634766
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.988679349422
test_y_min_max_class: 0.66010850668
test_y_misclass: 0.015999995172
test_y_nll: 0.0550341755152
test_y_row_norms_max: 0.940686166286
test_y_row_norms_mean: 0.307195395231
test_y_row_norms_min: 0.0127685274929
train_h0_col_norms_max: 2.53800177574
train_h0_col_norms_mean: 1.65007722378
train_h0_col_norms_min: 0.874697685242
train_h0_row_norms_max: 2.59221696854
train_h0_row_norms_mean: 1.29187226295
train_h0_row_norms_min: 0.0620768405497
train_h1_col_norms_max: 2.3109228611
train_h1_col_norms_mean: 1.48165631294
train_h1_col_norms_min: 0.660868704319
train_h1_row_norms_max: 2.96951341629
train_h1_row_norms_mean: 2.10743737221
train_h1_row_norms_min: 1.208745718
train_objective: 0.19228720665
train_term_0: 0.00305363954976
train_term_1_weight_decay: 0.18923483789
train_y_col_norms_max: 3.71563386917
train_y_col_norms_mean: 3.39540982246
train_y_col_norms_min: 2.88775682449
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.99712729454
train_y_min_max_class: 0.899743020535
train_y_misclass: 9.99999974738e-05
train_y_nll: 0.00305363954976
train_y_row_norms_max: 0.940682113171
train_y_row_norms_mean: 0.307196319103
train_y_row_norms_min: 0.0127684678882
valid_h0_col_norms_max: 2.53800392151
valid_h0_col_norms_mean: 1.65007030964
valid_h0_col_norms_min: 0.874696552753
valid_h0_row_norms_max: 2.59223008156
valid_h0_row_norms_mean: 1.29187560081
valid_h0_row_norms_min: 0.0620772130787
valid_h1_col_norms_max: 2.31092762947
valid_h1_col_norms_mean: 1.48166322708
valid_h1_col_norms_min: 0.660871863365
valid_h1_row_norms_max: 2.96952915192
valid_h1_row_norms_mean: 2.10743236542
valid_h1_row_norms_min: 1.20874655247
valid_objective: 0.250370264053
valid_term_0: 0.0611366219819
valid_term_1_weight_decay: 0.189233824611
valid_y_col_norms_max: 3.7156329155
valid_y_col_norms_mean: 3.3953909874
valid_y_col_norms_min: 2.88775634766
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.98891800642
valid_y_min_max_class: 0.613278985023
valid_y_misclass: 0.0181999895722
valid_y_nll: 0.0611366219819
valid_y_row_norms_max: 0.940686166286
valid_y_row_norms_mean: 0.307195395231
valid_y_row_norms_min: 0.0127685274929
Time this epoch: 3.281839 seconds
Monitoring step:
Epochs seen: 29
Batches seen: 14500
Examples seen: 1450000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 2.42693471909
test_h0_col_norms_mean: 1.57674181461
test_h0_col_norms_min: 0.831614494324
test_h0_row_norms_max: 2.47894763947
test_h0_row_norms_mean: 1.23453938961
test_h0_row_norms_min: 0.0597868897021
test_h1_col_norms_max: 2.20548892021
test_h1_col_norms_mean: 1.41274940968
test_h1_col_norms_min: 0.629146695137
test_h1_row_norms_max: 2.83711600304
test_h1_row_norms_mean: 2.00946569443
test_h1_row_norms_min: 1.14921236038
test_objective: 0.231429338455
test_term_0: 0.0585358664393
test_term_1_weight_decay: 0.17289365828
test_y_col_norms_max: 3.72800803185
test_y_col_norms_mean: 3.39454960823
test_y_col_norms_min: 2.87621188164
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.988135576248
test_y_min_max_class: 0.641624808311
test_y_misclass: 0.0174999944866
test_y_nll: 0.0585358664393
test_y_row_norms_max: 0.94206482172
test_y_row_norms_mean: 0.305722147226
test_y_row_norms_min: 0.0122694317251
train_h0_col_norms_max: 2.42693591118
train_h0_col_norms_mean: 1.57673621178
train_h0_col_norms_min: 0.831611156464
train_h0_row_norms_max: 2.47895431519
train_h0_row_norms_mean: 1.23453593254
train_h0_row_norms_min: 0.0597870908678
train_h1_col_norms_max: 2.20549035072
train_h1_col_norms_mean: 1.41274940968
train_h1_col_norms_min: 0.629147231579
train_h1_row_norms_max: 2.83710312843
train_h1_row_norms_mean: 2.0094628334
train_h1_row_norms_min: 1.14921247959
train_objective: 0.176759794354
train_term_0: 0.00386637193151
train_term_1_weight_decay: 0.172893241048
train_y_col_norms_max: 3.72802448273
train_y_col_norms_mean: 3.39456868172
train_y_col_norms_min: 2.87620210648
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.996502935886
train_y_min_max_class: 0.867810547352
train_y_misclass: 0.000219999958063
train_y_nll: 0.00386637193151
train_y_row_norms_max: 0.942059278488
train_y_row_norms_mean: 0.305722266436
train_y_row_norms_min: 0.012269385159
valid_h0_col_norms_max: 2.42693471909
valid_h0_col_norms_mean: 1.57674181461
valid_h0_col_norms_min: 0.831614494324
valid_h0_row_norms_max: 2.47894763947
valid_h0_row_norms_mean: 1.23453938961
valid_h0_row_norms_min: 0.0597868897021
valid_h1_col_norms_max: 2.20548892021
valid_h1_col_norms_mean: 1.41274940968
valid_h1_col_norms_min: 0.629146695137
valid_h1_row_norms_max: 2.83711600304
valid_h1_row_norms_mean: 2.00946569443
valid_h1_row_norms_min: 1.14921236038
valid_objective: 0.233349367976
valid_term_0: 0.0604558549821
valid_term_1_weight_decay: 0.17289365828
valid_y_col_norms_max: 3.72800803185
valid_y_col_norms_mean: 3.39454960823
valid_y_col_norms_min: 2.87621188164
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.988323628902
valid_y_min_max_class: 0.623263895512
valid_y_misclass: 0.0180999934673
valid_y_nll: 0.0604558549821
valid_y_row_norms_max: 0.94206482172
valid_y_row_norms_mean: 0.305722147226
valid_y_row_norms_min: 0.0122694317251
Time this epoch: 3.305740 seconds
Monitoring step:
Epochs seen: 30
Batches seen: 15000
Examples seen: 1500000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 2.32734417915
test_h0_col_norms_mean: 1.50859749317
test_h0_col_norms_min: 0.790655136108
test_h0_row_norms_max: 2.3708486557
test_h0_row_norms_mean: 1.18130934238
test_h0_row_norms_min: 0.0573941357434
test_h1_col_norms_max: 2.10413718224
test_h1_col_norms_mean: 1.34777581692
test_h1_col_norms_min: 0.599398136139
test_h1_row_norms_max: 2.71429800987
test_h1_row_norms_mean: 1.91710066795
test_h1_row_norms_min: 1.09260857105
test_objective: 0.213433161378
test_term_0: 0.0551191605628
test_term_1_weight_decay: 0.158313959837
test_y_col_norms_max: 3.74245023727
test_y_col_norms_mean: 3.40365672112
test_y_col_norms_min: 2.87184524536
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.988063156605
test_y_min_max_class: 0.63681012392
test_y_misclass: 0.0161999966949
test_y_nll: 0.0551191605628
test_y_row_norms_max: 0.970815420151
test_y_row_norms_mean: 0.305047929287
test_y_row_norms_min: 0.0117727546021
train_h0_col_norms_max: 2.32733273506
train_h0_col_norms_mean: 1.50859475136
train_h0_col_norms_min: 0.790655434132
train_h0_row_norms_max: 2.37083745003
train_h0_row_norms_mean: 1.18130576611
train_h0_row_norms_min: 0.0573940649629
train_h1_col_norms_max: 2.10414910316
train_h1_col_norms_mean: 1.34777891636
train_h1_col_norms_min: 0.599396586418
train_h1_row_norms_max: 2.71428489685
train_h1_row_norms_mean: 1.91711127758
train_h1_row_norms_min: 1.09261226654
train_objective: 0.161947190762
train_term_0: 0.00363326980732
train_term_1_weight_decay: 0.158314481378
train_y_col_norms_max: 3.74245476723
train_y_col_norms_mean: 3.40366172791
train_y_col_norms_min: 2.87186050415
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.996790409088
train_y_min_max_class: 0.880268752575
train_y_misclass: 0.0002599999425
train_y_nll: 0.00363326980732
train_y_row_norms_max: 0.970812141895
train_y_row_norms_mean: 0.305047690868
train_y_row_norms_min: 0.0117728123441
valid_h0_col_norms_max: 2.32734417915
valid_h0_col_norms_mean: 1.50859749317
valid_h0_col_norms_min: 0.790655136108
valid_h0_row_norms_max: 2.3708486557
valid_h0_row_norms_mean: 1.18130934238
valid_h0_row_norms_min: 0.0573941357434
valid_h1_col_norms_max: 2.10413718224
valid_h1_col_norms_mean: 1.34777581692
valid_h1_col_norms_min: 0.599398136139
valid_h1_row_norms_max: 2.71429800987
valid_h1_row_norms_mean: 1.91710066795
valid_h1_row_norms_min: 1.09260857105
valid_objective: 0.221332803369
valid_term_0: 0.0630188435316
valid_term_1_weight_decay: 0.158313959837
valid_y_col_norms_max: 3.74245023727
valid_y_col_norms_mean: 3.40365672112
valid_y_col_norms_min: 2.87184524536
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.988815426826
valid_y_min_max_class: 0.629059970379
valid_y_misclass: 0.0181999914348
valid_y_nll: 0.0630188435316
valid_y_row_norms_max: 0.970815420151
valid_y_row_norms_mean: 0.305047929287
valid_y_row_norms_min: 0.0117727546021
Time this epoch: 3.277658 seconds
Monitoring step:
Epochs seen: 31
Batches seen: 15500
Examples seen: 1550000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 2.22986006737
test_h0_col_norms_mean: 1.44450759888
test_h0_col_norms_min: 0.751711428165
test_h0_row_norms_max: 2.27136349678
test_h0_row_norms_mean: 1.13126826286
test_h0_row_norms_min: 0.055449090898
test_h1_col_norms_max: 2.01123976707
test_h1_col_norms_mean: 1.28632330894
test_h1_col_norms_min: 0.570827186108
test_h1_row_norms_max: 2.59384894371
test_h1_row_norms_mean: 1.82977592945
test_h1_row_norms_min: 1.03879511356
test_objective: 0.199405178428
test_term_0: 0.0542121008039
test_term_1_weight_decay: 0.145193070173
test_y_col_norms_max: 3.75564265251
test_y_col_norms_mean: 3.41556763649
test_y_col_norms_min: 2.89411330223
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.98805475235
test_y_min_max_class: 0.651372611523
test_y_misclass: 0.0173999965191
test_y_nll: 0.0542121008039
test_y_row_norms_max: 0.984490036964
test_y_row_norms_mean: 0.30471482873
test_y_row_norms_min: 0.0115118613467
train_h0_col_norms_max: 2.22986245155
train_h0_col_norms_mean: 1.44451439381
train_h0_col_norms_min: 0.751711428165
train_h0_row_norms_max: 2.27136206627
train_h0_row_norms_mean: 1.13127017021
train_h0_row_norms_min: 0.0554490871727
train_h1_col_norms_max: 2.01124000549
train_h1_col_norms_mean: 1.28632640839
train_h1_col_norms_min: 0.570830464363
train_h1_row_norms_max: 2.59386348724
train_h1_row_norms_mean: 1.8297867775
train_h1_row_norms_min: 1.03879117966
train_objective: 0.148557990789
train_term_0: 0.00336487870663
train_term_1_weight_decay: 0.145192667842
train_y_col_norms_max: 3.75566291809
train_y_col_norms_mean: 3.41558241844
train_y_col_norms_min: 2.89412260056
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.996838271618
train_y_min_max_class: 0.885646343231
train_y_misclass: 7.9999997979e-05
train_y_nll: 0.00336487870663
train_y_row_norms_max: 0.98448997736
train_y_row_norms_mean: 0.304714143276
train_y_row_norms_min: 0.0115119209513
valid_h0_col_norms_max: 2.22986006737
valid_h0_col_norms_mean: 1.44450759888
valid_h0_col_norms_min: 0.751711428165
valid_h0_row_norms_max: 2.27136349678
valid_h0_row_norms_mean: 1.13126826286
valid_h0_row_norms_min: 0.055449090898
valid_h1_col_norms_max: 2.01123976707
valid_h1_col_norms_mean: 1.28632330894
valid_h1_col_norms_min: 0.570827186108
valid_h1_row_norms_max: 2.59384894371
valid_h1_row_norms_mean: 1.82977592945
valid_h1_row_norms_min: 1.03879511356
valid_objective: 0.204451009631
valid_term_0: 0.0592579171062
valid_term_1_weight_decay: 0.145193070173
valid_y_col_norms_max: 3.75564265251
valid_y_col_norms_mean: 3.41556763649
valid_y_col_norms_min: 2.89411330223
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.989104747772
valid_y_min_max_class: 0.617950022221
valid_y_misclass: 0.0168999936432
valid_y_nll: 0.0592579171062
valid_y_row_norms_max: 0.984490036964
valid_y_row_norms_mean: 0.30471482873
valid_y_row_norms_min: 0.0115118613467
Time this epoch: 3.299106 seconds
Monitoring step:
Epochs seen: 32
Batches seen: 16000
Examples seen: 1600000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 2.14349389076
test_h0_col_norms_mean: 1.38710844517
test_h0_col_norms_min: 0.714687824249
test_h0_row_norms_max: 2.18458104134
test_h0_row_norms_mean: 1.08642613888
test_h0_row_norms_min: 0.0531989820302
test_h1_col_norms_max: 1.92048859596
test_h1_col_norms_mean: 1.22870218754
test_h1_col_norms_min: 0.54377913475
test_h1_row_norms_max: 2.48094320297
test_h1_row_norms_mean: 1.74789762497
test_h1_row_norms_min: 0.987630963326
test_objective: 0.198874086142
test_term_0: 0.0651714801788
test_term_1_weight_decay: 0.13370269537
test_y_col_norms_max: 3.78167033195
test_y_col_norms_mean: 3.43723106384
test_y_col_norms_min: 2.910176754
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.986017048359
test_y_min_max_class: 0.594191014767
test_y_misclass: 0.0199999921024
test_y_nll: 0.0651714801788
test_y_row_norms_max: 0.997191548347
test_y_row_norms_mean: 0.305199384689
test_y_row_norms_min: 0.0112372441217
train_h0_col_norms_max: 2.14350128174
train_h0_col_norms_mean: 1.38711500168
train_h0_col_norms_min: 0.714688956738
train_h0_row_norms_max: 2.18457770348
train_h0_row_norms_mean: 1.08642041683
train_h0_row_norms_min: 0.0531990006566
train_h1_col_norms_max: 1.92047715187
train_h1_col_norms_mean: 1.22869598866
train_h1_col_norms_min: 0.543776392937
train_h1_row_norms_max: 2.4809448719
train_h1_row_norms_mean: 1.74790513515
train_h1_row_norms_min: 0.987626254559
train_objective: 0.142097592354
train_term_0: 0.00839497055858
train_term_1_weight_decay: 0.133703291416
train_y_col_norms_max: 3.78167462349
train_y_col_norms_mean: 3.43724656105
train_y_col_norms_min: 2.91016840935
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.993861615658
train_y_min_max_class: 0.764626443386
train_y_misclass: 0.00168000021949
train_y_nll: 0.00839497055858
train_y_row_norms_max: 0.997187256813
train_y_row_norms_mean: 0.305199593306
train_y_row_norms_min: 0.0112373000011
valid_h0_col_norms_max: 2.14349389076
valid_h0_col_norms_mean: 1.38710844517
valid_h0_col_norms_min: 0.714687824249
valid_h0_row_norms_max: 2.18458104134
valid_h0_row_norms_mean: 1.08642613888
valid_h0_row_norms_min: 0.0531989820302
valid_h1_col_norms_max: 1.92048859596
valid_h1_col_norms_mean: 1.22870218754
valid_h1_col_norms_min: 0.54377913475
valid_h1_row_norms_max: 2.48094320297
valid_h1_row_norms_mean: 1.74789762497
valid_h1_row_norms_min: 0.987630963326
valid_objective: 0.206293180585
valid_term_0: 0.0725905746222
valid_term_1_weight_decay: 0.13370269537
valid_y_col_norms_max: 3.78167033195
valid_y_col_norms_mean: 3.43723106384
valid_y_col_norms_min: 2.910176754
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.98618721962
valid_y_min_max_class: 0.584758162498
valid_y_misclass: 0.0210999920964
valid_y_nll: 0.0725905746222
valid_y_row_norms_max: 0.997191548347
valid_y_row_norms_mean: 0.305199384689
valid_y_row_norms_min: 0.0112372441217
Time this epoch: 3.316685 seconds
Monitoring step:
Epochs seen: 33
Batches seen: 16500
Examples seen: 1650000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 2.07966470718
test_h0_col_norms_mean: 1.34304857254
test_h0_col_norms_min: 0.67948693037
test_h0_row_norms_max: 2.12437868118
test_h0_row_norms_mean: 1.05228435993
test_h0_row_norms_min: 0.0516484305263
test_h1_col_norms_max: 1.84021937847
test_h1_col_norms_mean: 1.17601656914
test_h1_col_norms_min: 0.518966257572
test_h1_row_norms_max: 2.38222265244
test_h1_row_norms_mean: 1.67307877541
test_h1_row_norms_min: 0.938986539841
test_objective: 0.1945425421
test_term_0: 0.0701323673129
test_term_1_weight_decay: 0.124409988523
test_y_col_norms_max: 3.7988409996
test_y_col_norms_mean: 3.48081755638
test_y_col_norms_min: 2.99444794655
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.985891222954
test_y_min_max_class: 0.610743761063
test_y_misclass: 0.021299995482
test_y_nll: 0.0701323673129
test_y_row_norms_max: 1.03870010376
test_y_row_norms_mean: 0.307827204466
test_y_row_norms_min: 0.0110923619941
train_h0_col_norms_max: 2.07966327667
train_h0_col_norms_mean: 1.34305346012
train_h0_col_norms_min: 0.679490327835
train_h0_row_norms_max: 2.12437534332
train_h0_row_norms_mean: 1.05228734016
train_h0_row_norms_min: 0.0516487248242
train_h1_col_norms_max: 1.84022164345
train_h1_col_norms_mean: 1.17601895332
train_h1_col_norms_min: 0.518965959549
train_h1_row_norms_max: 2.3822286129
train_h1_row_norms_mean: 1.67308568954
train_h1_row_norms_min: 0.93898332119
train_objective: 0.135569825768
train_term_0: 0.0111596826464
train_term_1_weight_decay: 0.124409854412
train_y_col_norms_max: 3.79884195328
train_y_col_norms_mean: 3.48080062866
train_y_col_norms_min: 2.99444794655
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.992844820023
train_y_min_max_class: 0.72961461544
train_y_misclass: 0.00289999856614
train_y_nll: 0.0111596826464
train_y_row_norms_max: 1.03869926929
train_y_row_norms_mean: 0.307827889919
train_y_row_norms_min: 0.0110923871398
valid_h0_col_norms_max: 2.07966470718
valid_h0_col_norms_mean: 1.34304857254
valid_h0_col_norms_min: 0.67948693037
valid_h0_row_norms_max: 2.12437868118
valid_h0_row_norms_mean: 1.05228435993
valid_h0_row_norms_min: 0.0516484305263
valid_h1_col_norms_max: 1.84021937847
valid_h1_col_norms_mean: 1.17601656914
valid_h1_col_norms_min: 0.518966257572
valid_h1_row_norms_max: 2.38222265244
valid_h1_row_norms_mean: 1.67307877541
valid_h1_row_norms_min: 0.938986539841
valid_objective: 0.197794348001
valid_term_0: 0.0733841732144
valid_term_1_weight_decay: 0.124409988523
valid_y_col_norms_max: 3.7988409996
valid_y_col_norms_mean: 3.48081755638
valid_y_col_norms_min: 2.99444794655
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.986917078495
valid_y_min_max_class: 0.61943089962
valid_y_misclass: 0.0203999932855
valid_y_nll: 0.0733841732144
valid_y_row_norms_max: 1.03870010376
valid_y_row_norms_mean: 0.307827204466
valid_y_row_norms_min: 0.0110923619941
Time this epoch: 3.390813 seconds
Monitoring step:
Epochs seen: 34
Batches seen: 17000
Examples seen: 1700000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 2.09944272041
test_h0_col_norms_mean: 1.31166350842
test_h0_col_norms_min: 0.646019160748
test_h0_row_norms_max: 2.08779644966
test_h0_row_norms_mean: 1.02819681168
test_h0_row_norms_min: 0.0544924363494
test_h1_col_norms_max: 1.76557374001
test_h1_col_norms_mean: 1.12808454037
test_h1_col_norms_min: 0.495299696922
test_h1_row_norms_max: 2.29631304741
test_h1_row_norms_mean: 1.60503029823
test_h1_row_norms_min: 0.892738819122
test_objective: 0.189725786448
test_term_0: 0.0726886093616
test_term_1_weight_decay: 0.117037259042
test_y_col_norms_max: 3.87525558472
test_y_col_norms_mean: 3.52211046219
test_y_col_norms_min: 3.00069046021
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.983940660954
test_y_min_max_class: 0.607474207878
test_y_misclass: 0.0210999920964
test_y_nll: 0.0726886093616
test_y_row_norms_max: 1.04711163044
test_y_row_norms_mean: 0.310424894094
test_y_row_norms_min: 0.0110520040616
train_h0_col_norms_max: 2.09944820404
train_h0_col_norms_mean: 1.31166100502
train_h0_col_norms_min: 0.646022617817
train_h0_row_norms_max: 2.08778810501
train_h0_row_norms_mean: 1.02820193768
train_h0_row_norms_min: 0.0544921904802
train_h1_col_norms_max: 1.76556527615
train_h1_col_norms_mean: 1.12807917595
train_h1_col_norms_min: 0.495299696922
train_h1_row_norms_max: 2.29631876945
train_h1_row_norms_mean: 1.60503292084
train_h1_row_norms_min: 0.892735242844
train_objective: 0.138252094388
train_term_0: 0.0212149638683
train_term_1_weight_decay: 0.117037393153
train_y_col_norms_max: 3.87525558472
train_y_col_norms_mean: 3.5221259594
train_y_col_norms_min: 3.00070405006
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.989364624023
train_y_min_max_class: 0.659551143646
train_y_misclass: 0.00660000368953
train_y_nll: 0.0212149638683
train_y_row_norms_max: 1.04711127281
train_y_row_norms_mean: 0.310425490141
train_y_row_norms_min: 0.0110520040616
valid_h0_col_norms_max: 2.09944272041
valid_h0_col_norms_mean: 1.31166350842
valid_h0_col_norms_min: 0.646019160748
valid_h0_row_norms_max: 2.08779644966
valid_h0_row_norms_mean: 1.02819681168
valid_h0_row_norms_min: 0.0544924363494
valid_h1_col_norms_max: 1.76557374001
valid_h1_col_norms_mean: 1.12808454037
valid_h1_col_norms_min: 0.495299696922
valid_h1_row_norms_max: 2.29631304741
valid_h1_row_norms_mean: 1.60503029823
valid_h1_row_norms_min: 0.892738819122
valid_objective: 0.204115614295
valid_term_0: 0.0870784968138
valid_term_1_weight_decay: 0.117037259042
valid_y_col_norms_max: 3.87525558472
valid_y_col_norms_mean: 3.52211046219
valid_y_col_norms_min: 3.00069046021
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.984562575817
valid_y_min_max_class: 0.597014904022
valid_y_misclass: 0.0247999858111
valid_y_nll: 0.0870784968138
valid_y_row_norms_max: 1.04711163044
valid_y_row_norms_mean: 0.310424894094
valid_y_row_norms_min: 0.0110520040616
Time this epoch: 3.325348 seconds
Monitoring step:
Epochs seen: 35
Batches seen: 17500
Examples seen: 1750000
learning_rate: 0.00999999046326
momentum: 0.989998817444
test_h0_col_norms_max: 2.33471369743
test_h0_col_norms_mean: 1.30764365196
test_h0_col_norms_min: 0.614201545715
test_h0_row_norms_max: 2.11369776726
test_h0_row_norms_mean: 1.02605807781
test_h0_row_norms_min: 0.0789580345154
test_h1_col_norms_max: 1.72045576572
test_h1_col_norms_mean: 1.08744347095
test_h1_col_norms_min: 0.471859395504
test_h1_row_norms_max: 2.22104668617
test_h1_row_norms_mean: 1.54732775688
test_h1_row_norms_min: 0.848768413067
test_objective: 0.186729609966
test_term_0: 0.0738602727652
test_term_1_weight_decay: 0.112869426608
test_y_col_norms_max: 3.81233644485
test_y_col_norms_mean: 3.53644061089
test_y_col_norms_min: 3.07366251945
test_y_max_max_class: 0.999999344349
test_y_mean_max_class: 0.982964873314
test_y_min_max_class: 0.570746660233
test_y_misclass: 0.0224999897182
test_y_nll: 0.0738602727652
test_y_row_norms_max: 1.04587638378
test_y_row_norms_mean: 0.311368614435
test_y_row_norms_min: 0.0108088394627
train_h0_col_norms_max: 2.33471369743
train_h0_col_norms_mean: 1.30764782429
train_h0_col_norms_min: 0.614198505878
train_h0_row_norms_max: 2.11369967461
train_h0_row_norms_mean: 1.02606165409
train_h0_row_norms_min: 0.0789578035474
train_h1_col_norms_max: 1.72044575214
train_h1_col_norms_mean: 1.08744776249
train_h1_col_norms_min: 0.471859931946
train_h1_row_norms_max: 2.2210419178
train_h1_row_norms_mean: 1.54733288288
train_h1_row_norms_min: 0.848769664764
train_objective: 0.133081272244
train_term_0: 0.020211936906
train_term_1_weight_decay: 0.112869039178
train_y_col_norms_max: 3.81231951714
train_y_col_norms_mean: 3.53645634651
train_y_col_norms_min: 3.07366323471
train_y_max_max_class: 0.999994218349
train_y_mean_max_class: 0.989763617516
train_y_min_max_class: 0.656112134457
train_y_misclass: 0.00610000034794
train_y_nll: 0.020211936906
train_y_row_norms_max: 1.04588091373
train_y_row_norms_mean: 0.311368972063
train_y_row_norms_min: 0.0108088953421
valid_h0_col_norms_max: 2.33471369743
valid_h0_col_norms_mean: 1.30764365196
valid_h0_col_norms_min: 0.614201545715
valid_h0_row_norms_max: 2.11369776726
valid_h0_row_norms_mean: 1.02605807781
valid_h0_row_norms_min: 0.0789580345154
valid_h1_col_norms_max: 1.72045576572
valid_h1_col_norms_mean: 1.08744347095
valid_h1_col_norms_min: 0.471859395504
valid_h1_row_norms_max: 2.22104668617
valid_h1_row_norms_mean: 1.54732775688
valid_h1_row_norms_min: 0.848768413067
valid_objective: 0.191735550761
valid_term_0: 0.0788661986589
valid_term_1_weight_decay: 0.112869426608
valid_y_col_norms_max: 3.81233644485
valid_y_col_norms_mean: 3.53644061089
valid_y_col_norms_min: 3.07366251945
valid_y_max_max_class: 0.999999344349
valid_y_mean_max_class: 0.985468804836
valid_y_min_max_class: 0.593561589718
valid_y_misclass: 0.0224999915808
valid_y_nll: 0.0788661986589
valid_y_row_norms_max: 1.04587638378
valid_y_row_norms_mean: 0.311368614435
valid_y_row_norms_min: 0.0108088394627
In [11]:
!print_monitor.py mlp_3_best.pkl | grep test_y_misclass
Using gpu device 2: GeForce GTX 285
/u/goodfeli/pylearn2/models/mlp.py:36: UserWarning: MLP changing the recursion limit.
warnings.warn("MLP changing the recursion limit.")
test_y_misclass : 0.0153999980539
Using a simple form of regularization thus brought the test error rate for this MLP down from 1.75% to 1.54%.
You can find more information on MLPs from the following sources:
LISA lab's Deep Learning Tutorials: Multilayer Perception
This is by no means a complete list.
Content source: alexjc/pylearn2
Similar notebooks: