Applying GridSearchCV
GridSearchCV(cv=10, error_score='raise',
estimator=MLPClassifier(activation='relu', alpha=0.0001, batch_size='auto', beta_1=0.9,
beta_2=0.999, early_stopping=False, epsilon=1e-08,
hidden_layer_sizes=(100,), learning_rate='constant',
learning_rate_init=0.001, max_iter=10000, momentum=0.9,
nesterovs_momentum=True, power_t=0.5, random_state=0, shuffle=True,
solver='adam', tol=0.0001, validation_fraction=0.1, verbose=True,
warm_start=False),
fit_params={}, iid=True, n_jobs=5,
param_grid=[{'hidden_layer_sizes': [(4000, 4000), (10000, 10000)], 'solver': ['lbfgs', 'adam'], 'alpha': [1e-05, 1e-06, 1e-07]}],
pre_dispatch='2*n_jobs', refit=True, return_train_score=True,
scoring='accuracy', verbose=3)
Fitting 10 folds for each of 12 candidates, totalling 120 fits
[CV] alpha=1e-05, hidden_layer_sizes=(4000, 4000), solver=lbfgs ......
[CV] alpha=1e-05, hidden_layer_sizes=(4000, 4000), solver=lbfgs ......
[CV] alpha=1e-05, hidden_layer_sizes=(4000, 4000), solver=lbfgs ......
[CV] alpha=1e-05, hidden_layer_sizes=(4000, 4000), solver=lbfgs ......
[CV] alpha=1e-05, hidden_layer_sizes=(4000, 4000), solver=lbfgs ......
[CV] alpha=1e-05, hidden_layer_sizes=(4000, 4000), solver=lbfgs, score=0.552298, total=47.4min
[CV] alpha=1e-05, hidden_layer_sizes=(4000, 4000), solver=lbfgs ......
[CV] alpha=1e-05, hidden_layer_sizes=(4000, 4000), solver=lbfgs, score=0.529019, total=48.0min
[CV] alpha=1e-05, hidden_layer_sizes=(4000, 4000), solver=lbfgs ......
[CV] alpha=1e-05, hidden_layer_sizes=(4000, 4000), solver=lbfgs, score=0.548837, total=50.7min
[CV] alpha=1e-05, hidden_layer_sizes=(4000, 4000), solver=lbfgs ......
[CV] alpha=1e-05, hidden_layer_sizes=(4000, 4000), solver=lbfgs, score=0.544910, total=72.7min
[CV] alpha=1e-05, hidden_layer_sizes=(4000, 4000), solver=lbfgs ......
[CV] alpha=1e-05, hidden_layer_sizes=(4000, 4000), solver=lbfgs, score=0.548430, total=28.2min
[CV] alpha=1e-05, hidden_layer_sizes=(4000, 4000), solver=lbfgs ......
[CV] alpha=1e-05, hidden_layer_sizes=(4000, 4000), solver=lbfgs, score=0.541750, total=58.1min
[CV] alpha=1e-05, hidden_layer_sizes=(4000, 4000), solver=adam .......
[CV] alpha=1e-05, hidden_layer_sizes=(4000, 4000), solver=lbfgs, score=0.566088, total=60.4min
[CV] alpha=1e-05, hidden_layer_sizes=(4000, 4000), solver=adam .......
[CV] alpha=1e-05, hidden_layer_sizes=(4000, 4000), solver=lbfgs, score=0.569713, total=62.0min
[CV] alpha=1e-05, hidden_layer_sizes=(4000, 4000), solver=adam .......
Iteration 1, loss = 2.45367549
Iteration 1, loss = 2.55806715
Iteration 1, loss = 2.40329087
Iteration 2, loss = 1.27681042
Iteration 2, loss = 1.28909332
Iteration 2, loss = 1.27608957
Iteration 3, loss = 1.07480692
Iteration 3, loss = 1.08779706
Iteration 3, loss = 1.06899125
Iteration 4, loss = 0.90373455
Iteration 4, loss = 0.90002090
Iteration 4, loss = 0.88626068
[CV] alpha=1e-05, hidden_layer_sizes=(4000, 4000), solver=lbfgs, score=0.550134, total=25.3min
[CV] alpha=1e-05, hidden_layer_sizes=(4000, 4000), solver=adam .......
Iteration 5, loss = 0.73185105
Iteration 5, loss = 0.73528887
Iteration 5, loss = 0.70786132
Iteration 1, loss = 2.35044718
Iteration 6, loss = 0.60034330
Iteration 6, loss = 0.58791025
Iteration 6, loss = 0.57947657
Iteration 2, loss = 1.28509445
[CV] alpha=1e-05, hidden_layer_sizes=(4000, 4000), solver=lbfgs, score=0.564784, total=137.3min
[CV] alpha=1e-05, hidden_layer_sizes=(4000, 4000), solver=adam .......
Iteration 7, loss = 0.43673650
Iteration 7, loss = 0.46782895
Iteration 7, loss = 0.44320832
Iteration 3, loss = 1.07362364
Iteration 1, loss = 2.51171166
Iteration 8, loss = 0.33185790
Iteration 8, loss = 0.35359105
Iteration 8, loss = 0.33748645
Iteration 4, loss = 0.90818882
Iteration 2, loss = 1.29798144
Iteration 9, loss = 0.26511041
Iteration 9, loss = 0.24792497
Iteration 9, loss = 0.24755836
Iteration 5, loss = 0.74124272
Iteration 3, loss = 1.08821590
Iteration 10, loss = 0.22646547
Iteration 10, loss = 0.20814078
Iteration 10, loss = 0.24830070
Iteration 6, loss = 0.55283975
Iteration 4, loss = 0.89425301
Iteration 11, loss = 0.23249461
Iteration 11, loss = 0.22614044
Iteration 11, loss = 0.23213833
Iteration 7, loss = 0.45030449
Iteration 5, loss = 0.71534155
Iteration 12, loss = 0.18049567
Iteration 12, loss = 0.19031600
Iteration 12, loss = 0.20163612
Iteration 8, loss = 0.38440937
Iteration 6, loss = 0.54462218
Iteration 13, loss = 0.18089851
Iteration 13, loss = 0.17944259
Iteration 13, loss = 0.17916289
Iteration 9, loss = 0.27928261
Iteration 7, loss = 0.43196994
Iteration 14, loss = 0.15945754
Iteration 14, loss = 0.17021985
Iteration 14, loss = 0.16076194
Iteration 10, loss = 0.22422981
Iteration 8, loss = 0.32770748
Iteration 15, loss = 0.17478265
Iteration 15, loss = 0.16885325
Iteration 15, loss = 0.17640926
Iteration 11, loss = 0.18407366
Iteration 9, loss = 0.26725923
Iteration 16, loss = 0.21668699
Iteration 16, loss = 0.14614159
Iteration 16, loss = 0.18001944
Iteration 12, loss = 0.16241629
Iteration 10, loss = 0.22509662
Iteration 17, loss = 0.14471037
Iteration 17, loss = 0.14261822
Iteration 17, loss = 0.11742494
Iteration 13, loss = 0.15987167
Iteration 11, loss = 0.23382807
Iteration 18, loss = 0.15040296
Iteration 18, loss = 0.15042264
Iteration 18, loss = 0.12265081
Iteration 14, loss = 0.16200298
Iteration 12, loss = 0.18065278
Iteration 19, loss = 0.12641182
Iteration 19, loss = 0.10519276
Iteration 19, loss = 0.17325939
Iteration 15, loss = 0.18307085
Iteration 13, loss = 0.19568438
Iteration 20, loss = 0.13744671
Iteration 20, loss = 0.09757935
Iteration 20, loss = 0.14874983
Training loss did not improve more than tol=0.000100 for two consecutive epochs. Stopping.
[CV] alpha=1e-05, hidden_layer_sizes=(4000, 4000), solver=adam, score=0.498337, total=61.4min
[CV] alpha=1e-05, hidden_layer_sizes=(4000, 4000), solver=adam .......
Iteration 16, loss = 0.15843668
Iteration 14, loss = 0.15953159
Iteration 21, loss = 0.10852285
Iteration 21, loss = 0.11245735
Iteration 1, loss = 2.57920656
Iteration 17, loss = 0.16520328
Iteration 15, loss = 0.17536104
Iteration 22, loss = 0.12078719
Iteration 22, loss = 0.13282511
Iteration 2, loss = 1.28971675
Iteration 18, loss = 0.17341410
Iteration 16, loss = 0.15937394
Iteration 23, loss = 0.13193125
Iteration 23, loss = 0.16368529
Training loss did not improve more than tol=0.000100 for two consecutive epochs. Stopping.
Iteration 3, loss = 1.09383899
[CV] alpha=1e-05, hidden_layer_sizes=(4000, 4000), solver=adam, score=0.518272, total=69.2min
[CV] alpha=1e-05, hidden_layer_sizes=(4000, 4000), solver=adam .......
Iteration 19, loss = 0.11712959
Iteration 17, loss = 0.13293729
Iteration 24, loss = 0.12629127
Training loss did not improve more than tol=0.000100 for two consecutive epochs. Stopping.
Iteration 4, loss = 0.89290114
[CV] alpha=1e-05, hidden_layer_sizes=(4000, 4000), solver=adam, score=0.548837, total=71.9min
[CV] alpha=1e-05, hidden_layer_sizes=(4000, 4000), solver=adam .......
Iteration 1, loss = 2.46507493
Iteration 20, loss = 0.09787932
Iteration 18, loss = 0.11947852
Iteration 5, loss = 0.73439112
Iteration 1, loss = 2.60517922
Iteration 2, loss = 1.30047551
Iteration 21, loss = 0.08777164
Iteration 19, loss = 0.14196052
Iteration 6, loss = 0.54941104
Iteration 2, loss = 1.28844010
Iteration 3, loss = 1.08924522
Iteration 22, loss = 0.10077082
Iteration 20, loss = 0.16089464
Iteration 3, loss = 1.07566189
Iteration 7, loss = 0.42673338
Iteration 4, loss = 0.90688817
Iteration 23, loss = 0.13433211
Iteration 21, loss = 0.15202479
Training loss did not improve more than tol=0.000100 for two consecutive epochs. Stopping.
[CV] alpha=1e-05, hidden_layer_sizes=(4000, 4000), solver=adam, score=0.509673, total=53.7min
[CV] alpha=1e-05, hidden_layer_sizes=(4000, 4000), solver=adam .......
Iteration 4, loss = 0.89743857
Iteration 8, loss = 0.33555804
Iteration 5, loss = 0.76347906
Iteration 24, loss = 0.17157544
Training loss did not improve more than tol=0.000100 for two consecutive epochs. Stopping.
[CV] alpha=1e-05, hidden_layer_sizes=(4000, 4000), solver=adam, score=0.528981, total=61.8min
[CV] alpha=1e-05, hidden_layer_sizes=(4000, 4000), solver=adam .......
Iteration 1, loss = 2.59685660
Iteration 5, loss = 0.72169954
Iteration 9, loss = 0.30552475
Iteration 6, loss = 0.57858769
Iteration 1, loss = 2.53743054
Iteration 2, loss = 1.29689216
Iteration 10, loss = 0.23396609
Iteration 6, loss = 0.55036025
Iteration 7, loss = 0.45747271
Iteration 2, loss = 1.27628879
Iteration 3, loss = 1.07887161
Iteration 11, loss = 0.19732654
Iteration 8, loss = 0.35842324
Iteration 7, loss = 0.44530091
Iteration 3, loss = 1.07081736
Iteration 4, loss = 0.89829101
Iteration 12, loss = 0.18758343
Iteration 8, loss = 0.34052841
Iteration 9, loss = 0.26892185
Iteration 4, loss = 0.89269666
Iteration 5, loss = 0.74964890
Iteration 13, loss = 0.14807711
Iteration 10, loss = 0.23602726
Iteration 9, loss = 0.31118949
Iteration 5, loss = 0.69475854
Iteration 6, loss = 0.56542023
Iteration 14, loss = 0.19781279
Iteration 11, loss = 0.19876806
Iteration 10, loss = 0.27157367
Iteration 6, loss = 0.54705022
Iteration 7, loss = 0.46004633
Iteration 15, loss = 0.22615660
Iteration 12, loss = 0.19228830
Iteration 11, loss = 0.20596504
Iteration 7, loss = 0.43310402
Iteration 8, loss = 0.35839723
Iteration 16, loss = 0.15452530
Training loss did not improve more than tol=0.000100 for two consecutive epochs. Stopping.
[CV] alpha=1e-05, hidden_layer_sizes=(4000, 4000), solver=adam, score=0.533022, total=40.9min
[CV] alpha=1e-05, hidden_layer_sizes=(10000, 10000), solver=lbfgs ....
Iteration 13, loss = 0.15358480
Iteration 12, loss = 0.17674314
Iteration 8, loss = 0.38078830
Iteration 9, loss = 0.26546172
Iteration 14, loss = 0.16062745
Iteration 13, loss = 0.16767307
Iteration 9, loss = 0.30904294
Iteration 10, loss = 0.23029203
[CV] alpha=1e-05, hidden_layer_sizes=(10000, 10000), solver=lbfgs ....
Iteration 15, loss = 0.16634873
Iteration 14, loss = 0.16678191
Iteration 10, loss = 0.23058053
Iteration 11, loss = 0.21076696
Iteration 16, loss = 0.14315158
Iteration 15, loss = 0.21734186
Iteration 11, loss = 0.19407601
[CV] alpha=1e-05, hidden_layer_sizes=(10000, 10000), solver=lbfgs ....
Iteration 12, loss = 0.21338632
Iteration 17, loss = 0.13505005
Iteration 16, loss = 0.21813641
Iteration 12, loss = 0.20276737
Iteration 13, loss = 0.21084919
Iteration 17, loss = 0.16323088
Iteration 18, loss = 0.13126977
Iteration 13, loss = 0.18653227
[CV] alpha=1e-05, hidden_layer_sizes=(10000, 10000), solver=lbfgs ....
Iteration 14, loss = 0.17061555
Iteration 18, loss = 0.13923319
Iteration 19, loss = 0.14386768
Iteration 14, loss = 0.14873926
Iteration 15, loss = 0.14101007
Iteration 19, loss = 0.12948430
Iteration 20, loss = 0.17335086
[CV] alpha=1e-05, hidden_layer_sizes=(10000, 10000), solver=lbfgs ....
Iteration 15, loss = 0.14243400
Iteration 16, loss = 0.12779293
Iteration 20, loss = 0.08658310
Iteration 21, loss = 0.15156287
Training loss did not improve more than tol=0.000100 for two consecutive epochs. Stopping.
[CV] alpha=1e-05, hidden_layer_sizes=(4000, 4000), solver=adam, score=0.535381, total=53.2min
[CV] alpha=1e-05, hidden_layer_sizes=(10000, 10000), solver=lbfgs ....
Iteration 16, loss = 0.16958481
[CV] alpha=1e-05, hidden_layer_sizes=(10000, 10000), solver=lbfgs ....
Iteration 17, loss = 0.16673244
Iteration 21, loss = 0.07573377
Iteration 17, loss = 0.15278038
Iteration 18, loss = 0.13248244
Iteration 22, loss = 0.11927752
Iteration 18, loss = 0.19543298
Training loss did not improve more than tol=0.000100 for two consecutive epochs. Stopping.
[CV] alpha=1e-05, hidden_layer_sizes=(4000, 4000), solver=adam, score=0.533422, total=45.7min
[CV] alpha=1e-05, hidden_layer_sizes=(10000, 10000), solver=lbfgs ....
Iteration 19, loss = 0.11563703
Iteration 23, loss = 0.19204418
Iteration 20, loss = 0.14195020
Iteration 24, loss = 0.17012589
Training loss did not improve more than tol=0.000100 for two consecutive epochs. Stopping.
[CV] alpha=1e-05, hidden_layer_sizes=(4000, 4000), solver=adam, score=0.537074, total=60.4min
[CV] alpha=1e-05, hidden_layer_sizes=(10000, 10000), solver=lbfgs ....
Iteration 21, loss = 0.13707383
Iteration 22, loss = 0.11802498
Training loss did not improve more than tol=0.000100 for two consecutive epochs. Stopping.
[CV] alpha=1e-05, hidden_layer_sizes=(4000, 4000), solver=adam, score=0.509018, total=53.6min
[CV] alpha=1e-05, hidden_layer_sizes=(10000, 10000), solver=lbfgs ....