learning CNTK
AlexNet_demo.py : error = 10.88%
Alex_jitter.py : input add jitter error = 11.48%
NIN_test.py: error = 36.48%
NIN_test2.py: change some parameters, error = 28.93%
NIN_test3.py: change some parameters, error = 27.98%
NIN_test4.py: Using AveragePooling without dense layer , error = 25.43%
NIN_test5.py: Using AveragePooling with dense layer, error = 21.20%
NIN_test6.py: remove one 1 * 1 layer, error = 23.07%
vgg_test1.py: adjusted vgg13, failed to converge
vgg_test2.py: fixed the bug, reduce learning rate, error = 13.63%
vgg_test3.py: adjusted vgg16
- max_epoch = 40, with bathnormalization, error = 12.87%
- max_epoch = 40, without batchnormalization, error = 27.93%
- max_epoch = 80, with batchnormalization, error = 8.83%
vgg_test4.py: adjusted vgg19, with batchnormalization, error = 8.48%
Inceptionv1_test1.py
- max_epoch = 80, lr = AlexNet , error = 32.28%
- max_epoch = 160, lr = 0.01, every two epoch , lr = lr * 0.94, error = 18.97%
Inceptionv2_test1.py : adjusted InceptionV2, error = 20.54%
Inceptionv3_test1.py : adjusted InceptionV3, error = 7.55%, model size = 63M
Inceptionv3_test3.py : changed some parameters, error = 7.37%, model size = 8M
Inceptionv4_test1.py : adjusted InceptionV4, error = 27.13%, model size = 15M
Inception_resNet_v1_test1.py : error = 17.94%, model size = 26M
Inception_ResNet_v2_test1.py : error = 20.31%, model size = 18M
BN_Inception_test1.py :add an inception block after 4b , error = 5.86%
BN_Inception_test2.py : change some parameters, error = 5.58%
BN_Inception_test3.py : add a 3x3 conv, slower, error = 11.52%
BN_Inception_test4.py :change 3x3 filters numbers, error = 18.01%
BN_Inception_test5.py : try to form a structure like vgg, no effect, error = 17.00%
resnet_test1.py, original shortcut connections
- resnet20, error = 8.60%, model size = 1M
- resnet110, error = 6.50%, model size = 7M
- failed to train resnet160
resnet_test2.py, constant scaling connections
- resnet20, error = 8.86%, model size = 1M
resnet_test3.py, exclusive gating connections
- resnet20, error = 8.34%, model size = 1M
- resnet110, fail
resnet_test4.py, shortcut-only connections
- resnet20, error = 8.68%, model size = 1M
- resnet110, error = 15.42%
resnet_test5.py, conv short cut
- resnet20, error = 8.97%, model size = 1M
resnet_test6.py, dropout shortcut
- resnet20, error = 10.16%, model size = 1M
mobilenet_test1.py : adjusted mobilenet, error = 69.66%, model size = 0.1M
mobilenet_test2.py : adjusted mobilenet, error = 66.40%, model size = 0.1M
mobilnet_test3.py : change some parameters, error = 70.53%
mobilnet_test5.py : using group convolution , error = 11.39%, model size = 17M
UPD 2018.10.1 It seems to have some mistakes.
UPD 2018.10.1 The code is wrong.
UPD 2018.10.2
mobilenet_test4.py : modified the depwise convolution, error = 61.84%
It still seems to have bug.
UPD 2018.10 17
using group convolution
mobilenetv2_test1.bs & mobilenetv2_test1.cntk : resize CIFAR 224 x 224 , error = 12.23% , model size = 15M
mobilenet_test1.bs : refined mobilenet v1 for CIFAR-10 , error = 13%, model size = 6M
mobilenet_test2.bs : changed some parameters , error = 12.3%, model size = 6.6M
mobilenet_test3.bs : for resized CIFAR-10 224 x 224, error = 10.67%, model size = 12M
mobilenetv2_test1.bs : for resized CIFAR-10 224 x 224 , error = 12.38%, model size = 91M