TY - GEN
T1 - CrescendoNet
T2 - 17th IEEE International Conference on Machine Learning and Applications, ICMLA 2018
AU - Zhang, Xiang
AU - Vishwamitra, Nishant
AU - Hu, Hongxin
AU - Luo, Feng
N1 - Publisher Copyright:
© 2018 IEEE.
PY - 2018/7/2
Y1 - 2018/7/2
N2 - We introduce a new deep convolutional neural network, CrescendoNet, by stacking simple building blocks without residual connections. Each Crescendo block contains independent convolution paths with increased depths. The numbers of convolution layers and parameters are only increased linearly in Crescendo blocks. In experiments, CrescendoNet with only 15 layers outperforms almost all networks without residual connections on benchmark datasets, CIFAR10, CIFAR100, and SVHN. Given sufficient amount of data as in SVHN dataset, CrescendoNet with 15 layers and 4.1M parameters can match the performance of DenseNet-BC with 250 layers and 15.3M parameters. CrescendoNet provides a new way to construct high performance deep convolutional neural networks with simple network architecture. Moreover, by investigating a various combination of subnetworks in CrescendoNet, we note that the high performance of CrescendoNet may come from its implicit ensemble behavior, which gives CrescendoNet an anytime classification property. Furthermore, the independence between paths in CrescendoNet allows us to introduce a new path-wise training procedure, which can reduce the memory needed for training.
AB - We introduce a new deep convolutional neural network, CrescendoNet, by stacking simple building blocks without residual connections. Each Crescendo block contains independent convolution paths with increased depths. The numbers of convolution layers and parameters are only increased linearly in Crescendo blocks. In experiments, CrescendoNet with only 15 layers outperforms almost all networks without residual connections on benchmark datasets, CIFAR10, CIFAR100, and SVHN. Given sufficient amount of data as in SVHN dataset, CrescendoNet with 15 layers and 4.1M parameters can match the performance of DenseNet-BC with 250 layers and 15.3M parameters. CrescendoNet provides a new way to construct high performance deep convolutional neural networks with simple network architecture. Moreover, by investigating a various combination of subnetworks in CrescendoNet, we note that the high performance of CrescendoNet may come from its implicit ensemble behavior, which gives CrescendoNet an anytime classification property. Furthermore, the independence between paths in CrescendoNet allows us to introduce a new path-wise training procedure, which can reduce the memory needed for training.
KW - Convolutional neural networks
KW - Deep learning
KW - Ensemble
UR - https://www.scopus.com/pages/publications/85062233084
U2 - 10.1109/ICMLA.2018.00053
DO - 10.1109/ICMLA.2018.00053
M3 - Conference contribution
AN - SCOPUS:85062233084
T3 - Proceedings - 17th IEEE International Conference on Machine Learning and Applications, ICMLA 2018
SP - 311
EP - 317
BT - Proceedings - 17th IEEE International Conference on Machine Learning and Applications, ICMLA 2018
A2 - Wani, M. Arif
A2 - Kantardzic, Mehmed
A2 - Sayed-Mouchaweh, Moamar
A2 - Gama, Joao
A2 - Lughofer, Edwin
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 17 December 2018 through 20 December 2018
ER -