Large Margin In Softmax Cross-Entropy Loss
Takumi Kobayashi (National Institute of Advanced Industrial Science and Technology)

Abstract

Deep convolutional neural networks (CNNs) are trained mostly with the softmax cross-entropy loss and achieve promising performance on various image classification tasks. While much research effort has been devoted to improving the building blocks of CNNs, the classification margin embedded in the loss has attracted less attention for optimizing CNNs, in contrast to kernel-based methods such as SVMs. In this paper, we propose a novel method to induce a large-margin CNN that improves classification performance. By analyzing the formulation of the softmax loss, we clarify the margin embedded in the loss as well as its connection to the distribution of softmax logits. Based on this analysis, the proposed method is formulated as a regularization imposed on the logits to induce a large-margin classifier in a form compatible with the softmax loss. Experimental results on image classification with various CNNs demonstrate that the proposed method favorably improves performance compared to other large-margin losses.
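For illustration, the following PyTorch sketch shows one generic way to combine the softmax cross-entropy loss with a regularizer on the logits that encourages a larger margin. This is a hypothetical example, not the paper's exact formulation: the hinge-style penalty on the gap between the target logit and the largest non-target logit, the margin constant 1.0, and the weight lam are assumptions made for the sketch.

import torch
import torch.nn.functional as F

def large_margin_softmax_loss(logits, targets, lam=0.1):
    """Softmax cross-entropy plus a logit-gap regularizer.

    Hypothetical sketch, NOT the paper's exact regularizer: it penalizes
    samples whose target logit does not exceed the largest non-target
    logit by at least 1.0, which is one generic way to induce a margin
    in a form compatible with the softmax loss.
    """
    # Standard softmax cross-entropy term
    ce = F.cross_entropy(logits, targets)

    # Target-class logit for each sample, shape (batch,)
    target_logit = logits.gather(1, targets.unsqueeze(1)).squeeze(1)

    # Largest non-target logit per sample
    masked = logits.clone()
    masked.scatter_(1, targets.unsqueeze(1), float('-inf'))
    max_other = masked.max(dim=1).values

    # Hinge penalty on the logit gap (small margin -> larger penalty)
    margin = target_logit - max_other
    reg = F.relu(1.0 - margin).mean()

    return ce + lam * reg

For example, with logits = torch.randn(8, 10) and targets = torch.randint(0, 10, (8,)), the function returns a scalar loss that can be backpropagated exactly like the standard cross-entropy.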
DOI
10.5244/C.33.126
https://dx.doi.org/10.5244/C.33.126
BibTeX
@inproceedings{BMVC2019,
title={Large Margin In Softmax Cross-Entropy Loss},
author={Takumi Kobayashi},
year={2019},
month={September},
pages={126.1--126.12},
articleno={126},
numpages={12},
booktitle={Proceedings of the British Machine Vision Conference (BMVC)},
publisher={BMVA Press},
editor={Kirill Sidorov and Yulia Hicks},
doi={10.5244/C.33.126},
url={https://dx.doi.org/10.5244/C.33.126}
}