Data Augmentation Based on Multiscale Radon Transform for Seven Segment Display Recognition

Abstract

To alleviate the problem of limited data in creating rotation, scale, perspective, and illumination invariant of the neural network training sets, the multiscale Radon transform is proposed in this study to enhance the data augmentation for seven segment display recognition. Resizing, smoothing, and coefficient shifting generate the desired invariant effects for the training model. The accuracy rates from the experiment demonstrate the superiority of the proposed method over other data augmentation techniques with the best overall accuracy performance of 87.05% – outperforming other data augmentation techniques by 6-13%. The convolutional neural network model generated from the proposed multiscale Radon transform data augmentation is suitable for seven segment display recognition and could become beneficial to other type of self-luminous type of images.

The code with our dataset is free to use for academic researches. All publications which use our resource should acknowledge and reference to

S. Popayorm, T. Titijaroonroj, T. Phoka and W. Massagram, “Data Augmentation Based on Multiscale Radon Transform for Seven Segment Display Recognition,” 2020 12th International Conference on Knowledge and Smart Technology (KST), Pattaya, Chonburi, Thailand, 2020, pp. 47-51.

It can be downloaded below.

Download

Please contact Dr. Taravichet Titijaroonroj (taravichet@it.kmitl.ac.th) if you have any questions or comments about this code and/or dataset.