Printed Thai Character Recognition Using Standard Descriptor

Abstract

The various font-types, font-sizes, and font-styles have a great impact on recognition performance of optical character recognition (OCR) systems. This becomes a grand challenge for recognition improvement. In order to enhance the performance, this paper proposes the printed Thai character recognition using a standard descriptor. The descriptor construction consists of two principal phases—preprocessing and feature extraction. In the former phase, the preprocessing provides a standard form for each character image. In the latter phase, the singular value decomposition (SVD) is applied to all font-type, fontsize, and font-style character images to extract features. Then the standard descriptor is constructed from the suitable order selection of the SVD feature decomposition. Finally, the projection matrix technique is applied to the recognition phase in order to measure the cosine similarity between the standard descriptor and test set. The experimental results show that the proposed method achieves a high recognition rate and is invariant to font-types, font-sizes, and font-styles.

Ref:http://link.springer.com/chapter/10.1007%2F978-3-642-37371-8_20