A segmentation system for touching handwritten Japanese characters

T. Yamaguchi1, S. Tsuruoka1, T. Yoshikawa1, T. Shinogi1, E. Makimoto2, H. Ogata2, M. Shridhar3
1Department of Electrical and Electronic Engineering, Faculty of Engineering, Mie University, Tsu, Mie, Japan
2Mechatronics Systems Division, Hitachi and Limited, Owari-Asahi, Aichi, Japan
3Department of Electrical and Computer Engineering, University of Michigan, Dearborn, Dearborn, MI, USA

Tóm tắt

The present character recognition system needs the segmentation process as preprocessing for an input image of the touching character string. The process divides it into some isolated characters. To improve the total performance of the character recognition system, it is required to lessen the number of retrieval path in the candidate character lattice. We proposed the segmentation method based on connecting condition of the neighbor lines to resolve these problems in the previous paper. In this paper, we modify, our segmentation algorithm, and we evaluate the performance of these segmentation methods by a direct evaluation system with permissible degree and an indirect evaluation using the character recognition. We confirm the usefulness of our new method for 456 touching handwritten Japanese "Kanji" image with 948 characters by the comparison of three segmentation methods. The correct segmentation rates are 61.4% (conventional method), 75.1% (previous method), and 83.0% (proposed method).

Từ khóa

#Image segmentation #Character recognition #Joining processes #Image analysis #Mechatronics #Data preprocessing #Lattices #Text analysis #Humans #Conferences

Tài liệu tham khảo

10.1016/0262-8856(87)90071-0 kimura, 1994, A lexicon directed algorithm for recognition of unconstrained handwritten words, IEICE Trans Inf &Syst, e77 d, 785 tsuruoka, 1983, Thinning algorithm for digital picture and their application to handprinted characters recognition, Trans of IEICE, j66 d, 525 10.1109/ICDAR.1993.395791 10.1109/ICDAR.2001.953905 ino, 1997, Handwritten address segmentation algorithm based on stroke information, Information Processing Society of Japan Trans, 38, 280 10.1109/ICDAR.2001.953902 10.1109/ICDAR.1993.395613 murase, 1986, Segmentation and recognition of handwritten character string using linguistic information, Trans of IEICE, j69 d, 1292 liu, 2001, Lexicon-driven handwritten character string recognition for japanese address reading, Proc of 6th International Conference on Document Analysis and Recognition (ICDAR 2001), 877 10.1117/12.410847