Detection of Text Lines of Handwritten Arabic Manuscripts using Markov Decision Processes
DOI:
https://doi.org/10.9781/ijimai.2016.416Keywords:
Text Classification, Hidden Markov Models, Arabic Documents, ComponentsAbstract
In a character recognition systems, the segmentation phase is critical since the accuracy of the recognition depend strongly on it. In this paper we present an approach based on Markov Decision Processes to extract text lines from binary images of Arabic handwritten documents. The proposed approach detects the connected components belonging to the same line by making use of knowledge about features and arrangement of those components. The initial results show that the system is promising for extracting Arabic handwritten lines.Downloads
References
[1] Likforman-Sulem, L., Zahour, A., & Taconet, B. (2007). Text line segmentation of historical documents: a survey. International Journal of Document Analysis and Recognition (IJDAR), 9(2-4), 123-138.
[2] Li, Y., Zheng, Y., Doermann, D., & Jaeger, S. (2008). Script-independent text line segmentation in freestyle handwritten documents. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 30(8), 1313-1329
[3] Ouwayed, N., & Belaïd, A. (2012). A general approach for multi-oriented text line extraction of handwritten documents. International Journal on Document Analysis and Recognition (IJDAR), 15(4), 297-314
[4] Razak, Z., Zulkiflee, K., Idris, M. Y. I., Tamil, E. M., Noor, M. N. M., Salleh, R., ... & Yaacob, M. (2008). Off-line handwriting text line segmentation: A review. International journal of computer science and network security, 8(7), 12-20.
[5] Stamatopoulos, N., Gatos, B., Louloudis, G., Pal, U., & Alaei, A. (2013, August). Icdar 2013 handwriting segmentation contest. In Document Analysis and Recognition (ICDAR), 2013 12th International Conference on (pp. 1402-1406). IEEE
[6] Gatos, B., Stamatopoulos, N., & Louloudis, G. (2010, November). Icfhr 2010 handwriting segmentation contest. In Frontiers in handwriting recognition (icfhr), 2010 international conference on (pp. 737-742). IEEE.
[7] Bertsekas D. P., Dynamic Programming : Deterministic and Stochastic Models, Prentice-Hall, 1987.
[8] Puterman M., Markov Decision Processes : Discrete Stochastic Dynamic Programming, John Wiley & Sons, Inc., New York, USA, 1994.
[9] Bennasri, A., Zahour, A., & Taconet, B. (1999). Extraction des lignes d’un texte manuscrit arabe. In Vision interface (Vol. 99, pp. 42-48).
[10] Nicolaou, A., & Gatos, B. (2009, July). Handwritten text line segmentation by shredding text into its lines. In Document Analysis and Recognition, 2009. ICDAR’09. 10th International Conference on (pp. 626-630). IEEE
[11] Adiguzel, H.; Sahin, E.; Duygulu, P., “A Hybrid for Line Segmentation in Handwritten Documents,” Frontiers in Handwriting Recognition (ICFHR), 2012 International Conference on , vol., no., pp.503,508, 18-20 Sept. 2012
[12] Zahour, A., Likforman-Sulem, L., Boussalaa, W., & Taconet, B. (2007, September). Text line segmentation of historical arabic documents. In Document Analysis and Recognition, 2007. ICDAR 2007. Ninth International Conference on (Vol. 1, pp. 138-142). IEEE.
[13] Khandelwal, A., Choudhury, P., Sarkar, R., Basu, S., Nasipuri, M., & Das, N. (2009). Text line segmentation for unconstrained handwritten document images using neighborhood connected component analysis. In Pattern Recognition and Machine Intelligence (pp. 369-374). Springer Berlin Heidelberg
[14] Khayyat, M., Lam, L., Suen, C. Y., Yin, F., & Liu, C. L. (2012, March). Arabic handwritten text line extraction by applying an adaptive mask to morphological dilation. In Document Analysis Systems (DAS), 2012 10th IAPR International Workshop on (pp. 100-104). IEEE
[15] Shi, Z., Setlur, S., & Govindaraju, V. (2009, July). A steerable directional local profile technique for extraction of handwritten arabic text lines. In Document Analysis and Recognition, 2009. ICDAR’09. 10th International Conference on(pp. 176-180). IEEE.
[16] Kumar, J., Abd-Almageed, W., Kang, L., & Doermann, D. (2010, June). Handwritten Arabic text line segmentation using affinity propagation. InProceedings of the 9th IAPR International Workshop on Document Analysis Systems (pp. 135-142). ACM
[17] Kumar, J., Kang, L., Doermann, D., & Abd-Almageed, W. (2011, September). Segmentation of handwritten text lines in presence of touching components. In Document Analysis and Recognition (ICDAR), 2011 International Conference on (pp. 109-113). IEEE
[18] Handwritten Arabic Proximity Datasets. Language and Media Processing Laboratory.http://lampsrv02.umiacs.umd.edu/projdb/project.php?id=65
[19] Phillips, I. T., & Chhabra, A. K. (1999). Empirical performance evaluation of graphics recognition systems. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 21(9), 849-87
Downloads
Published
-
Abstract45
-
PDF34






