Neighborhood Structure-Based Model for Multilingual Arbitrarily-Oriented Text Localization in Images/Videos

H.T. Basavaraju; V.N. Manjunath Aradhya; D.S. Guru

Author	H.T. Basavaraju; V.N. Manjunath Aradhya; D.S. Guru
Keywords	Multilingual Text; Clustering; Computer Vision; Image Processing; Maxmin Cluster; Arbitrarily-Oriented
Abstract	Text in an image or a video provides important clues and semantic information about the event being depicted. Text localization is an interesting and challenging research problem in image processing because of irregular alignment, varying brightness, degradation, and complex backgrounds. Multilingual text takes many different geometric shapes, which makes it even harder to locate. In this work, an effective model is presented for locating multilingual, arbitrarily oriented text. The proposed method develops a neighborhood structure model to locate the text region. Initially, the maxmin cluster is applied along with a 3×3 sliding window to sharpen the text region. The neighborhood structure then creates a boundary for every component using the normal deviation calculated from the sharpened image. Finally, the double stroke structure model is employed to locate the text region accurately. The presented model is evaluated on five standard datasets (NUS, arbitrarily oriented text, Hua's, MRRC, and a real-time video dataset) using recall, precision, and f-measure as performance metrics.
Year of Publication	2021
Journal	International Journal of Interactive Multimedia and Artificial Intelligence
Volume	7
Issue	Regular Issue
Number	2
Pages	134-140
Date Published	12/2021
ISSN Number	1989-1660
URL	https://www.ijimai.org/journal/sites/default/files/2021-11/ijimai7_2_12_0.pdf
DOI	10.9781/ijimai.2021.05.003
Attachment	ijimai7_2_12_0.pdf (1.63 MB)