A Hybrid Algorithm for Recognizing the Position of Ezafe Constructions in Persian Texts

Mehrnoush Shamsfard; Samira Noferesti

Author	Mehrnoush Shamsfard Samira Noferesti
Keywords	Genetic Algorithms Rule Based Scripts Ezafe
Abstract	In the Persian language, an Ezafe construction is a linking element which joins the head of a phrase to its modifiers. The Ezafe in its simplest form is pronounced as –e, but generally not indicated in writing. Determining the position of an Ezafe is advantageous for disambiguating the boundary of the syntactic phrases which is a fundamental task in most natural language processing applications. This paper introduces a framework for combining genetic algorithms with rule-based models that brings the advantages of both approaches and overcomes their problems. This framework was used for recognizing the position of Ezafe constructions in Persian written texts. At the first stage, the rule-based model was applied to tag some tokens of an input sentence. Then, in the second stage, the search capabilities of the genetic algorithm were used to assign the Ezafe tag to untagged tokens using the previously captured training information. The proposed framework was evaluated on Peykareh corpus and it achieved 95.26 percent accuracy. Test results show that this proposed approach outperformed other approaches for recognizing the position of Ezafe constructions.
Year of Publication	2014
Journal	International Journal of Interactive Multimedia and Artificial Intelligence
Volume	2
Issue	Regular Issue
Number	6
Number of Pages	17-25
Date Published	06/2014
ISSN Number	1989-1660
Citation Key
URL	http://www.ijimai.org/journal/sites/default/files/files/2014/03/ijimai20142_6_2_pdf_23890.pdf
DOI	10.9781/ijimai.2014.262
	DOI Google Scholar BibTeX EndNote X3 XML EndNote 7 XML Endnote tagged Marc RIS
Attachment	IJIMAI20142_6_2.pdf993.83 KB