Generic Automatic Recognition System for Handwritten Arabic Script Documents
Agency / Branch:
DOD / ARMY
This proposal addresses the feasibility of developing a generic system framework for automatically recognizing handwritten text for non-Arabic languages using Arabic-style script such as Urdu or Pashto. Despite recent progress in automatic handwritten Arabic recognition, little attention has been paid to non-Arabic languages using Arabic-style scripts. Nonetheless, automatic recognition of non-Arabic languages using Arabic-style scripts such as Urdu and Pashto has great potential in both military and commercial applications. Hence, the goal of this proposal is to develop a generic system framework for automatically recognizing handwritten text for non-Arabic languages using Arabic-style script. In particular, we focus on extending the optical Arabic text recognition capability to the Urdu language in which computing standards are established. The generic Arabic-style script recognition system will be based on the Hidden Markov Model approach that is suitable for cursive and context-sensitive Urdu scripts. Various feature-extraction algorithms will be implemented based on the proposed generic handwritten Arabic-style script recognition framework. We shall evaluate the basic performance of the developed system using a handwritten Urdu database collected during Phase I.
Small Business Information at Submission:
OPTIMAL SYNTHESIS, INC.
868 San Antonio Road Palo Alto, CA 94303
Number of Employees: