You are here

Scene Text Extraction and Translation for Handheld and Mobile Devices

Award Information
Agency: Department of Defense
Branch: Army
Contract: W911QX-04-C-0015
Agency Tracking Number: A022-1390
Amount: $1,287,870.00
Phase: Phase II
Program: SBIR
Solicitation Topic Code: A02-038
Solicitation Number: 2002.2
Timeline
Solicitation Year: 2002
Award Year: 2004
Award Start Date (Proposal Award Date): 2003-10-28
Award End Date (Contract End Date): 2007-06-18
Small Business Information
425 Costa Mesa Ter #H
Sunnyvale, CA 94085
United States
DUNS: 115713492
HUBZone Owned: No
Woman Owned: No
Socially and Economically Disadvantaged: No
Principal Investigator
 Esin Haritaoglu
 Senior Scientist
 (408) 735-1054
 esin.darici@polar-rain.com
Business Contact
 Ismail Haritaoglu
Title: President
Phone: (408) 506-3064
Email: ismail.haritaoglu@polar-rain.com
Research Institution
N/A
Abstract

We propose novel solution to develop and prototype a full automatic scene text detection and character and graphic recognition system for road sign translation which will run on a commercially off the shelf Personal digital Assistants (PDA). We propose to develop a prototype of a road sign/document translation system that can be run in a commercial of the shelf PDA. At the end of Phase-II effort, Army soldier will be able to use Polar Rains prototype software for road/informative sign translation and translate some of text section in document using their digital camera attached PDAs. They will point the PDA to signboard or document and press a single button to start transition. The system will capture the image of a foreign text and on board system automatically, extract the text from the image, recognize them, and translate to desired language, so they will able to see the foreign sign in their language in a less than 10 seconds. Recent advances in mobile device technology and high resolution digital camera availability for PDAs.This allows us to extend the capabilities of the proposed system by adding (a) fast low-level image processing techniques to enhance to image better text detection and character segmentation (b) feature extraction modules where the text identified, and skew and orientation distortion are corrected (c) a multi-level dynamic shape histogram based method for latin and non-latin character recognition (d) context aware machine translation module translates the recognized text in to desired language. (e) support other functionality, such as not only the road sign translation but also other informative text and graphical road signs and text in document translation (e.g., menu, advertisement, text on piece of a paper) (f) process single frames up to 1200x1024 resolution (g) increasing accuracy of the recognition and handle smaller size characters with new image super-resolution based magnification (h) live video-processing to use redundant information to obtain higher resolution and larger image to increase the recognition performance.

* Information listed above is at the time of submission. *

US Flag An Official Website of the United States Government