AUDITORY MODEL SIGNAL PROCESSING FOR SPEECH RECOGNITION

Award Information

Agency:
Department of Defense
Branch:
Defense Advanced Research Projects Agency
Award ID:
13566
Program Year/Program:
1990 / SBIR
Agency Tracking Number:
13566
Solicitation Year:
N/A
Solicitation Topic Code:
N/A
Solicitation Number:
N/A
Small Business Information
Votan
4487 Technology Dr, Fremont, CA 94538
Woman-Owned: No
Minority-Owned: No
HUBZone-Owned: No
 
Phase 1
Fiscal Year: 1990
Title: AUDITORY MODEL SIGNAL PROCESSING FOR SPEECH RECOGNITION
Agency / Branch: DOD / DARPA
Contract: N/A
Award Amount: $46,000.00
 

Abstract:

State-of-the-art voice recognition technology is based on matching spectral voice patterns (acoustic energy as a function of time and frequency). The signal processing requirements of spectral pattern matching are currently served by special-purpose filter banks or digital transforms performed on high-performance DSP chips.

For several years Votan has been performing an in-depth study of the human auditory system to obtain a better understanding of how signals are processed and speech features extracted by a human being. As part of this research, detailed mathematical models of the physics, chemistry, and neurophysiology of the auditory system have been developed and compared with available experimental data. This research has demonstrated that the signal processing and feature extraction processes in a human being are radically different from the spectral pattern approach of current voice recognition systems. The auditory system is extremely sensitive to features not present in the spectral pattern (principally phase and timing features), and conversely is insensitive to features that are prominent in the spectral pattern. These differences are of vital importance for accurate speech recognition.

The objective of the proposed Phase I effort is to determine the feasibility of developing a preprocessor for speech recognition that incorporates, as accurately as possible, a model of the human auditory system. This preprocessor would perform the functions of the outer ear, the middle ear, the inner ear (cochlea), hair cell neural transduction, and neural signal processing in the cochlear nucleus. The output of the preprocessor would be acoustic features suitable for speech recognition systems using either conventional pattern matching techniques or the newer neural net techniques.

Anticipated Benefits/Potential Commercial Applications - Speech recognition is an extremely important area for both commercial and defense applications. Recognition accuracy, particularly for large vocabulary, continuous, speaker independent recognition over
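
Illustrative note (not part of the award record): the abstract's point that phase features are absent from a spectral pattern can be sketched in a few lines of Python. The sketch below is only a toy demonstration under stated assumptions, not Votan's method or any auditory model. The function names `dft_magnitudes` and `two_tone`, the 64-sample frame length, and the choice of tones are all hypothetical. It builds two frames that contain the same two tones with different relative phase, then compares their magnitude spectra (the quantity a spectral pattern matcher would see) against the waveforms themselves.

```python
import cmath
import math


def dft_magnitudes(frame):
    """Return |X[k]| for k = 0..N/2 using a direct O(N^2) DFT."""
    n = len(frame)
    mags = []
    for k in range(n // 2 + 1):
        acc = sum(
            frame[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n)
        )
        mags.append(abs(acc))
    return mags


def two_tone(n, bin_a, bin_b, phase_b):
    """Sum of two unit-amplitude cosines centred on DFT bins bin_a and bin_b.

    phase_b shifts only the second tone, so it changes the waveform shape
    without changing the energy at either frequency.
    """
    return [
        math.cos(2 * math.pi * bin_a * t / n)
        + math.cos(2 * math.pi * bin_b * t / n + phase_b)
        for t in range(n)
    ]


N = 64  # assumed frame length, chosen so both tones fall exactly on DFT bins
frame_1 = two_tone(N, 4, 8, 0.0)
frame_2 = two_tone(N, 4, 8, math.pi / 2)

mags_1 = dft_magnitudes(frame_1)
mags_2 = dft_magnitudes(frame_2)

# Largest difference between the two magnitude spectra (numerically near zero).
max_spectral_diff = max(abs(a - b) for a, b in zip(mags_1, mags_2))
# Largest sample-by-sample difference between the two waveforms (clearly nonzero).
max_waveform_diff = max(abs(a - b) for a, b in zip(frame_1, frame_2))

print("max magnitude-spectrum difference:", max_spectral_diff)
print("max waveform difference:", max_waveform_diff)
```

In this toy case the two frames are indistinguishable by energy per frequency even though their waveforms differ, which is the kind of information a magnitude-only spectral pattern discards. Whether and how an auditory-model preprocessor would retain such information is not specified in the abstract.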

Principal Investigator:

Dr. Stephen Gill
(415) 490-7600

Business Contact:

Small Business Information at Submission:

Votan
4487 Technology Dr, Fremont, CA 94538

EIN/Tax ID:
DUNS: N/A
Number of Employees:
Woman-Owned: No
Minority-Owned: No
HUBZone-Owned: No