SBIR Phase I: Novel Spatial Speech Separation Techniques to Improve Speaker Identification and Speech Recognition

Award Information
Agency:
National Science Foundation
Branch
n/a
Amount:
$99,999.00
Award Year:
2008
Program:
SBIR
Phase:
Phase I
Contract:
0810012
Award Id:
88378
Agency Tracking Number:
0810012
Solicitation Year:
n/a
Solicitation Topic Code:
n/a
Solicitation Number:
n/a
Small Business Information
13619 Valley Oak Circle, Rockville, MD, 20850
Hubzone Owned:
N
Minority Owned:
N
Woman Owned:
N
Duns:
620282256
Principal Investigator:
Chiman Kwan
PhD
(301) 315-2322
chiman.kwan@signalpro.net
Business Contact:
Chiman Kwan
PhD
(301) 315-2322
chiman.kwan@signalpro.net
Research Institute:
n/a
Abstract
This Small Business Innovation Research (SBIR) Phase I research project focuses on a novel approach that uses multiple microphones to spatially separate speech and hence enhance speaker identification and speech recognition accuracy. The first component of the project will apply feature-driven adaptive beam forming to improve speech recognition accuracy in noisy and reverberant environments. Unlike conventional beam forming methods, this proposed method performs weight adaptation based on feedback from the feature extraction and recognition modules. The arrays can eliminate directional interferences and reduce reverberant effects. For the second component of the proposed project, a new continuously-variable mask estimation method will be used to estimate which parts of the separated speech contain speech with good signal-to noise ratio (SNR), applying speech reconstruction to repair these regions. The repaired speech will then be fed into the speech recognition/speaker identification engine. Thirdly, the project will apply ideas motivated by human binaural perception. The potential market for robust speaker identification and speech recognition is projected to be large, ranging from voice dictation in noisy offices, voiceprint verification for financial transactions, biometrics, speech recognition in automated voice response systems, voice-based search for mobile devices, and several others. Additionally there are many military applications in military command, control, and communications segments, where it is important to convey the correct commands to the other recipients.

* information listed above is at the time of submission.

Agency Micro-sites


SBA logo

Department of Agriculture logo

Department of Commerce logo

Department of Defense logo

Department of Education logo

Department of Energy logo

Department of Health and Human Services logo

Department of Homeland Security logo

Department of Transportation logo

Enviromental Protection Agency logo

National Aeronautics and Space Administration logo

National Science Foundation logo
US Flag An Official Website of the United States Government