Biologically Based Non-Language Speech Sound Detection

Award Information
Agency:
Department of Defense
Branch
Air Force
Amount:
$749,380.00
Award Year:
2008
Program:
SBIR
Phase:
Phase II
Contract:
FA8750-08-C-0119
Agency Tracking Number:
F071-079-1624
Solicitation Year:
n/a
Solicitation Topic Code:
n/a
Solicitation Number:
n/a
Small Business Information
ADVANCED ACOUSTIC CONCEPTS, INC.
425 Oser Avenue, Hauppauge, NY, 11788
Hubzone Owned:
N
Minority Owned:
N
Woman Owned:
N
Duns:
606421105
Principal Investigator:
Bruce Stewart
Senior Systems Engineer
(631) 273-5700
BStewart@LIO.AACISD.com
Business Contact:
Richard Lawless
Vice President Operations
(631) 273-5700
RLawless@LIO.AACISD.com
Research Institution:
n/a
Abstract
This proposal addresses the application of new acoustic processing technologies to automatically identify and eliminate non-language speech sounds as a pre-processing stage to improve audio processing. Non-language speech sounds (coughing, breathing, 'ah,' 'uhmm') make up a large part of natural human language use, but contemporary speech recognition data preparation relies on hand-labeling of non-language speech sounds. The proposed work will extend, improve, and refine the capabilities of a computational model of auditory cortical processing based on multiscale spectro-temporal modulation features to the automated detection of non-language speech sounds. Advanced Acoustic Concepts and the University of Maryland have extensive experience applying the computational model to a variety of speech processing problems. Phase I results indicate that cortical processing algorithms are highly capable of identifying non-language speech sounds. Recognition algorithms will be trained and tested to distinguish non-language speech sounds collectively and individually, to classify them by individual type, and to determine accurate segmentation of the sound stream. Speech enhancement algorithms based on the same cortical processing model will be designed or adapted and applied to improve the identification of non-language speech sounds under noisy conditions. The feasibility of performing identification, classification, and segmentation in near real time will be demonstrated.

* information listed above is at the time of submission.

Agency Micro-sites


SBA logo

Department of Agriculture logo

Department of Commerce logo

Department of Defense logo

Department of Education logo

Department of Energy logo

Department of Health and Human Services logo

Department of Homeland Security logo

Department of Transportation logo

Enviromental Protection Agency logo

National Aeronautics and Space Administration logo

National Science Foundation logo
US Flag An Official Website of the United States Government