User Adaptation of AAC Device Voices

Award Information

Agency:
Department of Health and Human Services
Branch:
N/A
Award ID:
85526
Program Year/Program:
2007 / STTR
Agency Tracking Number:
DC008712
Solicitation Year:
N/A
Solicitation Topic Code:
N/A
Solicitation Number:
N/A
Small Business Information
BIOSPEECH, INC.
940 UPPER DEVON LANE LAKE OSWEGO, OR 97034
Woman-Owned: No
Minority-Owned: No
HUBZone-Owned: No
 
Phase 1
Fiscal Year: 2007
Title: User Adaptation of AAC Device Voices
Agency: HHS
Contract: 1R41DC008712-01
Award Amount: $150,117.00
 

Abstract:

DESCRIPTION (provided by applicant): A wide range of individuals cannot communicate by voice. Voice-enabled Augmentative and Alternative Communication (AAC) devices are often the only channel available by which these individuals can communicate. While many voice-enabled AAC devices are currently available, they lack the important ability to generate customized speech that mimics aspects of the user's past or intermittently available speech. Modern concatenative speech synthesis technology can mimic a given speaker's voice by excising speech fragments from a recorded speech database (acoustic inventory) and recombining these into output speech using sophisticated algorithms. It requires, however, a large amount of recordings and a high degree of consistency of pronunciation of the speaker. Many AAC users cannot meet these requirements because they already have lost the capability to speak or they cannot speak with adequate consistency of pronunciation. A new type of technology, voice transformation (VT) technology, is available that can transform speech spoken by a source speaker into speech that is perceived as spoken by a specific target speaker. To tune the transformation system, parallel training recordings of the same text are needed from the source and target speakers. The amount of training recordings is far less than what is needed for a high-quality acoustic inventory. We propose to use VT in combination with speech synthesis to convert the synthesis system's acoustic inventory into an acoustic inventory that mimics the target speaker's voice. The training recordings can consist of old home videos, or fragmented recordings produced during periods of intact speech, provided that they contain at least one sample of each phoneme. In Phase I, we will develop and evaluate a VT-based synthesis system.
The project will use high-quality and home-video quality recordings from male and female adults and children to create limited acoustic inventories (adequate to generate a specific set of test sentences) and VT training recordings. Perceptual experiments will be conducted to evaluate voice quality and perceived speaker identity. Phase II will focus on developing complete acoustic inventories for several canonical speakers that will be selected to cover a range of speaker characteristics, and on producing portable, user-friendly software. The anticipated commercial offering consists of (i) software components to be licensed to AAC vendors and (ii) a service consisting of collection and processing of recordings and creation of personalized acoustic inventories. Speech communication ability is impaired or absent in millions of Americans due to neurological disorders and diseases and to trauma, including autism, Parkinson's disease, and stroke. Augmentative and Alternative Communication (AAC) devices that are operated via switches, keyboards, and a broad range of other input devices, and that have synthetic speech as output, are often the only manner in which these individuals can communicate. Without AAC devices, these individuals may suffer from severe social and psychological isolation, and may be unable to lead productive lives. A psychologically important feature that no currently available system has is the ability to speak with the user's voice, i.e., the ability to produce speech that mimics the individual's pre-morbid speech or speech that the individual may be able to intermittently produce. The proposed project will use voice transformation (VT) technology to accomplish this goal.
VT technology requires recordings of the user to be available, but there is substantial flexibility as to the nature and quantity of these recordings; they may consist of home videos or of fragmentary speech, provided that at least some samples are available of each speech sound in the language. The goal of the application is to develop a synthetic voice for an AAC system that sounds like the individual using the system (befo

Principal Investigator:

Jan Vansanten
5033411192
VANSANTEN@BIOSPEECH.COM

Business Contact:

Small Business Information at Submission:

BIOSPEECH, INC.
940 UPPER DEVON LANE LAKE OSWEGO, OR 97034

EIN/Tax ID: 200883610
DUNS: N/A
Number of Employees: N/A
Woman-Owned: No
Minority-Owned: No
HUBZone-Owned: No
Research Institution Information:
OREGON HEALTH AND SCIENCE UNIVERSITY
3181 SW Sam Jackson Pk Rd
PORTLAND, OR 97239-3098
RI Type: Nonprofit college or university