You are here

Distributed System for Privacy Protecting Speech Processing

Award Information
Agency: Department of Homeland Security
Branch: N/A
Contract: HSHQDC-15-C-00022
Agency Tracking Number: HSHQDC-15-R-00017-H-SB015.1-004-0006-I
Amount: $100,000.00
Phase: Phase I
Program: SBIR
Solicitation Topic Code: H-SB015.1-004
Solicitation Number: HSHQDC-15-R-00017
Timeline
Solicitation Year: 2015
Award Year: 2015
Award Start Date (Proposal Award Date): 2015-05-01
Award End Date (Contract End Date): 2015-10-31
Small Business Information
2150 SHATTUCK AVE, PENTHOUSE
BERKELEY, CA 94704-1370
United States
DUNS: 078834495
HUBZone Owned: No
Woman Owned: No
Socially and Economically Disadvantaged: Yes
Principal Investigator
 Arlo Faria
 President
 (510) 705-3223
 arlo@mod9.com
Business Contact
 Arlo Faria
Title: President
Phone: (510) 705-3223
Email: arlo@mod9.com
Research Institution
N/A
Abstract

This proposal explores feature representations for automatic speech processing algorithms with a focus on preserving the clients' privacy. We address two different use cases of privacy: a system that transcribes speech, but is unable to infer the identity of speakers, such as an anonymous tip hotline; and a system that identifies speakers, but is unable to recognize the communicated messages, useful in scenario where a many conversations may be under surveillance in order to locate and isolate a targeted individual. We investigate the feasibility of implementing such systems by focusing on the audio signal representations in a distributed system, where embedded devices compute representations and transmit them to a Big Data analytics service via the Internet. Basic spectral acoustic features, tandem/bottleneck features, and high-dimensional outputs from deep neural networks will all be evaluated for both automatic speech recognition (ASR) and speaker identification (SID) tasks. Performance will be assessed to identify configurations favorable for both use cases. We additionally consider the possibility that an adversarial service operator may attempt to associate identities of speakers by clustering received requests, or reconstructing messages that have been transmitted over a mixture network.

In addition to serving the homeland security mission, the private-sector commercialization potential of this research is substantial. It could prove useful to Remeeting, which is being developed as a mobile app and cloud service to record personal conversations; significant privacy concerns serve as barriers to adoption by individual and enterprise customers.

* Information listed above is at the time of submission. *

US Flag An Official Website of the United States Government