Secure outsourced computation of genomic data

Award Information
Agency: Department of Health and Human Services
Branch: National Institutes of Health
Contract: 1R41HG010978-01
Agency Tracking Number: R41HG010978
Amount: $344,948.00
Phase: Phase I
Program: STTR
Solicitation Topic Code: 172
Solicitation Number: PA18-575
Solicitation Year: 2018
Award Year: 2019
Award Start Date (Proposal Award Date): 2019-09-09
Award End Date (Contract End Date): 2020-08-31
Small Business Information
7919 AVENIDA KIRJAH, La Jolla, CA, 92037-3507
DUNS: 080795543
HUBZone Owned: N
Woman Owned: N
Socially and Economically Disadvantaged: N
Principal Investigator
Phone: (617) 953-1662
Business Contact
Phone: (617) 294-9269
Research Institution
HOUSTON, TX, 77030-5400
 Nonprofit college or university
Project Summary
In the age of precision medicine, genomic data are being integrated with other health care data to support personalized and calibrated clinical decision making. Genomic sequence data are too large to be stored in electronic health record (EHR) systems and need to be stored separately. While cloud computing offers a cost-efficient and scalable platform, the privacy and security concerns about outsourcing genomic data are challenging issues. The common perception is that the ease of access to remote data and the protection of privacy are at odds with each other. We propose a new genomics archiving and communications system (GACS) that meets both requirements by using state-of-the-art homomorphic encryption algorithms and a matrix representation of data and queries. In this system, variants are represented as vectors that are homomorphically encrypted by a client and stored on the GACS server. When analysis is required, a query is generated in the form of a matrix. This matrix is encrypted (or can remain in plaintext, depending on the task) and sent to the GACS server. The server computes on encrypted data, produces an encrypted result, and returns it to the client, who has the secret key to decrypt it. The GACS is not able to decrypt the data or the encrypted queries, thus guaranteeing that privacy and security are maintained on the GACS. Preliminary results of the algorithms show that, after decryption, the results are the same as results from computing on plaintext. In this project, we will implement our GACS software modules and demonstrate the use of the system with examples from three use cases: pharmacogenomics, clinical trials eligibility, and analysis for disease risks. We will measure performance speed and memory consumption in all three use cases. A GACS as a cloud-hosted service can reduce the computational burden on healthcare facilities. It can provide small healthcare facilities with the same genomic analysis capability available to larger hospitals. In addition, clinical decision support (CDS) can be deployed on the GACS. As clinical guidelines evolve in response to new discoveries linking genetic variants to disease and medicines, healthcare facilities can stay in compliance with the guidelines.

Project Narrative
The use of genomic data in clinical decision making is rapidly increasing. Because genomic sequence data are large, they cannot be stored easily in electronic health record systems. Furthermore, since genomic data are highly sensitive in nature, they must be protected in storage and during analysis. We propose a new genomics archiving and communications system (GACS) that satisfies the requirement of easy access to the data by clinical systems and provides strong protection for privacy. This system is based on state-of-the-art encryption algorithms. Genome data are encrypted and stored in the GACS. The data are analyzed while remaining encrypted. The GACS learns neither the data nor the analysis questions, thus guaranteeing that privacy is maintained on the GACS server. We will test the new system on three use cases: pharmacogenomics, clinical trials eligibility, and gene analysis for disease risk.
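
Illustrative sketch (not part of the award text): the workflow described in the summary, where a client encrypts a variant vector, the server applies a query matrix without decrypting, and the client decrypts the result, can be sketched in a few lines. The award does not name the homomorphic encryption scheme GACS uses. This example swaps in a toy Paillier cryptosystem, which is additively homomorphic, and covers only the case where the query matrix stays in plaintext. The function names, the small key size, the genotype vector, and the query matrix are all hypothetical and chosen for illustration.

```python
import math
import secrets

# Toy Paillier parameters: two Mersenne primes. Far too small for real use.
P = 2**31 - 1
Q = 2**61 - 1


def keygen(p=P, q=Q):
    """Return (public modulus n, secret (lam, mu)) for Paillier with g = n + 1."""
    n = p * q
    lam = (p - 1) * (q - 1) // math.gcd(p - 1, q - 1)  # lcm(p - 1, q - 1)
    mu = pow(lam, -1, n)  # modular inverse of lam mod n (Python 3.8+)
    return n, (lam, mu)


def encrypt(n, m):
    """Client side: encrypt an integer m with 0 <= m < n."""
    n2 = n * n
    while True:
        r = secrets.randbelow(n - 1) + 1  # random blinding factor in [1, n)
        if math.gcd(r, n) == 1:
            break
    return (1 + m * n) % n2 * pow(r, n, n2) % n2


def decrypt(n, secret, c):
    """Client side: recover the plaintext integer from ciphertext c."""
    lam, mu = secret
    n2 = n * n
    return (pow(c, lam, n2) - 1) // n * mu % n


def server_apply(n, query, enc_vec):
    """Server side: compute query @ vector using only ciphertexts and n.

    Multiplying ciphertexts adds plaintexts; raising a ciphertext to k
    multiplies its plaintext by k. The server never sees the secret key.
    """
    n2 = n * n
    out = []
    for row in query:
        acc = 1  # trivial (unblinded) encryption of 0
        for k, c in zip(row, enc_vec):
            acc = acc * pow(c, k % n, n2) % n2
        out.append(acc)
    return out


# Hypothetical data: alternate-allele counts at four variant sites.
genotype = [0, 1, 2, 1]
# Hypothetical plaintext query matrix: one row per question.
query = [
    [0, 1, 0, 0],  # look up the count at site 2
    [1, 1, 1, 1],  # total alternate-allele burden
    [2, 0, 3, 1],  # weighted score with made-up weights
]

n, secret = keygen()
stored = [encrypt(n, g) for g in genotype]      # client encrypts and uploads
enc_result = server_apply(n, query, stored)     # server computes blindly
result = [decrypt(n, secret, c) for c in enc_result]  # client decrypts
expected = [sum(k * g for k, g in zip(row, genotype)) for row in query]

print(result)              # [1, 4, 7]
print(result == expected)  # True
```

The final comparison mirrors the check mentioned in the summary, that decrypted results match the results of computing on plaintext. Encrypted queries, as the summary also describes, would need a scheme that supports multiplying two ciphertexts, which Paillier does not provide.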

* Information listed above is at the time of submission. *
