High-throughput Epistasis Screening Using Genetical Genomics
Small Business Information
Insilicos, 111 Queen Anne Ave N., #500, SEATTLE, WA, 98109
AbstractDESCRIPTION (provided by applicant): High-throughput Epistasis Screening using Genetical Genomics A fast software tool is proposed for identifying potential sets of interacting genes involved in human disease pathways. A meta-analysis of marker and express ion-trait studies is performed using penalized regression software running in parallel on commodity graphics cards. The research team includes experts from genomics, statistics and software acceleration. Data will come from published studies. Initial resul ts suggest promise for our approach. Epistasis is a key area of investigation in the elucidation of human- disease pathways. eQTL experiments have shown promise in identifying epistasis for given expression traits. We will leverage the success of eQTLs by employing the results of GWAS experiments to suggest specific expression traits to study. In this way we will exploit the findings of multiple, disparate studies in an overall meta-analysis of a disease trait. Various forms of regression analysis are curre ntly used to screen eQTL data for epistasis, especially stepwise linear regression. We will employ penalized regression techniques, because of their speed advantage, their ability to identify multiple candidates simultaneously and their relative novelty. W e will apply several distinct types of penalized regression, each with its own predictor-selection characteristics. We have strong in-house expertise in penalized regression. As more and larger genomic data sets become available, effective means for combin ing and mining them become essential. The sheer mass of the data, moreover, will require high-performance software in order to provide analysis in reasonable time. Parallel computation is one promising area for improving software performance. We will emplo y the new generation of inexpensive, widely-available graphics coprocessors to run our software in parallel. Successful application will demonstrate that relevant, large- data bioinformatics solutions can be implemented on modestly-priced desktop hardware. PUBLIC HEALTH RELEVANCE: Personalized medicine is based on the observation that susceptibility to disease has a strong genetic component. This genetic component consists of groups of highly interacting genes. We will develop high- speed software ab le to process the huge amounts of data needed to identify these interactions and the role they play in disease susceptibility.
* information listed above is at the time of submission.