Exact Regression Software for Correlated Categorical Data

Award Information
Agency: Department of Health and Human Services
Branch: National Institutes of Health
Contract: 1R43GM112335-01A1
Agency Tracking Number: R43GM112335
Amount: $99,562.00
Phase: Phase I
Program: SBIR
Awards Year: 2015
Solicitation Year: 2015
Solicitation Topic Code: 400
Solicitation Number: PA14-071
Small Business Information
675 MASSACHUSETTS AVE, Cambridge, MA, 02139-3309
DUNS: 183012277
HUBZone Owned: N
Woman Owned: N
Socially and Economically Disadvantaged: N
Principal Investigator
 (617) 661-2011
Business Contact
Phone: (617) 661-2011
Email: mehta@cytel.com
Research Institution
DESCRIPTION provided by applicant The overarching goal of the proposed research is to develop practical modeling tools including exact regression procedures for small or sparse samples of correlated categorical data Such outcomes are common in biomedical research especially in areas such as genetics ophthalmology and teratology One can encounter correlated categorical data wherever multiple outcomes are measured on an individual over time or on several different individuals who share common genetic or environmental exposures A large body of methods has been developed for analyzing correlated categorical outcomes which conventionally rely on large sample distributional assumptions e g approximate normality to justify their inferences When faced with a small or sparse sample of categorical data investigators have few viable analytic options and none that allow for exact inferences with regard to estimation Our proposed work will fill this gap building on critical recent developments of both appropriate models and computational technology During Phase I of this project we will accomplish this by developing an analogue to conditional logistic regression for correlated categorical data constructing an efficient network graphical algorithm for rapi computation of the exact distribution in Aim and Investigating the feasibility of incorporating these procedures into a SAS PROC We plan to expand this work in Phase II by incorporating our new tools as a module in the LogXact software package extending the exact regression procedure to accommodate Poisson and polychromous regression for correlated data and significantly improving the computational efficiency of these new tools through efficient Monte Carlo sampling and parallel processing We will also create a module for a SAS PROC making these methods as widely available as possible to researchers and analysts PUBLIC HEALTH RELEVANCE Many biomedical and public health studies make observations that are correlated or related e g when individuals are measured repeatedly over time or when subjects are sampled from the same family or group When such samples are small conventional statistical methods that account for this correlation may be inaccurate This project will develop new software tools to help investigators more accurately analyze data from studies that involve small samples of correlated data

* Information listed above is at the time of submission. *

Agency Micro-sites

SBA logo
Department of Agriculture logo
Department of Commerce logo
Department of Defense logo
Department of Education logo
Department of Energy logo
Department of Health and Human Services logo
Department of Homeland Security logo
Department of Transportation logo
Environmental Protection Agency logo
National Aeronautics and Space Administration logo
National Science Foundation logo
US Flag An Official Website of the United States Government