Software for Missing Covariate Data in Cancer Trials.

Award Information
Agency:
Department of Health and Human Services
Branch
n/a
Amount:
$750,000.00
Award Year:
2004
Program:
SBIR
Phase:
Phase II
Contract:
2R44CA080484-02
Award Id:
45637
Agency Tracking Number:
1R43CA080484-01A1
Solicitation Year:
n/a
Solicitation Topic Code:
n/a
Solicitation Number:
n/a
Small Business Information
CYTEL SOFTWARE CORPORATION, 675 MASSACHUSETTS AVE, CAMBRIDGE, MA, 02139
Hubzone Owned:
N
Minority Owned:
N
Woman Owned:
N
Duns:
n/a
Principal Investigator:
CYRUS MEHTA
(617) 661-2011
MEHTA@CYTEL.COM
Business Contact:
CYRUS MEHTA
(617) 661-2011
MEHTA@CYTEL.COM
Research Institute:
n/a
Abstract
DESCRIPTION (provided by applicant): This is a Phase II SBIR proposal for completing the development of a comprehensive collection of statistical tools embedded in LogXact, in EGRET, in SAS as PROCs and in SPLUS as functions. This set of tools will compute maximum likelihood estimates for generalized linear models (GLMs) and parametric survival models with missing categorical covariates, where the missing covariates are assumed to be missing at random (MAR). In this Phase II effort, we will expand the current version of tools available in prototype software XMISS to handle: (i) missing categorical covariates for binomial response models with Iogit, probit, or complementary log-log links, (ii) missing categorical covariates for conditional logistic regression for matched case-control data, (iii) missing categorical covariates for Poisson regression models, (iv) missing categorical covariates for normal linear regression models, (v) missing categorical covariates for ordinal response regression models, (vi) missing categorical covariates for exponential, Weibull and log-normal regression models allowing for right censoring in the response variable. In the development of the Phase II software for all of the GLM's and survival models considered above, we will allow a missing covariate to have up to 5 categories, any of which may have missing values. Also, in Phase II development, we will allow up to 50 covariates total, of which 10 binary covariates can be missing. In addition, we will investigate methods for speeding up the EM algorithm as well as develop new algorithms for obtaining good starting values for the EM algorithm. Missing covariate data is very common problem with cancer clinical trials. There exists no commercial software to handle missing covariate data by maximum likelihood method for the range of models listed above.

* information listed above is at the time of submission.

Agency Micro-sites


SBA logo

Department of Agriculture logo

Department of Commerce logo

Department of Defense logo

Department of Education logo

Department of Energy logo

Department of Health and Human Services logo

Department of Homeland Security logo

Department of Transportation logo

Enviromental Protection Agency logo

National Aeronautics and Space Administration logo

National Science Foundation logo
US Flag An Official Website of the United States Government