You are here
Management software for genetic analysis GeMS
Phone: (206) 854-3533
Email: TOMPGER@AOL.COM
Phone: (206) 240-7388
Email: TOMPGER@AOL.COM
Address:
Type: Domestic Nonprofit Research Organization
DESCRIPTION (provided by applicant): The human genome project, officially begun over a decade ago, has continued to yield superb technologies and vast datasets applicable to a wide diversity of scientific interests. While the technology to acquire vast amounts of data is now well established and continues to expand, the ability to deal with such data, from the process of acquisition, storage, and analysis depends fundamentally on a solid informatics infrastructure as an essential component. Indeed, most of the major gains in productivity in this field are to be realized on the informatics front, in automating data acquisition, defining and sorting data in databases for quality control and analysis, facilitating access to data, and for the large variety of analyses that such data will be subjected. Over the past 2 years we have been involved in establishing an informatics infrastructure in a small genetics laboratory focused on resequencing human immune response genes. From this experience we have built a genetics management system (GEMS) that is applicable in a general way to the management of data acquired from automatic DNA sequencers. This resource includes support for three types of data acquisition: 1) PCR resequencing which can be used for SNP discovery and genotyping; 2) shotgun sequencing projects, applicable to primary data acquisition as well as resequencing of target regions from cohorts in disease studies; and 3) genotyping methods that use capillary-based data acquisition (e.g. micro satellite, SSP-SNP, HLA typing). To this end, we propose to accomplish the following specific aims: 1) To develop a software package that improves the throughput and productivity of small to medium sized sequence-based genotyping and sequencing labs. A prototype of the final package has already gone through two complete code revisions, and is being used to design DNA sequencing trials, organize the sequencing process, gather data, and assess the quality of the sequencing output; and 2) To work in parallel with existing collaborations aimed at enhancing the ability for diverse genetics data producers to share data. We have identified six laboratories that produce genetic sequence data that have agreed to collaborate with us in testing the existing software. From this experience, we will learn and incorporate new functionality into the software with the aim of producing a package that can effectively support the activities of all six laboratories.
* Information listed above is at the time of submission. *