A Data Skimming Grid Portal
Small Business Information
5621 Arapahoe Avenue, Suite A, Boulder, CO, 80301
Abstract
72872 - High energy physics data sets are already very large and will continue to grow. A time-consuming and labor-intensive stage of any experimental effort is the event selection (or skimming) that must be performed at a remote site in order to deliver reasonably sized data sets to the end user. This project will develop a Web tool for remote data skimming. A rich client application will be developed for download and execution from any browser. It will have graphical widgets for forming pipelines of data selection criteria, as well as job submission and monitoring controls. The tool will be secure and collaborative, support multiple clients, and provide a repository for generated skims and skimmed data.

Phase I developed a prototype Data Skimming Grid Portal (DSGP) that can select from a set of skims located on the server, reconfigure those skims, and generate new skims using graphical components. The prototype grid service invokes the skim application, reports job status, and delivers the skimmed data to the client. Phase II will develop a well-documented release version of the DSGP, along with a full-featured reference implementation using the latest Globus middleware. The system will include a rich graphical client application and a command-line interface for submitting skim jobs with custom C++ code to be run in parallel on a test grid. Phase II will also develop a catalog grid service to store references to skims and data sets, enhance monitoring capabilities, add chat features for collaboration, and activate security.

Commercial Applications and Other Benefits as described by awardee: The data skimming tool could be used in various data-intensive experiments such as high energy physics experiments (CMS, ATLAS, etc.), space science observations, and climate modeling. Similarly, it could be used for the selection of data in e-commerce and e-banking applications.
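To make the idea of a "pipeline of data selection criteria" concrete, the following is a minimal sketch in Python and not the DSGP implementation. It assumes events are plain dictionaries and that a skim is an ordered list of cuts which an event must all pass. The names `Skim`, `min_pt`, `max_abs_eta`, and the event fields `pt` and `eta` are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Iterable, Iterator, List

# Assumption: an event is a flat mapping of field name to numeric value.
Event = Dict[str, float]
# A cut is one selection criterion: it returns True if the event is kept.
Cut = Callable[[Event], bool]


@dataclass
class Skim:
    """A named pipeline of cuts (hypothetical structure, for illustration)."""
    name: str
    cuts: List[Cut]

    def accepts(self, event: Event) -> bool:
        # An event survives the skim only if it passes every cut in order.
        return all(cut(event) for cut in self.cuts)

    def run(self, events: Iterable[Event]) -> Iterator[Event]:
        # Lazily yield only the events that pass the whole pipeline.
        return (event for event in events if self.accepts(event))


def min_pt(threshold: float) -> Cut:
    """Keep events whose (assumed) 'pt' field is at least the threshold."""
    return lambda event: event["pt"] >= threshold


def max_abs_eta(limit: float) -> Cut:
    """Keep events whose (assumed) 'eta' field lies within +/- limit."""
    return lambda event: abs(event["eta"]) <= limit


# Made-up sample events, used only to exercise the sketch.
events = [
    {"id": 1, "pt": 35.0, "eta": 0.4},
    {"id": 2, "pt": 12.0, "eta": 1.1},
    {"id": 3, "pt": 50.0, "eta": 3.0},
    {"id": 4, "pt": 22.5, "eta": -2.1},
]

skim = Skim("central_high_pt", [min_pt(20.0), max_abs_eta(2.5)])
selected = list(skim.run(events))
selected_ids = [event["id"] for event in selected]
```

Reconfiguring a skim, in this sketch, amounts to building a new `Skim` with different thresholds or a different list of cuts; the event data itself is never modified.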
* Information listed above is as of the time of submission.