FAULT-TOLERANT DISTRIBUTED COMPUTING ON NETWORKS OF WORKSTATIONS

Award Information
Agency: National Science Foundation
Branch: N/A
Contract: N/A
Agency Tracking Number: 21664
Amount: $49,973.00
Phase: Phase I
Program: SBIR
Awards Year: 1993
Solicitation Year: N/A
Solicitation Topic Code: N/A
Solicitation Number: N/A
Small Business Information
Scientific Computing Associate
1 Century Tower, 265 Church St, New Haven, CT, 06510
Duns: N/A
Hubzone Owned: N
Woman Owned: N
Socially and Economically Disadvantaged: N
Principal Investigator
 Jens M Nielsen
 (203) 777-7442
Business Contact
Phone: N/A
Research Institution
N/A
Abstract
A "hypercomputer" is the parallel computer that emerges when one sums the unused cycles over all the computer nodes on a local area network. However, fault tolerance is a critical issue for longer-lived applications running on typical LANs, where node failures are not uncommon. Process checkpoint and restart mechanisms that allow parallel network computations to proceed even as their constituent processes fail are being developed. The resulting system is a fault-tolerant, extensible, cost-effective supercomputer based on low-cost, high-performance scientific workstations connected via a local area network.
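
The checkpoint and restart idea mentioned in the abstract can be illustrated with a minimal single-process sketch. This is not the awardee's system: the function names (save_checkpoint, load_checkpoint, run_task, run_with_restart), the pickle file format, the checkpoint interval, and the simulated failure are all assumptions made for illustration. A long-running loop periodically saves its state to disk, and a restarted process resumes from the last saved state instead of from the beginning.

```python
import os
import pickle
import tempfile


class SimulatedNodeFailure(Exception):
    """Hypothetical stand-in for a workstation crashing mid-computation."""


def save_checkpoint(path, state):
    # Write to a temporary file first, then rename it over the target, so a
    # crash in the middle of a write never leaves a half-written checkpoint.
    dir_name = os.path.dirname(os.path.abspath(path))
    fd, tmp_path = tempfile.mkstemp(dir=dir_name)
    with os.fdopen(fd, "wb") as f:
        pickle.dump(state, f)
    os.replace(tmp_path, path)


def load_checkpoint(path):
    # With no checkpoint on disk, start the task from its initial state.
    if not os.path.exists(path):
        return {"next_i": 0, "total": 0}
    with open(path, "rb") as f:
        return pickle.load(f)


def run_task(path, n, every=100, fail_at=None):
    # Sum i*i for i in [0, n), checkpointing after every `every` iterations.
    state = load_checkpoint(path)
    for i in range(state["next_i"], n):
        if fail_at is not None and i == fail_at:
            # In-memory progress since the last checkpoint is lost here.
            raise SimulatedNodeFailure(i)
        state["total"] += i * i
        state["next_i"] = i + 1
        if state["next_i"] % every == 0:
            save_checkpoint(path, state)
    save_checkpoint(path, state)
    return state["total"]


def run_with_restart(path, n, fail_at):
    # First attempt fails partway; the restart resumes from the checkpoint.
    try:
        return run_task(path, n, fail_at=fail_at)
    except SimulatedNodeFailure:
        return run_task(path, n)
```

In this sketch, work done after the most recent checkpoint is recomputed on restart, so the checkpoint interval trades checkpointing overhead against the amount of work lost per failure. A parallel system of the kind the abstract describes would also have to coordinate checkpoints across processes, which this example does not attempt.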

* Information listed above is as of the time of submission.