High Performance Computing (HPC) Tools for Topology Aware Mapping of Inter-node communication

Award Information
Agency: Department of Defense
Branch: Defense Threat Reduction Agency
Contract: HDTRA118P0009
Agency Tracking Number: T172-002-0009
Amount: $149,998.00
Phase: Phase I
Program: SBIR
Solicitation Topic Code: DTRA172-002
Solicitation Number: 2017.2
Timeline
Solicitation Year: 2017
Award Year: 2018
Award Start Date (Proposal Award Date): 2018-01-17
Award End Date (Contract End Date): 2018-08-16
Small Business Information
6210 Kellers Church Road, Pipersville, PA, 18947
DUNS: 929950012
HUBZone Owned: N
Woman Owned: N
Socially and Economically Disadvantaged: N
Principal Investigator
Name: Donald Kenzakowski
Phone: (215) 766-1520
Email: kenzakow@craft-tech.com
Business Contact
Name: Brian York
Phone: (215) 766-1520
Email: york@craft-tech.com
Research Institution
N/A
Abstract
This proposal describes the development of a generalized toolkit that enables improved and automated mapping of partitioned subdomains onto available distributed compute nodes for applications operating within pure-MPI or hybrid-MPI parallel runtime environments. The toolkit may be invoked either as an independent pre-processing step or as a dynamic library, improving an application's real-time domain decomposition and placement decisions based on the available hardware nodes. Its purpose is to significantly reduce the runtime bottlenecks incurred when message-passing data moves inefficiently across the interconnect topologies of modern compute platforms. Our Phase I effort will start from available information on hardware node organization for HPC platforms. Software libraries will be developed to help standard partitioning algorithms optimize subdomain organization so that communication costs across nodes are minimized. Topology mapping will be incorporated into partition decision-making to supplement the overall workload-balancing strategy. A ping test routine will be constructed to help identify current system communication latency costs during simulation runtime. Several partitioning strategies, varying the decomposition approach and the degree of hybrid parallelism, will be evaluated for their ability to minimize inter-node communication traffic. Communication profiling information will also be collected using TAU's profiling, display, and database management tools to help develop an optimized topology-awareness strategy.
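
As an illustration only, the idea of placing partitioned subdomains so that inter-node traffic is minimized can be sketched in a few lines of Python. This is not the awardee's toolkit or algorithm; the function names (`inter_node_traffic`, `greedy_placement`), the communication table `comm`, and the simple greedy heuristic are all assumptions made for the example. It compares a naive round-robin placement against one that tries to co-locate the heaviest-communicating subdomain pairs on the same node.

```python
def inter_node_traffic(comm, placement):
    """Total message volume between subdomains that sit on different nodes."""
    return sum(vol for (a, b), vol in comm.items() if placement[a] != placement[b])


def greedy_placement(comm, n_subdomains, n_nodes, slots_per_node):
    """Hypothetical heuristic: visit the heaviest pairs first and try to put
    each subdomain on the same node as its partner while slots remain."""
    if n_subdomains > n_nodes * slots_per_node:
        raise ValueError("not enough slots for all subdomains")
    placement = {}
    free = [slots_per_node] * n_nodes

    def assign(subdomain, preferred=None):
        # Use the preferred node if it still has room, else the emptiest node.
        node = preferred
        if node is None or free[node] == 0:
            node = max(range(n_nodes), key=lambda n: free[n])
        placement[subdomain] = node
        free[node] -= 1

    # Heaviest-communicating pairs first.
    for (a, b), _ in sorted(comm.items(), key=lambda kv: -kv[1]):
        for subdomain, partner in ((a, b), (b, a)):
            if subdomain not in placement:
                assign(subdomain, placement.get(partner))

    # Subdomains with no recorded traffic go wherever there is room.
    for subdomain in range(n_subdomains):
        if subdomain not in placement:
            assign(subdomain)
    return placement


# Assumed example: 4 subdomains, 2 nodes with 2 slots each.
# Keys are subdomain pairs, values are exchanged message volume (arbitrary units).
comm = {(0, 1): 10, (2, 3): 10, (1, 2): 1, (0, 3): 1}

round_robin = {s: s % 2 for s in range(4)}
mapped = greedy_placement(comm, n_subdomains=4, n_nodes=2, slots_per_node=2)

naive_cost = inter_node_traffic(comm, round_robin)
mapped_cost = inter_node_traffic(comm, mapped)
print(naive_cost, mapped_cost)  # prints "22 2"
```

In this toy case the round-robin layout splits both heavy pairs across nodes, while the greedy layout keeps them together. A real tool would also have to weigh measured link latency and workload balance, as the abstract describes.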

* Information listed above is at the time of submission. *
