You are here

Librarian - AI Driven Multi-Int Unifying Platform Software Tool

Award Information
Agency: Department of Defense
Branch: Navy
Contract: N68335-23-C-0071
Agency Tracking Number: N222-118-0407
Amount: $139,955.00
Phase: Phase I
Program: SBIR
Solicitation Topic Code: N222-118
Solicitation Number: 22.2
Solicitation Year: 2022
Award Year: 2023
Award Start Date (Proposal Award Date): 2022-11-07
Award End Date (Contract End Date): 2023-05-09
Small Business Information
540 Fort Evans Road Suite 300
Leesburg, VA 20176-3379
United States
DUNS: 164558376
HUBZone Owned: No
Woman Owned: No
Socially and Economically Disadvantaged: No
Principal Investigator
 Ledger West
 (800) 405-8576
Business Contact
 Miguel Peko
Phone: (757) 618-3306
Research Institution

The number of sources and amount of data that Naval intelligence analysts are required to manually sift through in order to ensure maritime forces have both actionable intel and provide decision advantage for their commanders is daunting. Today, existing tools are time-consuming, workforce intensive, and cumbersome to process and distribute in a timely fashion. Fortunately, this is a problem that can be addressed using modern machine learning (ML). Mosaic ATM, Inc. is proposing the development of a software tool called Librarian, that a) accepts, parses, and stores multi-modal data sources; b) annotates the data source metadata; c) employs a collection of transformer-based neural network models that annotate and represent the content of the data as natural language or in an embedding structure that is common across all modes to allow cross-modality search; and d) exposes a user interface (UI) or application programming interface (API) that lets analysts easily query data across modalities. Mosaic's Phase I technical approach will be to first implement a solution using pretrained models to perform their native task or a related task using zero-shot inference. Example pretrained models that could be utilized include BERT-(language), CLIP-(visual-language), GPT-3-(language), and DETR-(visual-language and object detection). Second, Mosaic will enhance the performance of the innovation by incorporating domain-specific data. For language models, Mosaic will explore a self-supervised method called Generative Pseudo-Labeling (GPL). Mosaic will also explore an image model, specifically a technique called SimCLR. The third and final step will be advanced multi-modal inference which will be approached as a causal language modeling (CLM) problem, i.e., text generation. OpenAI GPT-3 is the current state of the art for such models and Mosaic will also explore a more cost-efficient variant of GPT-3, GPT-J or GPT-2 for fine tuning. The three scenarios that Mosaic proposes to utilize for ML algorithm development and model training are: Scenario #1) Intelligence, surveillance, and reconnaissance (ISR), protection, and defense of a key international maritime port utilized by commercial and military vessels of various nations; Scenario #2) Support of local LEAs conducting security at a high-risk event such as the Boston Marathon; and Scenario #3) Intelligence support of a Carrier Strike Group conducting an international strait transit such as through the Strait of Hormuz. As a means to bring capability to the end users in the most efficient and timely manner, Ultra has agreed to team with Mosaic for Phase II technology development and Phase III commercialization and U.S. Navy implementation. Ultra's Situational Awareness Management System (SAMS) is a proven ISR platform that is perfectly suited to host future Mosaic technology that will solve the Navy's, and perhaps DoD, challenges with respect to intelligence analysis efficiency and performance.

* Information listed above is at the time of submission. *

US Flag An Official Website of the United States Government