DocMark: A Rule-based and Probabilistic Document Marking System

Award Information
Agency: Department of Defense
Branch: Air Force
Contract: FA8750-15-C-0032
Agency Tracking Number: F131-052-0425
Amount: $749,908.00
Phase: Phase II
Program: SBIR
Solicitation Topic Code: AF131-052
Solicitation Number: 2013.1
Solicitation Year: 2013
Award Year: 2015
Award Start Date (Proposal Award Date): 2014-12-18
Award End Date (Contract End Date): 2016-12-17
Small Business Information
33 Thornwood Drive, Suite 500, Ithaca, NY, 14850
DUNS: 000000000
HUBZone Owned: N
Woman Owned: N
Socially and Economically Disadvantaged: N
Principal Investigator
 Robert Joyce
 Technical Director
 (607) 257-1975
Business Contact
 Richard Smith
Title: Mr.
Phone: (607) 257-1975
Research Institution
ABSTRACT: In order to prevent loss of sensitive data, organizations need to be able to tag data with classification and releasability metadata. Currently classification is a manual and labor-intensive process, which slows the dissemination of sharable data. ATC-NY will develop DocMark, a product to automatically determine classification and dissemination metadata and record it in standard formats for review by a human subject matter expert. DocMark will perform both file marking and portion marking (e.g., frames in a video stream). It will provide graphical front-ends for configuration and review, and APIs for marking individual data objects or collections of them. By automatically performing many of the steps that human beings now perform manually, DocMark will dramatically improve the speed at which data can be marked. BENEFIT: Many legacy military and intelligence sensors produce data objects that are not marked with the classification and releasability metadata needed for secure sharing. Marking is required both for newly created data objects and for repositories of unmarked objects. DocMark will convert marking from a manual process to a semi-automated one, in which the job of the human being is to review and occasionally correct a marking provided by DocMark. The first target markets of DocMark are the intelligence and military communities, who have an urgent need to disseminate data to the personnel who need it. The business community is now discovering its own need for classification, which helps prevent data loss by clarifying the difference between highly sensitive objects and those that can be freely disseminated. DocMark will be available as a commercial product; ATC-NY will also provide training, support, and integration/customization services.

* Information listed above is at the time of submission. *

Agency Micro-sites

SBA logo
Department of Agriculture logo
Department of Commerce logo
Department of Defense logo
Department of Education logo
Department of Energy logo
Department of Health and Human Services logo
Department of Homeland Security logo
Department of Transportation logo
Environmental Protection Agency logo
National Aeronautics and Space Administration logo
National Science Foundation logo
US Flag An Official Website of the United States Government