Semi-Supervised Algorithms against Malware Evolution (SESAME)

Award Information
Agency: Department of Defense
Branch: Air Force
Contract: FA8750-12-C-0144
Agency Tracking Number: F11B-T21-0014
Amount: $99,984.00
Phase: Phase I
Program: STTR
Awards Year: 2012
Solicitation Year: 2011
Solicitation Topic Code: AF11-BT21
Solicitation Number: 2011.B
Small Business Information
Charles River Analytics Inc.
625 Mount Auburn Street, Cambridge, MA, -
DUNS: 115243701
HUBZone Owned: N
Woman Owned: N
Socially and Economically Disadvantaged: N
Principal Investigator
 Avi Pfeffer
 Principal Scientist
 (617) 491-3474
 apfeffer@cra.com
Business Contact
 Mark Felix
Title: Contracts Manager
Phone: (617) 491-3474
Email: mfelix@cra.com
Research Institution
 University of Louisiana--Lafayette
 Ruth Landry
 104 University Circle
Lafayette, LA, 70504-0504
 (337) 482-5811
 Nonprofit college or university
Abstract
ABSTRACT: Recent years have seen an explosion in the number and sophistication of malware attacks. The sheer volume of novel malware has made purely manual signature development impractical and has led to research on applying machine learning and data mining to automatically infer malware signatures in the wild. Unfortunately, researchers have recently found ways to game the machine learning algorithms and learn to predict which samples the learning algorithms will classify as benign or malicious, thus opening the door for innovative deception on the part of malware developers. To counter this threat, we propose Semi-Supervised Algorithms against Malware Evolution (SESAME), which uses online learning to evolve as new malware is encountered, recognizing novel families and adapting its model of families as they themselves evolve. It uses semi-supervised learning to enable it to learn from both labeled and unlabeled malware. SESAME combines a rich feature set with deep learning algorithms to learn the essential characteristics of malware that enable us to relate novel malware to existing malware. We propose to evaluate the potential of the novel approach afforded by SESAME by using both standard malware datasets and malware specifically designed to fool automated detection systems. BENEFIT: Because SESAME provides an evolving, real-time detection system capable of defeating evolving malware, it will have immediate and tangible benefit for military and Government programs as well as commercial security products. As the number of new malware encountered continues to grow exponentially, we must support and augment human analysts with automated techniques that enable near-real-time malware detection and remediation. Thus, techniques to detect novel and deliberately deceptive attacks will benefit a range of Governmental and commercial security products.

* information listed above is at the time of submission.

Agency Micro-sites

SBA logo
Department of Agriculture logo
Department of Commerce logo
Department of Defense logo
Department of Education logo
Department of Energy logo
Department of Health and Human Services logo
Department of Homeland Security logo
Department of Transportation logo
Environmental Protection Agency logo
National Aeronautics and Space Administration logo
National Science Foundation logo
US Flag An Official Website of the United States Government