USA flag logo/image

An Official Website of the United States Government

SBIR Phase II: Multimodal Semantic Video Retrieval and Summarization

Award Information

National Science Foundation
Award ID:
Program Year/Program:
2011 / SBIR
Agency Tracking Number:
Solicitation Year:
Solicitation Topic Code:
Phase II
Solicitation Number:
Small Business Information
Video Semantics LLC
3565 A2 Ellicott Mills Dr Suite 201 Ellicott City, MD 21043-0000
View profile »
Woman-Owned: No
Minority-Owned: No
HUBZone-Owned: No
Phase 2
Fiscal Year: 2011
Title: SBIR Phase II: Multimodal Semantic Video Retrieval and Summarization
Agency: NSF
Contract: 1058428
Award Amount: $500,000.00


This Small Business Innovation Research Phase II project will develop contextual video segmentation and automatic tagging technology and software. In long video streams that contain one or more topics, the software automatically discovers the beginnings and ends of Contextually-Coherent Video Segments in each video. Moreover, Video Semantics' technology automatically assigns textual tags to each segment such that these tags describe the topic discussed in that segment. The tags assigned make all parts of the video easily searchable. Large video producers currently depend on manually segmenting their content into small segments and assigning textual tags to these segments in order to make them searchable. A short advertisement is then inserted before each segment. This manual segmentation and tagging process represents a significant pain point for content producers because it is labor intensive and not cost effective. Meanwhile, government agencies, which continuously monitor video content depend on speech recognition to spot specified keywords. This approach inflicts two pain points: (i) analysts have to deal with large number of false detections because the context in which the keyword occurs might be irrelevant, and (ii) if the keyword occurs in an important context, analysts still need to scroll back and forth into the video to find the beginning of the relevant segment. Video Semantics' technology and products have the potential to efficiently address significant market needs. In addition to the commercial applications, the proposed technology will enable media monitoring agencies to perform their tasks more efficiently saving valuable analyst time and resources. Moreover, because Video Semantics? technology is language-independent, media monitoring agencies will be able to monitor more content in foreign languages without the need to develop language-specific technologies. The company will employ an indirect sales strategy via partnerships with software companies that develop media monitoring solutions and metadata generation tools. The company has identified its first customer and is working with them to integrate the contextual segmentation and tagging technology with their current media monitoring solutions.

Principal Investigator:

Wael Abd-Almageed

Business Contact:

Wael Abd-Almageed
Small Business Information at Submission:

Video Semantics LLC
3565 A2 Ellicott Mills Dr Suite 201 Ellicott City, MD 21043-0000

EIN/Tax ID: 262731680
Number of Employees:
Woman-Owned: No
Minority-Owned: No
HUBZone-Owned: No