USA flag logo/image

An Official Website of the United States Government

High-Precision Agile Active Learning for Domain-Customizable Information…

Award Information

Department of Defense
Air Force
Award ID:
Program Year/Program:
2007 / SBIR
Agency Tracking Number:
Solicitation Year:
Solicitation Topic Code:
Solicitation Number:
Small Business Information
Language Computer Corporation
2435 N. Central Expressway Suite 1200 Richardson, TX 75080-
View profile »
Woman-Owned: Yes
Minority-Owned: No
HUBZone-Owned: No
Phase 1
Fiscal Year: 2007
Title: High-Precision Agile Active Learning for Domain-Customizable Information Extraction (HALCYON)
Agency / Branch: DOD / USAF
Contract: FA8750-07-C-0148
Award Amount: $99,639.00


The dynamic operational environments in which Air Force users operate today require information extraction systems that can be rapidly - and easily - customized to new and challenging domains. In order to address operational demands for textual information, Language Computer Corporation (LCC) has developed a customizable information extraction system, known as CiceroCustom, which enables military and intelligence personnel to extract information from sources of unstructured textual information (including OSINT and HUMINT) quickly and efficiently. In this Phase I SBIR effort, called High-Precision Agile Active Learning for Domain-Customizable Information Extraction (HALCYON), LCC will extend the customizable information extraction capacity provided by CiceroCustom with a new framework which can be used to enhance the quality and accuracy of domain customizations performed by users. We plan to build an enhanced prototype which incorporates (1) an agile customization framework which leverages a novel paradigm for active learning, (2) a context-driven mechanism for customizing extractors to specific domains that allows for the incorporation of diverse forms of user input, (3) a novel method for integrating domain-specific knowledge into an information extraction system, and (4) a robust textual reasoning capability which leverages a state-of-the-art textual entailment system in order to reason about domain knowledge for extraction.

Principal Investigator:

Paul Aarseth
Principal Investigator

Business Contact:

Yolanda T. Guzman
VP-Financial & Legal
Small Business Information at Submission:

1701 North Collins Blvd., Suite 2000 Richardson, TX 75080

EIN/Tax ID: 752641308
Number of Employees:
Woman-Owned: No
Minority-Owned: No
HUBZone-Owned: No