USA flag logo/image

An Official Website of the United States Government

Concept-Based Event Extraction Utilizing Rich Semantics (CONVERSE)

Award Information

Agency:
Department of Defense
Branch:
Air Force
Award ID:
79058
Program Year/Program:
2007 / SBIR
Agency Tracking Number:
F061-060-2594
Solicitation Year:
N/A
Solicitation Topic Code:
N/A
Solicitation Number:
N/A
Small Business Information
Language Computer Corporation
2435 N. Central Expressway Suite 1200 Richardson, TX 75080-
View profile »
Woman-Owned: Yes
Minority-Owned: No
HUBZone-Owned: No
 
Phase 2
Fiscal Year: 2007
Title: Concept-Based Event Extraction Utilizing Rich Semantics (CONVERSE)
Agency / Branch: DOD / USAF
Contract: FA8750-07-C-0048
Award Amount: $742,660.00
 

Abstract:

This effort extends a novel approach to concept-based event extraction that leverages a rich substrate of semantic and conceptual knowledge in order to extract all of the essential information associated with events in text. In this work, we combine semantic information from a number of state-of-the-art text processing systems - including (1) word sense disambiguation systems, (2) semantic parsers (based on PropBank, NomBank, and FrameNet annotations), (3) within-document and (4) cross-document coreference resolution systems, (5) named entity recognition systems, and (6) discourse parsers -- in order to produce robust conceptual representations of events in any domain. These forms of knowledge are then used in conjunction with an active learning-based framework for open domain event extraction which can be rapidly customized to meet the particular information needs of a user. We plan to extend our Phase I work by incorporating new (1) statistical models for estimating the correctness of extracted information, (2) kernel-based methods for extracting the essential relations associated with event types, (3) unsupervised methods for named entity recognition, (4) inference-based methods for recognizing coherence relations, and (5) event coreference and event merging techniques for performing inter-sentential event extraction.

Principal Investigator:

John Lehmann
Principal Investigator
9722310052
john.lehmann@languagecomputer.com

Business Contact:

Yolanda T. Guzman
VP - Financial & Legal
9722310052
yolanda@languagecomputer.com
Small Business Information at Submission:

LANGUAGE COMPUTER CORP.
1701 North Collins Blvd. Suite 2000 Richardson, TX 75080

EIN/Tax ID: 752641308
DUNS: N/A
Number of Employees:
Woman-Owned: No
Minority-Owned: No
HUBZone-Owned: No