Concept-Based Event Extraction Utilizing Rich Semantics (CONVERSE)

Award Information
Agency: Department of Defense
Branch: Air Force
Contract: FA8750-07-C-0048
Agency Tracking Number: F061-060-2594
Amount: $742,660.00
Phase: Phase II
Program: SBIR
Awards Year: 2007
Solicitation Year: 2006
Solicitation Topic Code: AF06-060
Solicitation Number: 2006.1
Small Business Information
LANGUAGE COMPUTER CORP.
1701 North Collins Blvd., Suite 2000, Richardson, TX, 75080
DUNS: 127802234
HUBZone Owned: N
Woman Owned: N
Socially and Economically Disadvantaged: N
Principal Investigator
John Lehmann
Title: Principal Investigator
Phone: (972) 231-0052
Email: john.lehmann@languagecomputer.com
Business Contact
Yolanda Guzman
Title: VP - Financial & Legal
Phone: (972) 231-0052
Email: yolanda@languagecomputer.com
Research Institution
N/A
Abstract
This effort extends a novel approach to concept-based event extraction that leverages a rich substrate of semantic and conceptual knowledge to extract all of the essential information associated with events in text. In this work, we combine semantic information from a number of state-of-the-art text processing systems: (1) word sense disambiguation systems, (2) semantic parsers (based on PropBank, NomBank, and FrameNet annotations), (3) within-document and (4) cross-document coreference resolution systems, (5) named entity recognition systems, and (6) discourse parsers. Together, these systems produce robust conceptual representations of events in any domain. These forms of knowledge are then used in conjunction with an active learning-based framework for open-domain event extraction that can be rapidly customized to meet the particular information needs of a user. We plan to extend our Phase I work by incorporating new (1) statistical models for estimating the correctness of extracted information, (2) kernel-based methods for extracting the essential relations associated with event types, (3) unsupervised methods for named entity recognition, (4) inference-based methods for recognizing coherence relations, and (5) event coreference and event merging techniques for performing inter-sentential event extraction.

* Information listed above is as of the time of submission.
