High Precision Event Extraction Using Predicate Arguments (HIPEPA)
Agency / Branch:
DOD / USAF
Our goal is to facilitate visualization of event information using a novel event extraction paradigm that achieves accuracy close to that of human analysts and allows both a simplified customization to new domains and extraction of complex events expressed as an event extraction framework. The event extraction framework replaces the pattern-based paradigm with predictate-argument structures that allow extraction of events in any domain. Mappings to new domain of interest can be learned in this new paradigm by making use of maximum entropy models. Furthermore, this novel paradigm that we propose takes advantage of several novel open-domain features including (1) an open domain semantic parser used to extract syntactic and semantic information (e.g. predicate-argument relations) from source documents and (2) discourse processing techniques such as coreference resolution of events produced by event normalization and event fusion. This paradigm allows temporal and spatial normalization of events such that space or time expressions are recognized and normalized when they are not explicit references e.g. "last summer", "four years" (duration), every month (set), and "a year after the earthquake (event anchored expressions). Similarly, in the case of special expressions, we consider implicit references e.g. "second house", areas of the country such as "The South", sets "every river", and event anchored expressions, e.g. "twenty miles north of Baghdad". As technical leaders in the field of Natural Language Processing and its application to unstructured text understanding for the military and intelligence communities, Language Computer Corporation (LCC) is well suited to provide this capability. Our proposed work on this SBIR, called High Precision Event Extraction Using Predicate Arguments (HIPEPA) will provide a framework for event extraction that is accurate, domain relevant, and easily customized to the dynamic information needs of the intelligence analyst. LCC will build on our experience and existing capability to provide a prototype pattern-free event extraction framework that normalizes spatial and temporal information for visualization and supports the detection and fusion of event data.
Small Business Information at Submission:
LANGUAGE COMPUTER CORP.
1701 North Collins Blvd., Suite 2000 Richardson, TX 75080
Number of Employees: