SBIR Phase I: Hybrid Question Answering Combining a Search Index with an RDF Store

Award Information
Agency: National Science Foundation
Branch: N/A
Contract: 1113285
Agency Tracking Number: 1113285
Amount: $149,800.00
Phase: Phase I
Program: SBIR
Award Year: 2011
Solicitation Year: 2010
Solicitation Topic Code: IC
Solicitation Number: N/A
Small Business Information
1701 N. Collins Blvd., Suite 2200, Richardson, TX, 75080-3587
DUNS: 876919353
HUBZone Owned: N
Woman Owned: N
Socially and Economically Disadvantaged: N
Principal Investigator
 Christine Nezda
 (972) 680-0800
Business Contact
 Christine Nezda
Phone: (972) 680-0800
Research Institution
This Small Business Innovation Research (SBIR) Phase I project addresses a problem facing enterprises today: linking their disparate structured databases with unstructured text documents such as articles, manuals, reports, emails, blogs, and folksonomies. There is no easy way to perform a federated search over such diverse data sources, let alone to enable more intelligent applications, without experts spending considerable time and effort on system and data model customization. With the recent emergence of commercial-grade Resource Description Framework (RDF) triple stores, it becomes possible to merge massive amounts of structured and unstructured data by defining a common ontology model for the DBMS schemas and representing the structured content as semantic triples. Lymba proposes novel methods to transform unstructured data sources inside corporate firewalls into a consolidated RDF store, merge it with other ontologies and structured data, and offer a natural language question answering (QA) interface for easy use. To make the QA robust, an innovative hybrid approach is proposed that draws answers from the RDF store as well as directly from indexed text documents. The potential impact of delivering a question answering system that operates on a commercial-grade RDF store is significant: it fills a need for users of this store to access more information easily and to implement intelligent applications quickly, using natural language questions as the main vehicle. The proposal also leads to enabling software technology to advance the semantic web. If successfully deployed, the proposed research has the potential to translate into a viable commercial product with significant revenues.
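
The hybrid idea described in the abstract can be illustrated with a minimal sketch. It assumes a toy in-memory set of triples in place of a commercial RDF store and a simple inverted index in place of a production search index. All names and data here (`TRIPLES`, `DOCUMENTS`, `hybrid_answer`, the pump example) are hypothetical and do not come from the award record. The abstract does not say how the two answer sources are combined, so the fallback order used below (structured store first, text index second) is only one possible strategy.

```python
from collections import defaultdict

# Hypothetical toy data; none of it comes from the award record.
TRIPLES = {
    ("pump_a7", "manufactured_by", "Acme"),
    ("pump_a7", "located_in", "plant_2"),
}

DOCUMENTS = {
    "manual_01": "Pump A7 requires an oil change every 500 hours.",
    "email_17": "The plant 2 inspection is scheduled for Friday.",
}


def tokenize(text):
    """Lowercase the text and split it on non-alphanumeric characters."""
    cleaned = "".join(c.lower() if c.isalnum() else " " for c in text)
    return cleaned.split()


def build_index(documents):
    """Map each token to the set of document ids that contain it."""
    index = defaultdict(set)
    for doc_id, text in documents.items():
        for token in tokenize(text):
            index[token].add(doc_id)
    return index


def query_triples(triples, subject, predicate):
    """Return the sorted objects matching the pattern (subject, predicate, ?)."""
    return sorted(o for s, p, o in triples if s == subject and p == predicate)


def search_text(index, keywords):
    """Rank documents by the number of query keywords they contain."""
    scores = defaultdict(int)
    for keyword in keywords:
        for doc_id in index.get(keyword.lower(), ()):
            scores[doc_id] += 1
    # Highest score first; ties broken by document id for stable output.
    return sorted(scores, key=lambda d: (-scores[d], d))


def hybrid_answer(triples, index, subject, predicate, keywords):
    """Try the structured store first; fall back to the text index."""
    objects = query_triples(triples, subject, predicate)
    if objects:
        return ("rdf", objects)
    return ("text", search_text(index, keywords))
```

In this sketch, a question whose answer is stored as a triple (who manufactures pump A7) is answered from the structured side, while a question with no matching triple (how often the oil should be changed) falls through to the documents that best match its keywords. A real system would also need a step that maps a natural language question to the triple pattern and keywords, which is omitted here.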

* Information listed above is at the time of submission. *
