You are here

SBIR Phase II: Xtractica - A System for Extracting Coherent Data from Documents

Award Information
Agency: National Science Foundation
Branch: N/A
Contract: 0238863
Agency Tracking Number: 0238863
Amount: $0.00
Phase: Phase I
Program: SBIR
Solicitation Topic Code: N/A
Solicitation Number: N/A
Timeline
Solicitation Year: N/A
Award Year: 2003
Award Start Date (Proposal Award Date): N/A
Award End Date (Contract End Date): N/A
Small Business Information
25 East Loop Road
Stony Brook, NY 11790
United States
DUNS: N/A
HUBZone Owned: No
Woman Owned: No
Socially and Economically Disadvantaged: No
Principal Investigator
 Tatyana Vidrevich
 () -
Business Contact
Phone: () -
Research Institution
N/A
Abstract

This Small Business Innovation Research Phase II project will implement a software system that allows domain experts to specify programs that transform unstructured or partially structured data from a variety of document sources, such as World Wide Web sites, PDF files, and text into structured, coherent, and readily usable information. The system will consist of a set of tightly integrated syntactic and semantics-driven data extraction technologies that are managed from a graphical user interface. The goal will be to retrieve information that was created for human understandability, and work with it to create knowledge that can support automated decision-making and transactions. The system will empower users, who are knowledgeable about their application domains but are not necessarily trained as computing technologists, to rapidly structure data into knowledge. The Phase II implementation effort will build upon the results from the Phase I feasibility study to produce a fully functional system.

Phase III will make the system commercially available to clients with diverse business interests including content aggregation, e-procurement, ERP, and supply chain management vendors.

* Information listed above is at the time of submission. *

US Flag An Official Website of the United States Government