Distributed Relevance Ranking in Heterogeneous Document Collections

Award Information
Agency: Department of Energy
Branch: N/A
Contract: DE-FG02-03ER83822
Agency Tracking Number: 72223S03-I
Amount: $99,198.00
Phase: Phase I
Program: SBIR
Awards Year: 2003
Solicitation Year: N/A
Solicitation Topic Code: N/A
Solicitation Number: N/A
Small Business Information
Deep Web Technologies, Llc
154 Piedra Loop, Los Alamos, NM, 87544
HUBZone Owned: N
Woman Owned: N
Socially and Economically Disadvantaged: N
Principal Investigator
 Abe Lederman
 (505) 672-0007
Business Contact
 Abe Lederman
Phone: (505) 672-0007
Email: abe@deepwebtech.com
Research Institution
72223S03-I Given the large and ever-growing volume of scientific information spread throughout the Internet, a researcher with limited time needs help to determine the most relevant documents to review. No satisfactory tools exist to retrieve the most relevant documents across different collections. Therefore, this project will develop, test, and implement key components of a distributed approach for ranking the relevance of documents acquired from an in-depth search of multiple sources. Machine-learning heuristics will be introduced to minimize the processing required to find those best documents. Phase I will conduct experiments and demonstrate that the automated approach can find a greater number of relevant documents, and miss fewer important ones, compared to a human-based approach. The use of computational grids will be investigated as a framework for implementing a scalable and resource-intensive solution. Commercial Applications and Other Benefits as described by awardee: The relevance ranking system should have use in research divisions of companies with the need to do high quality, exhaustive document search and retrieval, especially where time-to-market is critical (e.g., in the pharmaceutical and oil and gas industries).

* information listed above is at the time of submission.

Agency Micro-sites

US Flag An Official Website of the United States Government