An Integrated Suite of Text and Data Mining Tools for Program Managers
Small Business Information
4960 Peachtree Industrial Blvd, Norcross, GA, 30071
Director of R&D
Director of R&D
AbstractThis proposal describes an effort to build an integrated suite of tools for R&D Program Managers, incorporating text mining and data mining tools for information extraction and knowledge discovery from requirement sources and bibliographic databases of R&Dliterature. Successful program management depends in part on identifying and understanding requirements, discerning linkages among requirements (e.g., commonality, dependency, priority, etc.), and recognizing correspondence between program requirements andthe capabilities of available resources. Requirements take several forms, but of particular interest are large written documents, such as Strategic Plans and R&D Master Plans. Requirements may originate from databases of operating experience andmaintenance information. In either the database form or the resulting documents, mastery of these information sources presents a daunting challenge. The technologies of text and data mining have great potential for assisting Program Managers in theirtask of defining or understanding requirements from these very large data sources by identifying relationships among requirements and discovering connections between the requirements and other R&D activities reported in bibliographic databases. In PhaseI, we will 1) analyze requirements sources, 2) prepare a report on text and data mining techniques, 3) develop a software specification, and 4) demonstrate the feasibility by developing a demonstration prototype.Successful completion of all three phases ofthis program will result in a powerful suite of tools for text mining. Program Managers in large organizations (government and commercial) will be able to use these tools to extract knowledge from databases of operational and maintenance experience. Thisknowledge will assist the Program Manager in defining, articulating, and defending programmatic requirements. The suite of tools will also allow the manager to mine clusters of requirements from free text documents such as Requirements Documents, Scienceand Technology Master Plans, and Strategic Plans. These requirements clusters can then be used to mine open literature S&T bibliographic databases to identify centers of excellence and assess the qualifications of individuals and organizations submittingproposals. By cross-mining requirements documents and S&T literature, the manager can also find new relationships among technologies and applications that may provide leverage points for investment of R&D resources. By mining internal research plansagainst patent databases, managers can enhance their protection of an organization's intellectual property by assessing how their research agenda and product development plans compare with their competitor's patent strategy.
* information listed above is at the time of submission.