Automatic Unsupervised Recognition and Allocation of Speakers (AURAS)
Small Business Information
1235 South Clark Street, Suite 400, Arlington, VA, -
AbstractDevelopments in the availability of audio data and the methods of collecting it have only added to the problem of data overload for the DoD. Automated methods of analysis are the only way to process the necessary volume of audio data in a timely manner. The need exists both within the military and industry to harness these automated methods to rapidly and reliably cluster segments of audio by unique speaker. The proposed solution, Automatic Unsupervised Recognition and Allocation of Speakers (AURAS) advances the state-of-the-art in this area. Able to operate on a static collection of audio files or an audio stream, AURAS automatically groups segments of audio by unique speaker using their individual sound characteristics. Unlike less-advanced methods, AURAS requires no training data, and detects the number of speakers present in the data automatically. While AURAS is completely language-independent, in cases when the language being spoken is known, the system is able to leverage the words used by speakers to enhance its accuracy. Additionally, AURAS"learns as it analyzes,"and can therefore operate continuously, without the need for downtime to"re-learn"when it encounters a group of completely new speakers. In fact, performance improves the more new speakers it sees.
* information listed above is at the time of submission.