This solution will perform automatic speech recognition on your audio files with a fully-automated workflow, which is initiated when an audio file is delivered to a configured Amazon S3 bucket. After just a few minutes the transcription will be produced, which is available in another Amazon S3 bucket and can be accessed by your preferred business intelligence (BI) solution. Please see the Application architecture section for full details on the processing steps involved in this workflow.
A user interface is provided, allowing you to easily visualize a single call and perform some level of searching and filtering across all of the calls in the system. It is expected that customers will use Amazon Quicksight or their preferred BI solution, with this application's user interface being used to allow users to easily read the transcripts, playback the calls and review the other analytics datapoints in a particular call.
The following table outlines the major features of this application.
Feature | Description |
---|---|
Languages | Support all Amazon Transcribe languages and dialects |
Transcript | Generates a speaker-diarised transcript, with word confidence scores per word |
Audio formats | Handles mono or stereo audio files automatically in any format supported by Amazon Transcribe [1] |
Sentiment | Performs sentiment analysis across each line of the call, and also generates sentiment trends across the call for each speaker |
Filename Metadata | Ability to extract call-related metadata, such as the call time or an agent identifier, from the audio filename via regular expressions |
Audio playback | Ability to play back the audio from any part of the call transcript, as well as scrub forwards and backwards on demand |
Accuracy | Enhance the accuracy of your transcripts through the provision of custom vocabulary files |
Redaction | Supports text redaction based on defined vocabulary filters, as well as redacting any personally identifiable information |
Entity detection | Searches through and tags your transcript with detected entities |
Analytics | Provides additional analytics data through the optional use of Amazon Transcribe's Call Analytics feature |
[1] Stereo, channel-separated audio files are preferred, as that makes available the highest quality analytics from the AWS Language Services