Skip to content

Latest commit

 

History

History
22 lines (17 loc) · 2.5 KB

File metadata and controls

22 lines (17 loc) · 2.5 KB

Application features

This solution will perform automatic speech recognition on your audio files with a fully-automated workflow, which is initiated when an audio file is delivered to a configured Amazon S3 bucket. After just a few minutes the transcription will be produced, which is available in another Amazon S3 bucket and can be accessed by your preferred business intelligence (BI) solution. Please see the Application architecture section for full details on the processing steps involved in this workflow.

A user interface is provided, allowing you to easily visualize a single call and perform some level of searching and filtering across all of the calls in the system. It is expected that customers will use Amazon Quicksight or their preferred BI solution, with this application's user interface being used to allow users to easily read the transcripts, playback the calls and review the other analytics datapoints in a particular call.

The following table outlines the major features of this application.

Feature Description
Languages Support all Amazon Transcribe languages and dialects 
Transcript Generates a speaker-diarised transcript, with word confidence scores per word
Audio formats Handles mono or stereo audio files automatically in any format supported  by Amazon Transcribe [1]
Sentiment Performs sentiment analysis across each line of the call, and also generates sentiment trends across the call for each speaker
Filename Metadata Ability to extract call-related metadata, such as the call time or an agent identifier, from the audio filename via regular expressions
Audio playback Ability to play back the audio from any part of the call transcript, as well as scrub forwards and backwards on demand
Accuracy Enhance the accuracy of your transcripts through the provision of custom vocabulary files
Redaction Supports text redaction based on defined vocabulary filters, as well as redacting any personally identifiable information
Entity detection Searches through and tags your transcript with detected entities
Analytics Provides additional analytics data through the optional use of Amazon Transcribe's Call Analytics feature

[1] Stereo, channel-separated audio files are preferred, as that makes available the highest quality analytics from the AWS Language Services