EngineTranscribeSettings
Settings specific for Amazon Transcribe as the live transcription engine.
If you specify an invalid combination of parameters, a TranscriptFailed
event will be sent with the contents of the
BadRequestException
generated by Amazon Transcribe. For more information on each parameter and which combinations are valid, refer to the
StartStreamTranscription API in the
Amazon Transcribe Developer Guide.
Contents
- ContentIdentificationType
-
Labels all personally identifiable information (PII) identified in your transcript. If you don't include
PiiEntityTypes
, all PII is identified.Note
You can’t set
ContentIdentificationType
andContentRedactionType
.Type: String
Valid Values:
PII
Required: No
- ContentRedactionType
-
Content redaction is performed at the segment level. If you don't include
PiiEntityTypes
, all PII is redacted.Note
You can’t set
ContentRedactionType
andContentIdentificationType
.Type: String
Valid Values:
PII
Required: No
- EnablePartialResultsStabilization
-
Enables partial result stabilization for your transcription. Partial result stabilization can reduce latency in your output, but may impact accuracy.
Type: Boolean
Required: No
- IdentifyLanguage
-
Enables automatic language identification for your transcription.
If you include
IdentifyLanguage
, you can optionally useLanguageOptions
to include a list of language codes that you think may be present in your audio stream. Including language options can improve transcription accuracy.You can also use
PreferredLanguage
to include a preferred language. Doing so can help Amazon Transcribe identify the language faster.You must include either
LanguageCode
orIdentifyLanguage
.Language identification can't be combined with custom language models or redaction.
Type: Boolean
Required: No
- LanguageCode
-
Specify the language code that represents the language spoken.
If you're unsure of the language spoken in your audio, consider using
IdentifyLanguage
to enable automatic language identification.Type: String
Valid Values:
en-US | en-GB | es-US | fr-CA | fr-FR | en-AU | it-IT | de-DE | pt-BR | ja-JP | ko-KR | zh-CN | th-TH | hi-IN
Required: No
- LanguageModelName
-
Specify the name of the custom language model that you want to use when processing your transcription. Note that language model names are case sensitive.
The language of the specified language model must match the language code. If the languages don't match, the custom language model isn't applied. There are no errors or warnings associated with a language mismatch.
If you use Amazon Transcribe in multiple Regions, the custom language model must be available in Amazon Transcribe in each Region.
Type: String
Length Constraints: Minimum length of 1. Maximum length of 200.
Pattern:
^[0-9a-zA-Z._-]+
Required: No
- LanguageOptions
-
Specify two or more language codes that represent the languages you think may be present in your media; including more than five is not recommended. If you're unsure what languages are present, do not include this parameter.
Including language options can improve the accuracy of language identification.
If you include
LanguageOptions
, you must also includeIdentifyLanguage
.Important
You can only include one language dialect per language. For example, you cannot include
en-US
anden-AU
.Type: String
Length Constraints: Minimum length of 1. Maximum length of 200.
Pattern:
^[a-zA-Z-,]+
Required: No
- PartialResultsStability
-
Specify the level of stability to use when you enable partial results stabilization (
EnablePartialResultsStabilization
).Low stability provides the highest accuracy. High stability transcribes faster, but with slightly lower accuracy.
Type: String
Valid Values:
low | medium | high
Required: No
- PiiEntityTypes
-
Specify which types of personally identifiable information (PII) you want to redact in your transcript. You can include as many types as you'd like, or you can select
ALL
.Values must be comma-separated and can include:
ADDRESS
,BANK_ACCOUNT_NUMBER
,BANK_ROUTING
,CREDIT_DEBIT_CVV
,CREDIT_DEBIT_EXPIRY
CREDIT_DEBIT_NUMBER
,EMAIL
,NAME
,PHONE
,PIN
,SSN
, orALL
.Note that if you include
PiiEntityTypes
, you must also includeContentIdentificationType
orContentRedactionType
.If you include
ContentRedactionType
orContentIdentificationType
, but do not include PiiEntityTypes, all PII is redacted or identified.Type: String
Length Constraints: Minimum length of 1. Maximum length of 300.
Pattern:
^[A-Z_, ]+
Required: No
- PreferredLanguage
-
Specify a preferred language from the subset of languages codes you specified in
LanguageOptions
.You can only use this parameter if you include
IdentifyLanguage
andLanguageOptions
.Type: String
Valid Values:
en-US | en-GB | es-US | fr-CA | fr-FR | en-AU | it-IT | de-DE | pt-BR | ja-JP | ko-KR | zh-CN | th-TH | hi-IN
Required: No
- Region
-
The AWS Region in which to use Amazon Transcribe.
If you don't specify a Region, then the MediaRegion of the meeting is used. However, if Amazon Transcribe is not available in the
MediaRegion
, then aTranscriptFailed
event is sent.Use
auto
to use Amazon Transcribe in a Region near the meeting’sMediaRegion
. For more information, refer to Choosing a transcription Region in the Amazon Chime SDK Developer Guide.Type: String
Valid Values:
us-east-2 | us-east-1 | us-west-2 | ap-northeast-2 | ap-southeast-2 | ap-northeast-1 | ca-central-1 | eu-central-1 | eu-west-1 | eu-west-2 | sa-east-1 | auto | us-gov-west-1
Required: No
- VocabularyFilterMethod
-
Specify how you want your vocabulary filter applied to your transcript.
To replace words with
***
, choosemask
.To delete words, choose
remove
.To flag words without changing them, choose
tag
.Type: String
Valid Values:
remove | mask | tag
Required: No
- VocabularyFilterName
-
Specify the name of the custom vocabulary filter that you want to use when processing your transcription. Note that vocabulary filter names are case sensitive.
If you use Amazon Transcribe in multiple Regions, the vocabulary filter must be available in Amazon Transcribe in each Region.
If you include
IdentifyLanguage
and want to use one or more vocabulary filters with your transcription, use theVocabularyFilterNames
parameter instead.Type: String
Length Constraints: Maximum length of 4096.
Required: No
- VocabularyFilterNames
-
Specify the names of the custom vocabulary filters that you want to use when processing your transcription. Note that vocabulary filter names are case sensitive.
If you use Amazon Transcribe in multiple Regions, the vocabulary filter must be available in Amazon Transcribe in each Region.
If you're not including
IdentifyLanguage
and want to use a custom vocabulary filter with your transcription, use theVocabularyFilterName
parameter instead.Type: String
Length Constraints: Minimum length of 1. Maximum length of 3000.
Pattern:
^[a-zA-Z0-9,-._]+
Required: No
- VocabularyName
-
Specify the name of the custom vocabulary that you want to use when processing your transcription. Note that vocabulary names are case sensitive.
If you use Amazon Transcribe multiple Regions, the vocabulary must be available in Amazon Transcribe in each Region.
If you include
IdentifyLanguage
and want to use one or more custom vocabularies with your transcription, use theVocabularyNames
parameter instead.Type: String
Length Constraints: Maximum length of 4096.
Required: No
- VocabularyNames
-
Specify the names of the custom vocabularies that you want to use when processing your transcription. Note that vocabulary names are case sensitive.
If you use Amazon Transcribe in multiple Regions, the vocabulary must be available in Amazon Transcribe in each Region.
If you don't include
IdentifyLanguage
and want to use a custom vocabulary with your transcription, use theVocabularyName
parameter instead.Type: String
Length Constraints: Minimum length of 1. Maximum length of 3000.
Pattern:
^[a-zA-Z0-9,-._]+
Required: No
See Also
For more information about using this API in one of the language-specific AWS SDKs, see the following: