lmDecode - WeAreCAS

Q: What does the 'alpha' parameter control?

The 'alpha' parameter is a tunable parameter that manages the trade-off between the acoustic model and the language model. A larger alpha value gives more weight to the language model when ranking candidates. The default value is 1.

Q: How does the 'beta' parameter influence the decoding process?

The 'beta' parameter controls the trade-off between the acoustic model and the sentence length. A larger beta value gives more weight to the length of the sentence when ranking candidate results. The default value is 0.

Q: Which parameter specifies the input table for the language model?

The 'langModelTable' parameter is used to specify the n-gram language model table required for the decoding process.

Q: What is the purpose of the 'maxPathSize' parameter?

The 'maxPathSize' parameter specifies the maximum number of candidate paths to retain during the decoding process. Its default value is 100.

Q: How are the score columns in the input data mapped to their labels?

The 'columnMap' parameter is used to specify the labels that the score columns represent, and they must be in the same order.

Q: What is the 'blankLabel' parameter used for?

The 'blankLabel' parameter specifies the string that is used to indicate a 'blank' label in the RNN scores. The default value is "_".

Description

The lmDecode action decodes recurrent neural network (RNN) scores using a specified language model. It's a crucial step in speech-to-text systems, refining the raw output of an acoustic model by applying linguistic rules and probabilities to generate more coherent and accurate text.

langModel.lmDecode { alpha=double, beta=double, blankLabel="string", casOut={casouttable}, columnMap={"string-1" , "string-2", ...}, copyVars={"variable-name-1" , "variable-name-2", ...}, langModelTable={castable}, maxPathSize=64-bit-integer, ngramsOrder=64-bit-integer, spaceLabel="string", table={castable} };

Settings

Parameter	Description
alpha	Specifies a tunable parameter that controls the trade-off between the acoustic model and the language model. The larger 'alpha' is, the more the language model will count while ranking all candidates of the final guess.
beta	Specifies a tunable parameter that controls the trade-off between the acoustic model and the length of the sentence. The larger 'beta' is, the more the length of the sentence will count while ranking all candidates of the final guess.
blankLabel	Specifies the string used to indicate a 'blank' label.
casOut	Specifies the output table to store the decoded results.
columnMap	Specifies the labels that the score columns in each time frame represent, which must follow the same order.
copyVars	Specifies the variables to transfer from the input table to the output table.
langModelTable	Specifies the n-gram language model to use in the decoding process.
maxPathSize	Specifies the maximum number of paths kept as candidates for final results during the decoding process.
ngramsOrder	Specifies the highest order of n-grams to use during the decoding process.
spaceLabel	Specifies the string used to indicate a 'space' label.
table	Specifies the input table that contains the RNN scores returned by the acoustic model.

Examples

FAQ

What is the primary function of the lmDecode action?

What does the 'alpha' parameter control?

How does the 'beta' parameter influence the decoding process?

Which parameter specifies the input table for the language model?

What is the purpose of the 'maxPathSize' parameter?

How are the score columns in the input data mapped to their labels?

What is the 'blankLabel' parameter used for?

Actions associées

langModel

calculateErrorRate

The `calculateErrorRate` action compares reference (ground truth) transcripts...

langModel

IdentifySpeakers

Provides actions that are used for language models in speech-to-text systems....

langModel

lmImport

The lmImport action imports an n-gram language model from a CAS table into a ...

Table of Contents

Description

Examples

FAQ

Actions associées

calculateErrorRate

IdentifySpeakers

lmImport