langModel

lmDecode

Description

The lmDecode action decodes recurrent neural network (RNN) scores using a specified language model. It's a crucial step in speech-to-text systems, refining the raw output of an acoustic model by applying linguistic rules and probabilities to generate more coherent and accurate text.

langModel.lmDecode { alpha=double, beta=double, blankLabel="string", casOut={casouttable}, columnMap={"string-1" , "string-2", ...}, copyVars={"variable-name-1" , "variable-name-2", ...}, langModelTable={castable}, maxPathSize=64-bit-integer, ngramsOrder=64-bit-integer, spaceLabel="string", table={castable} };
Settings
ParameterDescription
alphaSpecifies a tunable parameter that controls the trade-off between the acoustic model and the language model. The larger 'alpha' is, the more the language model will count while ranking all candidates of the final guess.
betaSpecifies a tunable parameter that controls the trade-off between the acoustic model and the length of the sentence. The larger 'beta' is, the more the length of the sentence will count while ranking all candidates of the final guess.
blankLabelSpecifies the string used to indicate a 'blank' label.
casOutSpecifies the output table to store the decoded results.
columnMapSpecifies the labels that the score columns in each time frame represent, which must follow the same order.
copyVarsSpecifies the variables to transfer from the input table to the output table.
langModelTableSpecifies the n-gram language model to use in the decoding process.
maxPathSizeSpecifies the maximum number of paths kept as candidates for final results during the decoding process.
ngramsOrderSpecifies the highest order of n-grams to use during the decoding process.
spaceLabelSpecifies the string used to indicate a 'space' label.
tableSpecifies the input table that contains the RNN scores returned by the acoustic model.

Examples

FAQ

What is the primary function of the lmDecode action?
What does the 'alpha' parameter control?
How does the 'beta' parameter influence the decoding process?
Which parameter specifies the input table for the language model?
What is the purpose of the 'maxPathSize' parameter?
How are the score columns in the input data mapped to their labels?
What is the 'blankLabel' parameter used for?