langModel

lmDecode

Description

The lmDecode action decodes recurrent neural network (RNN) scores using a specified language model. It's a crucial step in speech-to-text systems, refining the raw output of an acoustic model by applying linguistic rules and probabilities to generate more coherent and accurate text.

langModel.lmDecode { alpha=double, beta=double, blankLabel="string", casOut={casouttable}, columnMap={"string-1" , "string-2", ...}, copyVars={"variable-name-1" , "variable-name-2", ...}, langModelTable={castable}, maxPathSize=64-bit-integer, ngramsOrder=64-bit-integer, spaceLabel="string", table={castable} };
Settings
ParameterDescription
alpha Specifies a tunable parameter that controls the trade-off between the acoustic model and the language model. The larger 'alpha' is, the more the language model will count while ranking all candidates of the final guess.
beta Specifies a tunable parameter that controls the trade-off between the acoustic model and the length of the sentence. The larger 'beta' is, the more the length of the sentence will count while ranking all candidates of the final guess.
blankLabel Specifies the string used to indicate a 'blank' label.
casOut Specifies the output table to store the decoded results.
columnMap Specifies the labels that the score columns in each time frame represent, which must follow the same order.
copyVars Specifies the variables to transfer from the input table to the output table.
langModelTable Specifies the n-gram language model to use in the decoding process.
maxPathSize Specifies the maximum number of paths kept as candidates for final results during the decoding process.
ngramsOrder Specifies the highest order of n-grams to use during the decoding process.
spaceLabel Specifies the string used to indicate a 'space' label.
table Specifies the input table that contains the RNN scores returned by the acoustic model.

Examples

FAQ

What is the primary function of the lmDecode action?
What does the 'alpha' parameter control?
How does the 'beta' parameter influence the decoding process?
Which parameter specifies the input table for the language model?
What is the purpose of the 'maxPathSize' parameter?
How are the score columns in the input data mapped to their labels?
What is the 'blankLabel' parameter used for?