calculateErrorRate

Description

The `calculateErrorRate` action compares reference (ground truth) transcripts with hypothesis (predicted) transcripts to calculate error rates at the character, word, and sentence levels. This is crucial for evaluating the performance of a speech-to-text model. It requires two input tables: one for the reference text and one for the hypothesis text, and it matches the transcripts based on their IDs.

proc cas; langModel.calculateErrorRate / table={...} reference={...} tableId="string" tableText="string" referenceId="string" referenceText="string"; run;

Settings

Parameter	Description
table	Specifies the input table that contains the hypothesis transcripts generated by the speech-to-text model.
reference	Specifies the input table that contains the ground truth (reference) transcripts.
tableId	Specifies the variable in the hypothesis table that contains the unique identifier for each transcript.
tableText	Specifies the variable in the hypothesis table that contains the transcribed text to be evaluated.
referenceId	Specifies the variable in the reference table that contains the unique identifier for each transcript.
referenceText	Specifies the variable in the reference table that contains the ground truth text.

Data Preparation View data prep sheet

Creating Reference and Hypothesis Data

First, we create two CAS tables. `reference_transcripts` holds the correct, or 'ground truth', text. `hypothesis_transcripts` holds the text generated by our speech-to-text model. Both tables include an ID to match the corresponding sentences.

Copied!

1	DATA mycas.reference_transcripts;
2	INFILE DATALINES dsd;
3	LENGTH id $ 10 text $ 100;
4	INPUT id $ text $;
5	DATALINES;
6	utt1,this is a sample sentence
7	utt2,another test for accuracy
8	;
9	RUN;
10
11	DATA mycas.hypothesis_transcripts;
12	INFILE DATALINES dsd;
13	LENGTH hyp_id $ 10 hyp_text $ 100;
14	INPUT hyp_id $ hyp_text $;
15	DATALINES;
16	utt1,this is a sample sentience
17	utt2,an other test for accuracy
18	;
19	RUN;

Examples

This example calculates the error rate by providing the reference and hypothesis tables. By default, the action assumes the first column is the ID and the second is the text for both tables.

SAS® / CAS Code Code awaiting community validation

Copied!

1	PROC CAS;
2	langModel.calculateErrorRate /
3	TABLE={name='hypothesis_transcripts'},
4	reference={name='reference_transcripts'};
5	RUN;

Result :
The action returns a result table summarizing the Word Error Rate (WER), Character Error Rate (CER), and Sentence Error Rate (SER), along with counts of substitutions, insertions, and deletions for both words and characters.

This example demonstrates how to specify the exact columns for IDs and text in both the hypothesis and reference tables, which is useful when tables have multiple columns or non-standard naming conventions.

SAS® / CAS Code Code awaiting community validation

Copied!

1	PROC CAS;
2	langModel.calculateErrorRate /
3	TABLE={name='hypothesis_transcripts'},
4	reference={name='reference_transcripts'},
5	tableId='hyp_id',
6	tableText='hyp_text',
7	referenceId='id',
8	referenceText='text';
9	RUN;

Result :
The output is a comprehensive report detailing the error rates. It includes overall statistics (WER, CER, SER) and a breakdown of errors (substitutions, deletions, insertions) for each transcript pair, allowing for a granular analysis of the model's performance.

Associated Scenarios

Use Case

Validation of Medical Dictation Accuracy

A hospital is evaluating a new Speech-to-Text model for transcribing doctor's notes. The goal is to compare the model's output against manually verified transcripts to ensure th...

View scenario

Use Case

High Volume Customer Support Analytics

A telecommunications company processes 10,000 customer support calls per hour. The data science team needs to ensure the `calculateErrorRate` action can handle batch processing ...

View scenario

Use Case

Robustness to Missing Data and ID Mismatches

In a real-world pipeline, audio files sometimes fail to process, or metadata gets corrupted. This test simulates a 'dirty' dataset where some hypothesis IDs are missing (audio f...

View scenario

Actions associées

langModel

IdentifySpeakers

Provides actions that are used for language models in speech-to-text systems....

langModel

lmDecode

The lmDecode action decodes recurrent neural network (RNN) scores using a spe...

langModel

lmImport

The lmImport action imports an n-gram language model from a CAS table into a ...

Table of Contents

Description

Creating Reference and Hypothesis Data

Examples

Basic Error Rate Calculation

Detailed Error Rate Calculation with Specific Columns

Associated Scenarios

Use Case

Validation of Medical Dictation Accuracy

Use Case

High Volume Customer Support Analytics

Use Case

Robustness to Missing Data and ID Mismatches

Actions associées

IdentifySpeakers

lmDecode

lmImport