bartScoreMargin - WeAreCAS

Q: What is the purpose of the bart.bartScoreMargin action in SAS Viya?

The bart.bartScoreMargin action computes predictive margins by using a fitted Bayesian additive regression trees (BART) model.

Q: What is the 'model' parameter used for in the bartScoreMargin action?

The 'model' parameter is required and specifies a binary table object that was created from a previous model fitting, which contains the BART model to be used for scoring.

Q: How do I specify the input data for the bartScoreMargin action?

You must use the 'table' parameter to specify the input data table that you want to compute predictive margins for.

Q: What is a predictive margin and how is it defined in this action?

A predictive margin is a statistical measure. In this action, you define it using the 'margins' parameter (alias: 'scenarios'). Each margin is a scenario where you can specify variables to modify and the values they should be set to, using the 'at' subparameter.

Q: How can I compute the difference between two predictive margins?

You can compute the difference between two predictive margins by using the 'differences' parameter (alias: 'diffs'). You need to specify the 'evtMargin' (event margin) and 'refMargin' (reference margin) by their names, which you defined in the 'margins' parameter.

Q: What does the 'alpha' parameter control?

The 'alpha' parameter specifies the significance level for constructing equal-tail credible limits for the computed margins. The default value is 0.05.

At a glance

Deciphering the inner workings of sophisticated algorithms is essential for establishing trust in automated decisions. The bartScoreMargin action provides analysts with a powerful mechanism to conduct "what-if" scenarios by computing predictive margins from fitted Bayesian Additive Regression Tree models. By systematically holding selected variables constant across the dataset, users can evaluate the pure average effect of specific inputs, effectively stripping away confounding factors. To facilitate your use of these interpretability techniques, we have compiled a comprehensive FAQ section addressing syntax implementation, variable selection, and result interpretation.

Description

The bartScoreMargin action computes predictive margins by using a fitted Bayesian additive regression trees (BART) model. Predictive margins are predictions from a model at fixed values of some predictors, averaged over the distribution of the other predictors. This technique is useful for understanding the effect of a specific predictor on the outcome, while accounting for the influence of other variables in the model.

bart.bartScoreMargin { alpha=double, casOut={casouttable}, differences={{bartScoreMargin_scoreDiff-1} <, {bartScoreMargin_scoreDiff-2}, ...>}, display={displayTables}, marginInfo=TRUE | FALSE, margins={{bartScoreMargin_evaluate-1} <, {bartScoreMargin_evaluate-2}, ...>}, model={castable}, outputTables={outputTables}, seed=64-bit-integer, table={castable} };

Settings

Parameter	Description
alpha	Specifies the significance level for constructing equal-tail credible limits. The default is 0.05.
casOut	Specifies the output data table to store the computed predictive margins.
differences	Specifies a list of differences between predictive margins to compute. Each difference is defined by a reference margin (refMargin) and an event margin (evtMargin).
display	Specifies which result tables to display. By default, all tables are displayed.
marginInfo	When set to TRUE, requests a summary table of the variables and their values that define each predictive margin.
margins	Specifies one or more predictive margins to compute. Each margin is defined by a name and a set of variable values ('at' subparameter).
model	Specifies the CAS table that contains the fitted BART model information, saved from a previous call to the bartGauss or bartProbit action.
outputTables	Specifies which result tables to save as CAS tables.
seed	Specifies the seed for the pseudorandom number generator to ensure reproducibility.
table	Specifies the input data table used for computing the predictive margins. This is typically the same data used to train the model.

Data Preparation View data prep sheet

Data Creation

This example first creates a sample dataset 'getStarted' with a binary outcome 'y' and several predictor variables. Then, it fits a Bayesian additive regression trees model for a binary outcome using the `bartProbit` action and saves the model to a CAS table named 'my_bart_model'. This saved model is required for the `bartScoreMargin` action.

Copied!

1	DATA mycas.getStarted;
2	call streaminit(123);
3	DO i = 1 to 100;
4	x1 = rand('UNIFORM');
5	x2 = rand('UNIFORM');
6	x3 = rand('UNIFORM');
7	IF (i <= 50) THEN x4 = 'A'; ELSE x4 = 'B';
8	p = 1 / (1 + exp(-(x1 - 0.5x2 + 0.2x3)));
9	y = rand('BERNOULLI', p);
10	OUTPUT;
11	END;
12	RUN;
13
14	PROC CAS;
15	bart.bartProbit TABLE={name='getStarted'},
16	model={depVars={{name='y', levelType='BINARY'}},
17	effects={{vars={'x1', 'x2', 'x3', 'x4'}}}},
18	store={name='my_bart_model', replace=true};
19	QUIT;

Examples

This example computes a single predictive margin named 'margin_x1_high'. It evaluates the model's prediction when the predictor 'x1' is fixed at a high value (0.9), while averaging over the observed values of all other predictors in the 'getStarted' table.

SAS® / CAS Code Code awaiting community validation

Copied!

1	PROC CAS;
2	bart.bartScoreMargin
3	TABLE='getStarted',
4	model='my_bart_model',
5	margins={{
6	name='margin_x1_high',
7	at={{var='x1', value=0.9}}
8	}};
9	QUIT;

Result :
The action produces a 'Margins' table showing the posterior mean, standard deviation, and credible interval for the predicted probability when x1 is 0.9.

This example calculates and compares predictive margins for the two levels of the categorical variable 'x4'. It defines two margins, 'margin_x4_A' and 'margin_x4_B', and then computes the difference between them, named 'diff_A_vs_B'. This allows for quantifying the effect of changing 'x4' from 'A' to 'B' on the predicted outcome.

SAS® / CAS Code Code awaiting community validation

Copied!

1	PROC CAS;
2	bart.bartScoreMargin
3	TABLE='getStarted',
4	model='my_bart_model',
5	seed=1234,
6	alpha=0.1,
7	margins={{
8	name='margin_x4_A',
9	label='Margin for x4=A',
10	at={{var='x4', value='A'}}
11	},
12	{
13	name='margin_x4_B',
14	label='Margin for x4=B',
15	at={{var='x4', value='B'}}
16	}},
17	differences={{
18	name='diff_A_vs_B',
19	label='Difference (A vs B)',
20	refMargin='margin_x4_B',
21	evtMargin='margin_x4_A'
22	}},
23	marginInfo=true,
24	casOut={name='scored_margins', replace=true};
25	QUIT;

Result :
The results will include three tables: 'MarginInfo' describing the defined margins, 'Margins' with the posterior summaries for each margin, and 'MarginDifferences' with the posterior summary for the difference 'diff_A_vs_B'. An output CAS table named 'scored_margins' will also be created containing the detailed posterior samples for each margin.

FAQ

What is the purpose of the bart.bartScoreMargin action in SAS Viya?

What is the 'model' parameter used for in the bartScoreMargin action?

How do I specify the input data for the bartScoreMargin action?

What is a predictive margin and how is it defined in this action?

How can I compute the difference between two predictive margins?

What does the 'alpha' parameter control?

Associated Scenarios

Use Case

Standard Case: Analyzing Marketing Campaign Effectiveness

A retail company has launched a targeted marketing campaign and wants to measure the impact of offering a discount (15% vs. 0%) and the marketing channel used (Email vs. SMS) on...

View scenario

Use Case

Performance/Volume Case: Analyzing Large-Scale Clinical Trial Data

A pharmaceutical company is analyzing data from a large-scale clinical trial (500,000 patients) for a new drug. They need to understand how different dosage levels (50mg, 100mg,...

View scenario

Use Case

Edge Case: Scoring with Missing Values and Unseen Factor Levels

A utility company uses a model to predict power grid failures. The model is trained on a complete dataset. However, the real-time scoring data sometimes contains missing sensor ...

View scenario

Actions associées

bart

bartGauss

The bartGauss action fits Bayesian additive regression trees (BART) models fo...

bart

bartProbit

The bartProbit action fits a probit Bayesian Additive Regression Trees (BART)...

bart

bartScore

The bartScore action scores a data table using a previously fitted Bayesian a...

Table of Contents

At a glance

Description

Data Creation

Examples

Basic Predictive Margin Calculation

Comparing Predictive Margins for a Categorical Variable

FAQ

Associated Scenarios

Use Case

Standard Case: Analyzing Marketing Campaign Effectiveness

Use Case

Performance/Volume Case: Analyzing Large-Scale Clinical Trial Data

Use Case

Edge Case: Scoring with Missing Values and Unseen Factor Levels

Actions associées

bartGauss

bartProbit

bartScore