eig - WeAreCAS

Q: What is the primary purpose of the eig action?

The eig action is used to extract principal components by using the eigenvalue decomposition method.

Q: How can I calculate principal components using the covariance matrix instead of the correlation matrix?

You can set the "cov" parameter to TRUE. By default, it is set to FALSE.

Q: How do I specify the number of principal components to compute?

Use the "n" parameter to specify the number of principal components. If you set the value to 0, all principal components are computed.

Q: Can I omit the intercept from the model?

Yes, you can omit the intercept by setting the "noInt" parameter to TRUE.

Q: How can I perform a weighted analysis of the data?

You can specify a numeric variable to use as a weight by using the "weight" parameter.

Q: Is it possible to accelerate the computation using a GPU?

Yes, you can enable GPU computation by setting the "enable" subparameter within the "gpu" parameter to TRUE. Note that if you specify the "groupBy" subparameter in the "table" parameter, the "gpu" parameter is ignored.

Q: How do I save the model fit information for future scoring?

You can use the "store" parameter to specify an output table that will contain the model fit information.

Q: What does the "singular" parameter control?

The "singular" parameter specifies the singularity criterion, which ranges from 0 to 1. The default value is 1E-08.

Q: How can I obtain standard deviations and eigenvalues in an output table?

You can use the "outStat" parameter to specify an output table that contains various statistics, including means, standard deviations, eigenvalues, and eigenvectors.

Codes SAS Liés

Seamless Integration: How to Attach Data Step Outputs Directly to SAS Viya Jobs

Logic Traps: Mastering Operator Precedence and Parentheses in Complex SAS Filters

Variable Mismatches? No Problem. How to Standardize Metadata with ATTRIB and PROC SQL

The 'Many-to-Many' Trap: Handling Duplicate Keys and Missing Values in SAS Merges

Read Less, Run Faster: Optimizing SAS Performance with KEEP, WHERE, and OBS

Beyond PROC FREQ: Generating High-Performance Frequency Tables with SAS Viya Actions

High-Speed Aggregation: How to Filter and Group Data Instantly with SAS Viya

SAS Viya to Excel: Automating Professional Reports with ODS EXCEL and CAS

Forget PROC IMPORT: The Fast Way to Upload Local CSVs to SAS Cloud

Folders Are Dead: Understanding the Flat Architecture of the SAS Viya Files Service

Description

The `eig` action performs Principal Component Analysis (PCA) using the eigenvalue decomposition method. It is a fundamental statistical technique used for dimensionality reduction and data exploration. By analyzing the covariance or correlation matrix of numeric variables, it calculates eigenvalues and eigenvectors to transform the original correlated variables into a smaller set of uncorrelated variables called principal components. This action supports weighting, frequency variables, and can generate output tables containing component scores and statistical summaries.

pca.eig <result=results> <status=rc> / attributes={{format="string", label="string", name="variable-name", ...}, ...} code={casOut={...}} cov=TRUE | FALSE display={caseSensitive=TRUE|FALSE, exclude=TRUE|FALSE, names={"string-1", ...}, ...} freq="variable-name" gpu={enable=TRUE | FALSE} groupbyLimit=64-bit-integer inputs={{name="variable-name", ...}, ...} n=integer noInt=TRUE | FALSE outStat={casOut={name="table-name", ...}, rPrefix="string"} output={casOut={name="table-name", ...}, copyVars={"variable-name", ...}, residual="string", score="string"} outputTables={names={"string-1", ...}, replace=TRUE|FALSE} partial={"variable-name-1", ...} prefix="string" singular=double std=TRUE | FALSE store={name="table-name", ...} table={name="table-name", caslib="string", ...} varDef="DF" | "N" | "WDF" | "WEIGHT" | "WGT" weight="variable-name";

Settings

Parameter	Description
table	Specifies the settings for the input CAS table to be analyzed.
inputs	Specifies the list of numeric variables to use for the analysis. If omitted, all numeric variables are used.
n	Specifies the number of principal components to be computed. If set to 0, all components are computed.
cov	If set to TRUE, computes the principal components from the covariance matrix. If FALSE (default), the correlation matrix is used.
std	If set to TRUE, standardizes the principal component scores in the output table to unit variance.
output	Specifies the output table to contain observation-wise statistics, such as component scores.
outStat	Specifies the output table to contain statistics like means, standard deviations, eigenvalues, and eigenvectors.
noInt	If set to TRUE, suppresses the intercept (fits the model through the origin).
prefix	Specifies a prefix string for naming the principal component variables (default is 'Prin').
freq	Specifies a numeric variable that contains the frequency of occurrence for each observation.
weight	Specifies a numeric variable to use as a weight for performing a weighted analysis.
code	Generates SAS DATA step code to compute predicted values (scores) based on the fitted model.
store	Saves the model fit information to a CAS table (analytic store) for use in scoring.

Data Preparation View data prep sheet

Data Preparation

Loads the sample 'Iris' dataset into a CAS table named 'iris' in the 'casuser' library.

Copied!

1	PROC CAS;
2	/* Load SASHELP.IRIS into CAS memory */
3	DATA casuser.iris;
4	SET sashelp.iris;
5	RUN;
6	QUIT;

Examples

Performs a standard Principal Component Analysis on the Iris dataset to extract the top 2 components based on the correlation matrix.

SAS® / CAS Code Code awaiting community validation

Copied!

1	PROC CAS;
2	pca.eig /
3	TABLE={name="iris", caslib="casuser"}
4	inputs={"SepalLength", "SepalWidth", "PetalLength", "PetalWidth"}
5	n=2;
6	RUN;

Result :
The action returns the Eigenvalues table (showing variance explained) and Eigenvectors table for the first 2 components.

Performs PCA using the covariance matrix, standardizes the output scores, creates specific output tables for statistics and scores, and saves the scoring code.

SAS® / CAS Code Code awaiting community validation

Copied!

1	PROC CAS;
2	pca.eig /
3	TABLE={name="iris", caslib="casuser"}
4	/* Use specific numeric inputs */
5	inputs={"SepalLength", "SepalWidth", "PetalLength", "PetalWidth"}
6	/* Use Covariance matrix instead of Correlation */
7	cov=true
8	/* Standardize scores to unit variance */
9	std=true
10	/* Custom prefix for component names */
11	prefix="PC"
12	/* Output table for Eigenvalues/Vectors */
13	outStat={casOut={name="eigen_stats", caslib="casuser", replace=true}}
14	/* Output table for Scores, copying the Species variable */
15	OUTPUT={casOut={name="iris_scores", caslib="casuser", replace=true},
16	score="Score",
17	copyVars={"Species"}}
18	/* Generate scoring code */
19	code={casOut={name="score_code", caslib="casuser", replace=true}};
20	RUN;

Result :
Generates 'eigen_stats' table with statistical summaries and 'iris_scores' table containing the original 'Species' column and new 'Score1', 'Score2', etc. columns. Also creates 'score_code' containing DATA step logic.

FAQ

What is the primary purpose of the eig action?

How can I calculate principal components using the covariance matrix instead of the correlation matrix?

How do I specify the number of principal components to compute?

Can I omit the intercept from the model?

How can I perform a weighted analysis of the data?

Is it possible to accelerate the computation using a GPU?

How do I save the model fit information for future scoring?

What does the "singular" parameter control?

How can I obtain standard deviations and eigenvalues in an output table?

Associated Scenarios

Use Case

Standard Customer Behavior Dimensionality Reduction

A retail bank wants to segment its customer base for a new credit card offer. They have multiple correlated variables related to spending habits (groceries, travel, entertainmen...

View scenario

Use Case

High-Volume Sensor Analysis with Weighting and Covariance

A manufacturing plant monitors heavy machinery using dozens of sensors. They need to analyze the raw variance (Covariance) rather than correlation, because the magnitude of vibr...

View scenario

Use Case

Genomic Data with Singularity and Origin Forcing

Researchers are analyzing gene expression data that has been pre-normalized to be centered around zero. They want to perform PCA without an intercept (forcing the model through ...

View scenario

Table of Contents

Seamless Integration: How to Attach Data Step Outputs Directly to SAS Viya Jobs

Logic Traps: Mastering Operator Precedence and Parentheses in Complex SAS Filters

Variable Mismatches? No Problem. How to Standardize Metadata with ATTRIB and PROC SQL

The 'Many-to-Many' Trap: Handling Duplicate Keys and Missing Values in SAS Merges

Read Less, Run Faster: Optimizing SAS Performance with KEEP, WHERE, and OBS

Beyond PROC FREQ: Generating High-Performance Frequency Tables with SAS Viya Actions

High-Speed Aggregation: How to Filter and Group Data Instantly with SAS Viya

SAS Viya to Excel: Automating Professional Reports with ODS EXCEL and CAS

Forget PROC IMPORT: The Fast Way to Upload Local CSVs to SAS Cloud

Folders Are Dead: Understanding the Flat Architecture of the SAS Viya Files Service

Description

Data Preparation

Examples

Basic PCA Analysis

Covariance-based PCA with Output Statistics

FAQ

Associated Scenarios

Use Case

Standard Customer Behavior Dimensionality Reduction

Use Case

High-Volume Sensor Analysis with Weighting and Covariance

Use Case

Genomic Data with Singularity and Origin Forcing