How to check for redundant variables?

How to check for redundant variables?

book

Article ID: KB0074284

calendar_today

Updated On:

Products Versions
Spotfire Statistica 12.0 and higher

Description

The Data Health Check node available under Statistica "Data" tab performs numerous detailed checks on each variable, and its value, value ranges, discrete text labels, missing data, outliers, ect.
For instance, the feature of checking variable redundancy can help you detect and automatically remove correlated variables instead of removing them manually.

Issue/Introduction

How to check for redundant variables?

Environment

Windows

Resolution

1. After the input data is added to the workspace, click "Data | Data Health Check" from Statistica menu

2. Click the wheel icon on top left of the node and click "Specifications | Quick" to select variables and other desired options

3. Click "Specifications | Redundancy" to configure criteria for checking redundancy .Note you can click the "?" on the top right corner of the dialog to access details about options corresponding to this section in the e-manual.

4. After all configuration is done, click "Ok" to return to the workspace

5. Click the green arrow icon on bottom left of the Data Health Check Summary Node to run the node.