Products | Versions |
---|---|
Spotfire Statistica | 12.0 and higher |
The Data Health Check node, available under the Statistica "Data" tab, performs numerous detailed checks on each variable: its values, value ranges, discrete text labels, missing data, outliers, etc.
For instance, its variable redundancy check can detect and automatically remove correlated variables, so you do not have to remove them manually.
1. After the input data is added to the workspace, click "Data | Data Health Check" from the Statistica menu.
2. Click the wheel icon on the top left of the node and click "Specifications | Quick" to select variables and other desired options.
3. Click "Specifications | Redundancy" to configure the criteria for checking redundancy. Note that you can click the "?" in the top right corner of the dialog to access details about the options in this section in the e-manual.
4. After all configuration is done, click "OK" to return to the workspace.
5. Click the green arrow icon on the bottom left of the Data Health Check Summary node to run the node.
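
To give a rough idea of what a correlation-based redundancy check does, here is a minimal Python sketch. It is not Statistica's implementation, and the criteria available in the "Redundancy" dialog may differ. The function names `pearson` and `drop_redundant`, the 0.95 cutoff, and the sample data are all assumptions made for illustration.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        # Correlation is undefined for a constant column; this sketch treats it as 0.
        return 0.0
    return cov / sqrt(var_x * var_y)


def drop_redundant(columns, threshold=0.95):
    """Return the names of the columns to keep.

    A column is dropped when its absolute correlation with any column that
    has already been kept reaches `threshold` (a hypothetical cutoff).
    """
    kept = []
    for name, values in columns.items():
        if all(abs(pearson(values, columns[k])) < threshold for k in kept):
            kept.append(name)
    return kept


# Made-up sample data: height_in is almost a linear copy of height_cm.
data = {
    "height_cm": [150.0, 160.0, 170.0, 180.0, 190.0],
    "height_in": [59.1, 63.0, 66.9, 70.9, 74.8],
    "weight_kg": [70.0, 52.0, 88.0, 61.0, 75.0],
}
kept = drop_redundant(data)
print(kept)
```

In this sketch the first variable of a correlated pair is kept and the later one is dropped. A real tool may use a different rule to decide which variable to retain.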