Products | Versions |
---|---|
Spotfire Statistica | 13.1 and higher |
IDP is the equivalent of the Streaming DB Connector in Statistica. IDP builds a bridge between the database and Statistica, so that Statistica can analyze one frame of data at a time. IDP is recommended only when the table being queried holds a million or more cases and the analysis lends itself to aggregation, for example Basic Statistics, Frequency Tables, or Breakdown/One-Way ANOVA.
The advantage of using IDP is that the data is not brought back into Statistica (or WebStatistica). Because IDP performs the analysis frame by frame, certain analyses such as Basic Statistics or Frequency Tables perform better through IDP than through a standard database query (Standard Database Connection and Dataset Query in WebStatistica).
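To illustrate why aggregation-style analyses suit frame-by-frame processing, here is a minimal Python sketch. It is not Statistica's implementation: it uses an in-memory SQLite table as a stand-in database, and the names `stream_stats`, `demo`, `measurements`, and `frame_size` are hypothetical. The point is only that count, mean, variance, and frequencies can be accumulated from running totals, so the full table never has to be held in memory at once.

```python
import sqlite3
from collections import Counter


def stream_stats(conn, query, frame_size=1000):
    """Aggregate a numeric column and a category column frame by frame.

    Assumes the query returns rows of (numeric_value, category).
    Only running totals are kept; each frame is discarded after use.
    """
    cur = conn.cursor()
    cur.execute(query)
    n = 0
    total = 0.0
    total_sq = 0.0
    freq = Counter()
    while True:
        frame = cur.fetchmany(frame_size)  # one "frame" of rows
        if not frame:
            break
        for value, category in frame:
            n += 1
            total += value
            total_sq += value * value
            freq[category] += 1
    mean = total / n if n else float("nan")
    # Sample variance from running sums (simple, but not the most
    # numerically stable formula for very large or very close values).
    variance = (total_sq - n * mean * mean) / (n - 1) if n > 1 else float("nan")
    return {"n": n, "mean": mean, "variance": variance, "freq": dict(freq)}


def demo():
    # Hypothetical stand-in table; a real IDP query would target the production database.
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE measurements (value REAL, grp TEXT)")
    rows = [(float(i), "A" if i % 2 == 0 else "B") for i in range(1, 11)]
    conn.executemany("INSERT INTO measurements VALUES (?, ?)", rows)
    result = stream_stats(conn, "SELECT value, grp FROM measurements", frame_size=3)
    conn.close()
    return result


if __name__ == "__main__":
    print(demo())
```

Analyses that need all cases at once (for example, anything that must sort or repeatedly revisit individual rows) do not reduce to running totals this way, which is consistent with the recommendation to reserve IDP for aggregation-friendly analyses.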
To query a dataset through an IDP connection in WebStatistica:
1. In WebStatistica, go to "File | Create Dataset". On the "IDP Connections" tab, enter a name for the new dataset and provide the database "Connection String" and "Query String".
To get the connection string and query string, go to "File | External Data | Streaming DB Connector" in Statistica and build a streaming database connection and query.
Alternatively, copy the "Connection String" from "Database Connection" and the "SQL Statement" from "Data Configuration" in Enterprise Manager, and paste them into the dialog above. Before doing so, click "Test Connections" and "Explore" in Enterprise Manager to confirm that the database connection succeeds and the dataset is retrievable.
Note: The Enterprise database connection string masks the password with *****. When pasting it into the WebStatistica IDP dialog above, remember to replace ***** with the actual password for the database connection.
2. Once the connection string and query string are entered, click the "Query Options" tab and set the parameters, including the maximum number of cases to retrieve. Click the question mark "?" in the top-right corner of the dialog for an explanation of the parameters.
3. Click OK. The IDP connection dataset (XXX.STI) is created and automatically set as the Active Data Source for downstream analysis. Note that the IDP connection does not display the dataset (e.g. in the Spreadsheet Editor), because it does not bring the data from the database back into Statistica.
4. To switch to another dataset (XXX.STI) for analysis, click "Open Dataset". In the "Select Active Data Source" dialog that opens, expand the folders, select the dataset of interest, and click OK to set it as the Active Data Source for further analysis.
5. After the IDP connection of interest is created and set as the Active Data Source, proceed with the intended analysis, for example "Correlation Matrices" under "Statistics | Basic Statistics".
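As a companion to the note in step 1 about the masked password, the following Python sketch shows one way to substitute the real password into a copied connection string before pasting it. The helper name `fill_masked_password` and the sample connection string are hypothetical and not part of Statistica. The sketch assumes the mask appears exactly once and does no quoting or escaping of special characters in the password.

```python
def fill_masked_password(connection_string, password, mask="*****"):
    """Replace the masked password placeholder with the real password.

    Raises ValueError if the mask is absent or appears more than once,
    so an ambiguous string is never silently altered.
    """
    occurrences = connection_string.count(mask)
    if occurrences != 1:
        raise ValueError(
            f"expected exactly one '{mask}' placeholder, found {occurrences}"
        )
    return connection_string.replace(mask, password)


if __name__ == "__main__":
    # Hypothetical connection string, for illustration only.
    masked = "Provider=SQLOLEDB;Data Source=myserver;User ID=analyst;Password=*****;"
    print(fill_masked_password(masked, "s3cret"))
```

Failing loudly when the placeholder is missing helps catch the case where the string was copied from a source that does not mask the password, or where it was already edited by hand.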