How to query data from Azure HDInsight into TIBCO Statistica Enterprise?

Article ID: KB0075204

Updated On:

Products Versions
Spotfire Statistica 13.3 and higher

Description

This article describes how to query data from Azure HDInsight into TIBCO Statistica Enterprise or TIBCO Statistica.

This article uses Statistica Enterprise Manager. A similar connection can be set up in Statistica Desktop via File >> External Data >> Query >> Create.

PREREQUISITES:
1. In addition to the Statistica/Statistica Enterprise installation, the Microsoft Hive ODBC driver must be downloaded and installed from: https://www.microsoft.com/en-us/download/details.aspx?id=40886
2. When using Statistica Enterprise Manager, the user must have permission to create a database connection: either System Admin (SADM), or Database Admin (EXTDB_ADM) and Data Admin (DADM).
3. The Azure HDInsight server name, port, and user credentials.
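
Prerequisite 1 can be checked from a script before opening the ODBC administrator. The following is a minimal Python sketch, not part of Statistica: it assumes the third-party pyodbc package is installed, and the helper names find_hive_driver and installed_odbc_drivers are hypothetical. It matches driver names loosely because the exact registered name may differ between driver versions.

```python
def find_hive_driver(installed_drivers):
    """Return the first driver name that looks like a Hive ODBC driver, or None."""
    for name in installed_drivers:
        lowered = name.lower()
        # Loose match: the registered name is assumed to contain both words.
        if "hive" in lowered and "odbc" in lowered:
            return name
    return None


def installed_odbc_drivers():
    """List the ODBC drivers registered on this machine (requires pyodbc)."""
    import pyodbc  # third-party; assumed installed with `pip install pyodbc`

    return pyodbc.drivers()
```

Calling find_hive_driver(installed_odbc_drivers()) on the Statistica machine returns the registered driver name, or None if the driver still needs to be installed.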

Issue/Introduction

How to query data from Azure HDInsight into TIBCO Statistica Enterprise?

Resolution

1. Launch Control Panel >> System and Security >> Administrative Tools >> ODBC Data Sources.

2. Define either a System DSN or a User DSN by selecting the corresponding tab, then click the Add button.

3. Choose Microsoft Hive ODBC Driver as the driver and click Finish. If the driver is not listed, it is most likely not installed; contact your system administrator.
 In the dialog that appears, type a name for the data source, the server details (such as the server name), and the user credentials.

4. Enter the server and port details and click Connect.

5. Click Advanced Options and enable the "Use Native Query" option.

6. Click OK, then click Test to verify that the connection succeeds.

7. Click OK and close all the dialogs. The System/User DSN is now defined.

8. Launch Statistica Enterprise Manager, right-click Database Connection, and click "New OLEDB Database connection" (or, in Statistica, use File >> External Data >> Query >> Create).

9. Choose the Microsoft OLE DB Provider for ODBC Drivers and click Next.

10. Choose the System/User DSN defined in steps 1 through 7 from the drop-down list. Enter the username and password for the Azure HDInsight server and enable "Allow Using password" so that the credentials do not have to be entered each time the data source is used.

11. Test the connection. Click OK twice to close both dialogs. Define a name for the database connection and its access permissions, then click Commit at the top left of the screen.

12. Right-click any folder in the Statistica Enterprise System View and choose "Create a new Data configuration".

13. Name the data configuration, choose the database connection that was defined in step 11, and click the New button next to Queries. Type the SQL statement.

14. Name the data configuration, define access permissions for it, and click Commit in the top left corner of Enterprise Manager to save it. Click Explore to bring the data from Azure HDInsight into Statistica for further analysis.
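
The DSN from steps 1 through 7 can also be exercised outside Statistica, which helps separate driver problems from Statistica configuration problems. The sketch below is an illustration under assumptions, not the Statistica implementation: it assumes the third-party pyodbc package, the function names are hypothetical, and the sample query targets hivesampletable, the sample table that HDInsight clusters typically ship with (substitute your own table if it is absent). Provider=MSDASQL is the name of the Microsoft OLE DB Provider for ODBC Drivers chosen in step 9.

```python
def build_connection_string(dsn, user, password, oledb=False):
    """Build an ODBC connection string, or an OLE DB one when oledb=True."""
    for value in (dsn, user, password):
        # This sketch omits connection-string quoting, so reject values that need it.
        if any(ch in value for ch in ";{}"):
            raise ValueError("values containing ';', '{' or '}' are not supported here")
    parts = [f"DSN={dsn}", f"UID={user}", f"PWD={password}"]
    if oledb:
        # MSDASQL = Microsoft OLE DB Provider for ODBC Drivers.
        parts.insert(0, "Provider=MSDASQL")
    return ";".join(parts) + ";"


# Assumed sample table and columns; replace with a table from your own cluster.
SAMPLE_QUERY = "SELECT clientid, devicemake, devicemodel FROM hivesampletable LIMIT 10"


def preview_rows(dsn, user, password, query=SAMPLE_QUERY):
    """Run a query through the DSN and return the fetched rows (requires pyodbc)."""
    import pyodbc  # third-party; assumed installed with `pip install pyodbc`

    # autocommit=True because Hive does not support ODBC transactions.
    conn = pyodbc.connect(build_connection_string(dsn, user, password), autocommit=True)
    try:
        cursor = conn.cursor()
        cursor.execute(query)
        return cursor.fetchall()
    finally:
        conn.close()
```

For example, preview_rows("MyHiveDSN", "admin", "secret") would return up to ten rows if the DSN works; the DSN name and credentials here are placeholders. The same query text can then be pasted into the Queries dialog in step 13.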