How to access files stored on HDFS from TIBCO Data Virtualization?

How to access files stored on HDFS from TIBCO Data Virtualization?

book

Article ID: KB0072754

calendar_today

Updated On:

Products Versions
TIBCO Data Virtualization 8.4, 8.5

Description

Refer to this article to access HDFS files inside TIBCO Data Virtualization.

Environment

Windows, Unix-based.

Resolution

  1. We can use the Local File Storage adapter to access the *.csv, *.txt and *.parquet file types stored on HDFS.
  2. Create a new Data Source in TDV Studio.
  3. Select Local File Storage adapter.
  4. Give URI in the format:   hdfs://<IP address>:<port>/teamstudio66.  In the URL <port> is NameNode service port.
  5. Click on Create & Introspect button.
  6. Select the resources that the user wants to use in TDV Studio
  7. Select Copy privileges from the parent folder checkbox.
  8. Click next.

Issue/Introduction

This article explains how files(.csv, .parquet, .txt) stored on HDFS can be accessed in TDV.