| Products | Versions |
|---|---|
| Spotfire Data Science | 6.x |
Operator HDFS Processing Tool Support
Operator HDFS Processing Tool Support
The following outlines which Data Processing Tool is supported by each of the various HDFS-supported Operators:
Note: Operators marked with "n/a" do not use any of the HDFS processing tools.| Pig | MapReduce | Spark | Sqoop | |
|---|---|---|---|---|
| Copy to Database | X (parallel mode) | |||
| Copy to Hadoop | X | |||
| Hadoop File | X |
| Pig | MapReduce | Spark | Sqoop | |
|---|---|---|---|---|
| Bar Chart | X | |||
| Box Plot | X | |||
| Correlation | X | |||
| Frequency | X | |||
| Histogram | X | |||
| Scatter Plot Matrix | X | |||
| Summary Statistics | X | X | ||
| T-Tests | X | |||
| Variable Selection | X |
| Pig | MapReduce | Spark | Sqoop | |
|---|---|---|---|---|
| Aggregation | X | |||
| Collapse | X | |||
| Column Filter | X | |||
| Distinct | X | |||
| Join | X | X | ||
| Normalization | X | |||
| Null Value Replacement | X | |||
| Pivot | X | |||
| Row Filter | X | |||
| Set Operations | X | |||
| Variable | X |
| Pig | MapReduce | Spark | Sqoop | |
|---|---|---|---|---|
| Random Sampling | X | |||
| Sample Selector | n/a | n/a | n/a | n/a |
| Pig | MapReduce | Spark | Sqoop | |
|---|---|---|---|---|
| Alpine Forest | X | X | ||
| Alpine Forest Regression | X | X | ||
| Decision Tree | X | |||
|
Gradient Boosting Classification | X | |||
|
Gradient Boosting Regression | X | |||
| K-Means | X | X | ||
| Linear Regression | X | X | ||
| Logistic Regression | X | X | ||
| Naive Bayes | X | |||
| PCA | X | |||
| SVM Classification | X | |||
| Time Series | X |
| Pig | MapReduce | Spark | Sqoop | |
|---|---|---|---|---|
| Classifier | X | |||
| Predictor | X | |||
| PCA Apply | X | |||
| Time Series Predictor | n/a | n/a | n/a | n/a |
| Pig | MapReduce | Spark | Sqoop | |
|---|---|---|---|---|
| Alpine Forest Evaluator | X | |||
| Confusion Matrix | X | |||
| Goodness of Fit | X | |||
| Lift | X | |||
| Regression Evaluator | X | |||
| ROC | X |
| Pig | MapReduce | Spark | Sqoop | |
|---|---|---|---|---|
| Export Operator | n/a | n/a | n/a | n/a |
| Flow Control | X | |||
| Note | n/a | n/a | n/a | n/a |
| Pig Execute | X | |||
| R Execute | n/a | n/a | n/a | n/a |
| Sub-Flow* |
*Note: For the Flow Control and Sub-Flow Operators, the HDFS Data
Processing tool used depends on the customer specific implementation.