Products | Versions |
---|---|
Spotfire Data Science | 6.x |
Operator HDFS Processing Tool Support
The following tables show which HDFS data processing tool (Pig, MapReduce, Spark, or Sqoop) each HDFS-supported Operator uses:
Note: Operators marked with "n/a" do not use any of the HDFS processing tools.

Operator | Pig | MapReduce | Spark | Sqoop |
---|---|---|---|---|
Copy to Database | X (parallel mode) | |||
Copy to Hadoop | X | |||
Hadoop File | X |

Operator | Pig | MapReduce | Spark | Sqoop |
---|---|---|---|---|
Bar Chart | X | |||
Box Plot | X | |||
Correlation | X | |||
Frequency | X | |||
Histogram | X | |||
Scatter Plot Matrix | X | |||
Summary Statistics | X | X | ||
T-Tests | X | |||
Variable Selection | X |

Operator | Pig | MapReduce | Spark | Sqoop |
---|---|---|---|---|
Aggregation | X | |||
Collapse | X | |||
Column Filter | X | |||
Distinct | X | |||
Join | X | X | ||
Normalization | X | |||
Null Value Replacement | X | |||
Pivot | X | |||
Row Filter | X | |||
Set Operations | X | |||
Variable | X |

Operator | Pig | MapReduce | Spark | Sqoop |
---|---|---|---|---|
Random Sampling | X | |||
Sample Selector | n/a | n/a | n/a | n/a |

Operator | Pig | MapReduce | Spark | Sqoop |
---|---|---|---|---|
Alpine Forest | X | X | ||
Alpine Forest Regression | X | X | ||
Decision Tree | X | |||
Gradient Boosting Classification | X | |||
Gradient Boosting Regression | X | |||
K-Means | X | X | ||
Linear Regression | X | X | ||
Logistic Regression | X | X | ||
Naive Bayes | X | |||
PCA | X | |||
SVM Classification | X | |||
Time Series | X |

Operator | Pig | MapReduce | Spark | Sqoop |
---|---|---|---|---|
Classifier | X | |||
Predictor | X | |||
PCA Apply | X | |||
Time Series Predictor | n/a | n/a | n/a | n/a |

Operator | Pig | MapReduce | Spark | Sqoop |
---|---|---|---|---|
Alpine Forest Evaluator | X | |||
Confusion Matrix | X | |||
Goodness of Fit | X | |||
Lift | X | |||
Regression Evaluator | X | |||
ROC | X |

Operator | Pig | MapReduce | Spark | Sqoop |
---|---|---|---|---|
Export Operator | n/a | n/a | n/a | n/a |
Flow Control | X | |||
Note | n/a | n/a | n/a | n/a |
Pig Execute | X | |||
R Execute | n/a | n/a | n/a | n/a |
Sub-Flow* |

*Note: For the Flow Control and Sub-Flow Operators, the HDFS data processing tool used depends on the customer-specific implementation.