Creating a scatterplot
Let's see what factors might influence Drug, the target variable. As a
researcher, you know that the concentrations of sodium and potassium in the blood are important
factors. Since these concentrations are both numeric values, you can create a scatterplot of sodium
versus potassium that uses the drug categories as a color overlay.
- Place a Plot node on the canvas and connect it to the drug1n.csv Data Asset node. Then double-click the Plot node to edit its properties.
- Select Na as the X field, K as the Y field, and Drug as the Color (overlay) field. Click Save. Hover over the Plot node and click the Run icon. A plot chart is added to the Outputs pane.

The plot clearly shows a threshold. For values higher than the threshold, drugY is always the correct drug. For values lower than the threshold, drugY is never the correct drug. This threshold is a ratio: the ratio of sodium (Na) to potassium (K).
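
For readers who want to reproduce the same view outside the flow canvas, the following is a minimal Python sketch, not part of the product. It assumes that drug1n.csv has a header row with columns named Na, K, and Drug (the field names used above) and that matplotlib is installed for the plotting step. The helper names load_points, na_to_k_ratios, and plot_groups are hypothetical and chosen only for illustration.

```python
import csv
from collections import defaultdict


def load_points(path):
    """Read (Na, K) pairs from a CSV file, grouped by the Drug column.

    Assumes a header row containing the columns Na, K, and Drug.
    """
    groups = defaultdict(list)
    with open(path, newline="") as f:
        for row in csv.DictReader(f):
            groups[row["Drug"]].append((float(row["Na"]), float(row["K"])))
    return dict(groups)


def na_to_k_ratios(groups):
    """Compute the sodium-to-potassium ratio for every point in each group."""
    return {drug: [na / k for na, k in points] for drug, points in groups.items()}


def plot_groups(groups):
    """Draw Na versus K with one color per drug category."""
    # matplotlib is a third-party package; it is imported here so that the
    # loading and ratio helpers above work without it.
    import matplotlib.pyplot as plt

    fig, ax = plt.subplots()
    for drug, points in sorted(groups.items()):
        xs, ys = zip(*points)
        ax.scatter(xs, ys, label=drug)  # each call gets the next default color
    ax.set_xlabel("Na")
    ax.set_ylabel("K")
    ax.legend(title="Drug")
    plt.show()
```

Calling plot_groups(load_points("drug1n.csv")) should give a chart comparable to the one in the Outputs pane. Comparing the values that na_to_k_ratios returns for drugY against those for the other drugs is one way to check the threshold numerically.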