cartnode プロパティー

資料の英語版に戻る

cartnode プロパティー

最終更新: 2024年10月04日

cartnode プロパティー

C & R Tree ノード・アイコン C&R Tree (分類と回帰ツリー) ノードは、ディシジョン・ツリーを生成し、将来の観測値を予測または分類できるようにします。この方法は再帰的なデータ区分を使用して学習レコードを複数のセグメントに分割し、各ステップで不純性を最小限に抑えます。ツリーのノードが「純粋」であると考えられるのは、ノード中にあるケースの 100% が、対象フィールドのある特定のカテゴリーに分類される場合です。対象フィールドおよび入力フィールドは、数値範囲またはカテゴリー (名義型、順序型、フラグ) が使用できます。すべての分岐は 2 分割です (2 つのサブグループのみ)。

例

node = stream.createAt("cart", "My node", 200, 100)
# "Fields" tab
node.setPropertyValue("custom_fields", True)
node.setPropertyValue("target", "Drug")
node.setPropertyValue("inputs", ["Age", "BP", "Cholesterol"])
# "Build Options" tab, "Objective" panel
node.setPropertyValue("model_output_type", "InteractiveBuilder")
node.setPropertyValue("use_tree_directives", True)
node.setPropertyValue("tree_directives", """Grow Node Index 0 Children 1 2
Grow Node Index 2 Children 3 4""")
# "Build Options" tab, "Basics" panel
node.setPropertyValue("prune_tree", False)
node.setPropertyValue("use_std_err_rule", True)
node.setPropertyValue("std_err_multiplier", 3.0)
node.setPropertyValue("max_surrogates", 7)
# "Build Options" tab, "Stopping Rules" panel
node.setPropertyValue("use_percentage", True)
node.setPropertyValue("min_parent_records_pc", 5)
node.setPropertyValue("min_child_records_pc", 3)
# "Build Options" tab, "Advanced" panel
node.setPropertyValue("min_impurity", 0.0003)
node.setPropertyValue("impurity_measure", "Twoing")
# "Model Options" tab
node.setPropertyValue("use_model_name", True)
node.setPropertyValue("model_name", "Cart_Drug")

表 1. cartnode プロパティー
`cartnode`プロパティー	値	プロパティーの説明
`target`	フィールド	C&R Tree モデルは 1 つの対象フィールドおよび 1 つ以上の入力フィールドを使用します。度数フィールドも指定できます。詳しくは、トピック「共通モデル作成ノードのプロパティー」を参照してください。
`continue_training_existing_model`	フラグ
`objective`	`Standard` `Boosting` `Bagging` `psm`	`psm`は、非常に大きなデータ・セットに使用され、サーバー接続を必要とします。
`model_output_type`	`Single` `InteractiveBuilder`
`use_tree_directives`	フラグ
`tree_directives`	string	ツリーの成長のためのディレクティブ (式) を指定します。ディレクティブ (式) は、改行や引用符のエスケープ処理を回避するために、三重の引用符で囲むことができます。ディレクティブは、データやモデルリング・オプションの些細な変更に依存するため、他のデータセットに対しては一般化できません。
`use_max_depth`	`Default` `Custom`
`max_depth`	整数	最大ツリー深さ (0 から 1000)。 `use_max_depth = Custom`の場合にのみ使用されます。
`prune_tree`	フラグ	オーバーフィットしないようにツリーを剪定します。
`use_std_err`	フラグ	リスクにおける最大差 (標準誤差) を使用します。
`std_err_multiplier`	数値	最大差。
`max_surrogates`	数値	最大代理変数。
`use_percentage`	フラグ
`min_parent_records_pc`	数値
`min_child_records_pc`	数値
`min_parent_records_abs`	数値
`min_child_records_abs`	数値
`use_costs`	フラグ
`costs`	構造化	構造化プロパティー。
`priors`	`Data` `Equal` `Custom`
`custom_priors`	構造化	構造化プロパティー。
`adjust_priors`	フラグ
`trails`	数値	ブーストまたはバグのコンポーネント・モデル数。
`set_ensemble_method`	`Voting` `HighestProbability` `HighestMeanProbability`	カテゴリー型対象のデフォルト結合ルール。
`range_ensemble_method`	`Mean` `Median`	連続型対象のデフォルト結合ルール。
`large_boost`	フラグ	特に大きなデータセットのブースティングを適用します。
`min_impurity`	数値
`impurity_measure`	`Gini` `Twoing` `Ordered`
`train_pct`	数値	オーバーフィット防止セット。
`set_random_seed`	フラグ	結果を再現オプション。
`seed`	数値
`calculate_variable_importance`	フラグ
`calculate_raw_propensities`	フラグ
`calculate_adjusted_propensities`	フラグ
`adjusted_propensity_partition`	`Test` `Validation`