
Random Forest Classifier: Feature Importance for Predicted Probabilities

IT Tech · xiaolong · April 12, 2025

I am using sklearn's random forest classifier (RFC).

   forest.fit(training_data, y_train)
   probas_test = forest.predict_proba(test_data)

I would like to know whether there is a way to find out how much each feature contributed to a given prediction.

Something like the following, but at the level of an individual data point:

   forest.feature_importances_

Answer:

This can be approached in several ways; see http://blog.datadive.net/interpreting-random-forests/ (and the accompanying Python package: https://github.com/andosa/treeinterpreter). There are also other, less direct options, for example:

https://arxiv.org/abs/1606.05390 （实现：https://github.com/sato9hara/defragTrees）
https://arxiv.org/abs/1611.05722 （实现：https://github.com/IBCNServices/GENESIM）
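
To make the first suggestion more concrete, here is a minimal sketch of the path-based decomposition idea (each predicted probability is written as a bias term plus one contribution per feature), written directly against sklearn's tree internals rather than the treeinterpreter package. The function name `forest_contributions` and the synthetic data are assumptions for illustration only and are not part of either library.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier


def forest_contributions(forest, X):
    """Hypothetical helper: split predict_proba into bias + per-feature terms.

    Returns (bias, contributions) with shapes (n_classes,) and
    (n_samples, n_features, n_classes).
    """
    X = np.asarray(X)
    n_samples, n_features = X.shape
    bias = np.zeros(forest.n_classes_)
    contributions = np.zeros((n_samples, n_features, forest.n_classes_))

    for tree in forest.estimators_:
        t = tree.tree_
        # Class distribution at every node, normalized to probabilities.
        values = t.value[:, 0, :]
        values = values / values.sum(axis=1, keepdims=True)
        bias += values[0]  # the root node gives this tree's baseline

        paths = tree.decision_path(X)  # sparse CSR node-indicator matrix
        for i in range(n_samples):
            path = paths.indices[paths.indptr[i]:paths.indptr[i + 1]]
            # Credit each change in class probability to the feature
            # that was split on at the parent node.
            for parent, child in zip(path[:-1], path[1:]):
                contributions[i, t.feature[parent]] += values[child] - values[parent]

    n_trees = len(forest.estimators_)
    return bias / n_trees, contributions / n_trees


# Synthetic data, used only for illustration.
X, y = make_classification(n_samples=200, n_features=5, random_state=0)
forest = RandomForestClassifier(n_estimators=20, random_state=0).fit(X, y)

bias, contributions = forest_contributions(forest, X[:3])
# Bias plus the summed contributions should reproduce predict_proba.
reconstructed = bias + contributions.sum(axis=1)
print(np.allclose(reconstructed, forest.predict_proba(X[:3])))
```

For one data point `i`, `contributions[i, :, k]` lists how far each feature moved the predicted probability of class `k` away from the baseline. The per-sample Python loop is slow on large inputs, so this is better suited to inspecting a handful of predictions than to bulk scoring.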

machine-learning random-forest scikit-learn
