通过Hyperopt解决的最佳参数不合适

我使用hyperopt来搜索SVM分类器的最佳参数，但Hyperopt表示最佳的’kernel’是’0’。{‘kernel’: ‘0’} 显然是不合适的。

是否有人知道这是我的错误还是hyperopt的一个bug导致的？

代码如下。

from hyperopt import fmin, tpe, hp, randimport numpy as npfrom sklearn.metrics import accuracy_scorefrom sklearn import svmfrom sklearn.cross_validation import StratifiedKFoldparameter_space_svc = {   'C':hp.loguniform("C", np.log(1), np.log(100)),   'kernel':hp.choice('kernel',['rbf','poly']),   'gamma': hp.loguniform("gamma", np.log(0.001), np.log(0.1)),    }from sklearn import datasetsiris = datasets.load_digits()train_data = iris.datatrain_target = iris.targetcount = 0def function(args):  print(args)  score_avg = 0  skf = StratifiedKFold(train_target, n_folds=3, shuffle=True, random_state=1)  for train_idx, test_idx in skf:    train_X = iris.data[train_idx]    train_y = iris.target[train_idx]    test_X = iris.data[test_idx]    test_y = iris.target[test_idx]    clf = svm.SVC(**args)    clf.fit(train_X,train_y)    prediction = clf.predict(test_X)    score = accuracy_score(test_y, prediction)    score_avg += score  score_avg /= len(skf)  global count  count = count + 1  print("round %s" % str(count),score_avg)  return -score_avgbest = fmin(function, parameter_space_svc, algo=tpe.suggest, max_evals=100)print("best estimate parameters",best)

输出如下。

best estimate parameters {'C': 13.271912841932233, 'gamma': 0.0017394328334592358, 'kernel': 0}

回答：

首先，你使用的是sklearn.cross_validation，该模块自0.18版本起已被弃用。因此请更新为sklearn.model_selection。

现在回到主要问题，来自fmin的best总是返回使用hp.choice定义的参数的索引。

所以在你的例子中，'kernel':0意味着选择第一个值（'rbf'）作为kernel的最佳值。

请查看以下链接以确认这一点：

https://github.com/hyperopt/hyperopt/issues/216

要从best中获取原始值，请使用space_eval()函数，如下所示：

from hyperopt import space_evalspace_eval(parameter_space_svc, best)Output:{'C': 13.271912841932233, 'gamma': 0.0017394328334592358, 'kernel': 'rbf'}

学技术

通过Hyperopt解决的最佳参数不合适

发表回复取消回复

相关文章：

Related Posts

使用LSTM在Python中预测未来值

如何在gensim的word2vec模型中查找双词组的相似性

dask_xgboost.predict 可以工作但无法显示 – 数据必须是一维的

ML Tuning – Cross Validation in Spark

如何在React JS中使用fetch从REST API获取预测

如何分析ML.NET中多类分类预测得分数组？

发表回复 取消回复

发表回复取消回复