使用Python的字符串子序列核和支持向量机

如何使用字符串子序列核（SSK）[Lodhi 2002]在Python中训练支持向量机（SVM）？

回答：

这是对gcedo的回答的更新，以适应当前版本的shogun（Shogun 6.1.3）。

工作示例：

import numpy as npfrom shogun import StringCharFeatures, RAWBYTEfrom shogun import BinaryLabelsfrom shogun import SubsequenceStringKernelfrom shogun import LibSVMstrings = ['cat', 'doom', 'car', 'boom','caboom','cartoon','cart']test = ['bat', 'soon', 'it is your doom', 'i love your cat cart','i love loonytoons']train_labels  = np.array([1, -1, 1, -1,-1,-1,1])test_labels = np.array([1, -1, -1, 1])features = StringCharFeatures(strings, RAWBYTE)test_features = StringCharFeatures(test, RAWBYTE)# 1是n，0.5是lambda，如Lodhi 2002中所述sk = SubsequenceStringKernel(features, features, 3, 0.5)# 训练支持向量机labels = BinaryLabels(train_labels)C = 1.0svm = LibSVM(C, sk, labels)svm.train()# 预测predicted_labels = svm.apply(test_features).get_labels()print(predicted_labels)

学技术

使用Python的字符串子序列核和支持向量机

发表回复取消回复

相关文章：

Related Posts

使用LSTM在Python中预测未来值

如何在gensim的word2vec模型中查找双词组的相似性

dask_xgboost.predict 可以工作但无法显示 – 数据必须是一维的

ML Tuning – Cross Validation in Spark

如何在React JS中使用fetch从REST API获取预测

如何分析ML.NET中多类分类预测得分数组？

发表回复 取消回复

发表回复取消回复