Auto-sklearn:如何加载pickle文件并运行predict()函数。

4

我使用auto-sklearn拟合了一个分类模型,并成功将其保存到pickle文件中。

x = automl.show_models()
results = {"ensemble": x}
pickle.dump(results, open('file.pickle','wb'))

我也成功重新加载了该模型。

automl = pickle.load(open('file.pickle','rb'))

但我无法使用重新加载的模型对新数据运行预测。

当我运行:

y_hat = automl.predict(X_test)

I get the following error:

AttributeError: 'str' object has no attribute 'predict'

你可以尝试使用MLJAR AutoML https://github.com/mljar/mljar-supervised - 它会自动保存和加载模型,无需直接调用load/save。 - pplonski
2个回答

6

错误:

x = automl.show_models()
results = {"ensemble": x} # <---
pickle.dump(results, open('file.pickle','wb'))

正确:

x = automl.show_models()
#results = {"ensemble": x}
results = autml # the classifier/regressor itself
pickle.dump(results, open('file.pickle','wb'))

Iris的示例代码:

from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score
from autosklearn.classification import AutoSklearnClassifier
import pickle


# dataset:
X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size = 0.2)

# train model:
classifier = AutoSklearnClassifier(
    time_left_for_this_task=30, 
    per_run_time_limit=60,
    memory_limit=1024*12) # depends on your computer
classifier.fit(X_train, y_train)

# save model
with open('iris-classifier.pkl', 'wb') as f:
    pickle.dump(classifier, f)

# load model
with open('iris-classifier.pkl', 'rb') as f:
    loaded_classifier = pickle.load(f)

# predict
y_true = y_test
y_pred = loaded_classifier.predict(X_test)
print('iris classifier: accuracy:', accuracy_score(y_true, y_pred))
# iris classifier: accuracy: 0.9333333333333333

2

与其将automl.show_models()的输出结果丢弃,我认为应该将automl对象丢弃,因为这是唯一具有predict方法的类。


网页内容由stack overflow 提供, 点击上面的
可以查看英文原文,
原文链接