我试图使用statsmodels.formula.api
OLS实现的predict()
函数。当我将新的数据框传递给函数以获取样本外数据集的预测值时,result.predict(newdf)
返回以下错误:'DataFrame' object has no attribute 'design_info'
。这是什么意思,我该如何解决?完整的回溯信息如下:
p = result.predict(newdf)
File "C:\Python27\lib\site-packages\statsmodels\base\model.py", line 878, in predict
exog = dmatrix(self.model.data.orig_exog.design_info.builder,
File "C:\Python27\lib\site-packages\pandas\core\frame.py", line 2088, in __getattr__
(type(self).__name__, name))
AttributeError: 'DataFrame' object has no attribute 'design_info'
编辑: 这里有一个可重现的示例。看起来错误是发生在我对结果对象进行pickle(腌制)和unpickle(解腌)的时候(这是我在实际项目中需要做的):
import cPickle
import pandas as pd
import numpy as np
import statsmodels.formula.api as sm
df = pd.DataFrame({"A": [10,20,30,324,2353], "B": [20, 30, 10, 1, 2332], "C": [0, -30, 120, 11, 2]})
result = sm.ols(formula="A ~ B + C", data=df).fit()
print result.summary()
test1 = result.predict(df) #works
f_myfile = open('resultobject', "wb")
cPickle.dump(result, f_myfile, 2)
f_myfile.close()
print("Result Object Saved")
f_myfile = open('resultobject', "rb")
model = cPickle.load(f_myfile)
test2 = model.predict(df) #produces error