我正在尝试使用pyvttbl对我的数据集进行ANOVA,但是我遇到了一个奇怪的错误。
以下是我的代码:
import pyvttbl
df = pyvttbl.DataFrame()
df.read_tbl("ANOVA_MWE_input.csv")
print df
print type(df)
AN = df.anova('len', sub='id', bfactors=['p1', 'p2'])
输出结果为:
id name len p1 p2
=========================
0 AAA 32 1 0
1 BBB 33 2 0
2 CCC 29 3 0
3 DDD 22 4 0
<class 'pyvttbl.base.DataFrame'>
Traceback (most recent call last):File "/home/stefano/ownCloud/PycharmProjects/Stockh_cours/ANOVA_MWE.py", line 15, in <module>
AN = df.anova('len', sub='id', bfactors=['p1', 'p2'])
File "/usr/local/lib/python2.7/dist-packages/pyvttbl/base.py", line 1975, in anova
measure=measure, transform=transform, alpha=alpha)
File "/usr/local/lib/python2.7/dist-packages/pyvttbl/stats/_anova.py", line 713, in run
self._between()
File "/usr/local/lib/python2.7/dist-packages/pyvttbl/stats/_anova.py", line 751, in _between
cw = self._num2binvec(e,Nf)
File "/usr/local/lib/python2.7/dist-packages/pyvttbl/stats/_anova.py", line 1240, in _num2binvec
return list(array(list(zeros((p-len(b))))+b)+1.)
TypeError: 'float' object cannot be interpreted as an index
我真的不明白我的数据集中哪里有一个浮点对象。你能帮忙吗?实际上,当我将其应用于我的实际表格时,我会得到一个超出范围的错误。奇怪的是,只有in_file更改的MWE中我才会遇到这个问题。
感谢任何建议。
from statsmodels.stats.anova import anova_lm
。它有不同的输出(信息较少),但对于我的目的来说仍然足够。 - Stefano_g