Seaborn中出现AttributeError: 'str' object has no attribute 'view',散点图

17

我在Seaborn中遇到了一个奇怪的异常。

以下是可复现的示例:

toy_data.to_json()
'{"X":{"0":0.12765045,"1":0.0244816152,"2":0.1263715245,"3":0.0246376768,"4":0.1108581319,"5":0.1406719382,"6":0.1358105564,"7":0.1245863432,"8":0.1175445352,"9":0.1188479018,"10":0.1113148159,"11":0.117455495,"12":0.110555662,"13":0.1328567106,"14":0.103064284,"15":0.1119474442,"16":0.119390455,"17":0.1246727756,"18":0.1117827155,"19":0.1169972547},"Y":{"0":0.1241083714,"1":0.1394242378,"2":0.1225010796,"3":0.0077080173,"4":0.1198371354,"5":0.0029026989,"6":0.1259473297,"7":0.0,"8":0.0,"9":0.1214620231,"10":0.1204110739,"11":0.0,"12":0.1194570059,"13":0.0014971676,"14":0.1184584731,"15":0.1212061305,"16":0.1221438778,"17":0.0,"18":0.1209991075,"19":0.0},"Label":{"0":"17","1":"3","2":"17","3":"0","4":"14","5":"21","6":"16","7":"23","8":"20","9":"15","10":"14","11":"20","12":"14","13":"22","14":"13","15":"14","16":"15","17":"23","18":"14","19":"20"},"Probability":{"0":1.0,"1":1.0,"2":1.0,"3":1.0,"4":1.0,"5":1.0,"6":1.0,"7":1.0,"8":1.0,"9":1.0,"10":1.0,"11":1.0,"12":1.0,"13":1.0,"14":1.0,"15":1.0,"16":1.0,"17":1.0,"18":0.9101796407,"19":1.0}}'

toy_data.head()
       X           Y    Label   Probability
0   0.127650    0.124108    17  1.0
1   0.024482    0.139424    3   1.0
2   0.126372    0.122501    17  1.0
3   0.024638    0.007708    0   1.0
4   0.110858    0.119837    14  1.0

sns.scatterplot(x = toy_data.X, y = toy_data.Y, hue = toy_data.Label.values, alpha = 0.5)

AttributeError: 'str' object has no attribute 'view'

与此语法类似的异常:

sns.scatterplot(x = 'X', y = 'Y', data = toy_data, hue = 'Label', alpha = 0.5)

1
请查看此链接:https://github.com/mwaskom/seaborn/issues/1515 - rafaelc
你应该将 'Label' 列更改为数字。这样可能就可以了,除非它们确实是分类的。 - busybear
2个回答

20

这是 seaborn 的一个奇特问题,颜色 调色板 需要与标签数量相等

sns.scatterplot(x = toy_data.X, y = toy_data.Y, hue = toy_data.Label, alpha = 0.5,
                palette=sns.color_palette("Set1", toy_data.Label.nunique()) )

如果没有指定,seaborn将会使用默认的四种颜色来映射hue类别的数量。这就是为什么在值为字符串时会出现错误。


2
这应该是我个人认为可以接受的答案,因为它不强制色调为数字(考虑到真正的分类数据,例如邮政编码)。而且以防不够清楚:我们所需要做的就是添加带有自定义颜色数量的调色板参数。 - Chiraz BenAbdelkader

5

根据所述,您的列标签是一个对象。它需要是一个数字(int或float)。

toy_data.info()

<class 'pandas.core.frame.DataFrame'>
Index: 20 entries, 0 to 9
Data columns (total 4 columns):
X              20 non-null float64
Y              20 non-null float64
Label          20 non-null object
Probability    20 non-null float64
dtypes: float64(3), object(1)
memory usage: 800.0+ bytes

请执行以下操作:

toy_data.loc[:,'Label'] = toy_data.Label.astype(np.float)

将其更改为浮点数。

然后您的命令:

sns.scatterplot(x = toy_data.X, y = toy_data.Y, hue = toy_data.label.values, alpha = 0.5)

应该可以正常工作。

在此输入图片描述

我正在使用此命令生成数据框:

dict = {"X":{"0":0.12765045,"1":0.0244816152,"2":0.1263715245,"3":0.0246376768,"4":0.1108581319,"5":0.1406719382,"6":0.1358105564,"7":0.1245863432,"8":0.1175445352,"9":0.1188479018,"10":0.1113148159,"11":0.117455495,"12":0.110555662,"13":0.1328567106,"14":0.103064284,"15":0.1119474442,"16":0.119390455,"17":0.1246727756,"18":0.1117827155,"19":0.1169972547},"Y":{"0":0.1241083714,"1":0.1394242378,"2":0.1225010796,"3":0.0077080173,"4":0.1198371354,"5":0.0029026989,"6":0.1259473297,"7":0.0,"8":0.0,"9":0.1214620231,"10":0.1204110739,"11":0.0,"12":0.1194570059,"13":0.0014971676,"14":0.1184584731,"15":0.1212061305,"16":0.1221438778,"17":0.0,"18":0.1209991075,"19":0.0},"Label":{"0":"17","1":"3","2":"17","3":"0","4":"14","5":"21","6":"16","7":"23","8":"20","9":"15","10":"14","11":"20","12":"14","13":"22","14":"13","15":"14","16":"15","17":"23","18":"14","19":"20"},"Probability":{"0":1.0,"1":1.0,"2":1.0,"3":1.0,"4":1.0,"5":1.0,"6":1.0,"7":1.0,"8":1.0,"9":1.0,"10":1.0,"11":1.0,"12":1.0,"13":1.0,"14":1.0,"15":1.0,"16":1.0,"17":1.0,"18":0.9101796407,"19":1.0}}
toy_data = pd.DataFrame(dict)

你需要使用numpy import numpy as np 将数值转化为浮点数。

不,色调变量可以是分类的或数值的。请参阅此处的文档:https://seaborn.pydata.org/generated/seaborn.scatterplot.html - samhiggins2001

网页内容由stack overflow 提供, 点击上面的
可以查看英文原文,
原文链接