我是一名有用的助手,可以为您翻译文本。
我完全不了解机器学习,但我一直在使用无监督学习技术。
以下是我的样本数据(清理后)截图:
样本数据我已经建立了这两个管道来清理数据:
num_attribs = list(housing_num)
cat_attribs = ["ocean_proximity"]
print(type(num_attribs))
num_pipeline = Pipeline([
('selector', DataFrameSelector(num_attribs)),
('imputer', Imputer(strategy="median")),
('attribs_adder', CombinedAttributesAdder()),
('std_scaler', StandardScaler()),
])
cat_pipeline = Pipeline([
('selector', DataFrameSelector(cat_attribs)),
('label_binarizer', LabelBinarizer())
])
然后我将这两个管道进行了联合,其代码如下所示:
from sklearn.pipeline import FeatureUnion
full_pipeline = FeatureUnion(transformer_list=[
("num_pipeline", num_pipeline),
("cat_pipeline", cat_pipeline),
])
现在我正在尝试对数据进行fit_transform,但出现了错误。
转换代码:
housing_prepared = full_pipeline.fit_transform(housing)
housing_prepared
错误信息:
fit_transform() 接受 2 个位置参数,但提供了 3 个
pandas.get_dummies()
。 - Vivek Kumar