Spark:从管道模型中提取ML逻辑回归模型的摘要

6

我使用管道估计了一项逻辑回归。

在拟合逻辑回归之前,我的最后几行代码如下:

from pyspark.ml.feature import VectorAssembler
from pyspark.ml.classification import LogisticRegression
lr = LogisticRegression(featuresCol="lr_features", labelCol = "targetvar")
# create assember to include encoded features
    lr_assembler = VectorAssembler(inputCols= numericColumns + 
                               [categoricalCol + "ClassVec" for categoricalCol in categoricalColumns],
                               outputCol = "lr_features")
from pyspark.ml.classification import LogisticRegression
from pyspark.ml import Pipeline
# Model definition:
lr = LogisticRegression(featuresCol = "lr_features", labelCol = "targetvar")
# Pipeline definition:
lr_pipeline = Pipeline(stages = indexStages + encodeStages +[lr_assembler, lr])
# Fit the logistic regression model:
lrModel = lr_pipeline.fit(train_train)

我尝试运行模型的摘要(summary),但是下面的代码行出现问题:

trainingSummary = lrModel.summary

结果为:'PipelineModel'对象没有'summary'属性

有什么建议可以从管道模型中提取通常包含在回归模型中的摘要信息吗?

非常感谢!

1个回答

9

只需从阶段获取模型:

lrModel.stages[-1].summary

如果model在Pipeline中较早,则将-1替换为其索引。

网页内容由stack overflow 提供, 点击上面的
可以查看英文原文,
原文链接