我该如何在Spark中对多列求和?例如,在SparkR中,以下代码可以用于获取一列的总和,但如果我尝试获取
df
中两列的总和,则会出现错误。# Create SparkDataFrame
df <- createDataFrame(faithful)
# Use agg to sum total waiting times
head(agg(df, totalWaiting = sum(df$waiting)))
##This works
# Use agg to sum total of waiting and eruptions
head(agg(df, total = sum(df$waiting, df$eruptions)))
##This doesn't work
无论是SparkR还是PySpark代码都可以使用。