I am trying to reproduce the result from this Stack Overflow question: dplyr: How to apply do() on result of group_by?
Here is the data:
person = c('Grace', 'Grace', 'Grace', 'Rob', 'Rob', 'Rob')
foods = c('apple', 'banana', 'cucumber', 'spaghetti', 'cucumber', 'banana')
eaten <- data.frame(person, foods, stringsAsFactors = FALSE)
The result I am trying to reproduce is:
[[1]]
     [,1]     [,2]       [,3]
[1,] "apple"  "apple"    "banana"
[2,] "banana" "cucumber" "cucumber"

[[2]]
     [,1]        [,2]        [,3]
[1,] "spaghetti" "spaghetti" "cucumber"
[2,] "cucumber"  "banana"    "banana"
The original code that produced this result is shown below, but it no longer works:
> eaten %>% group_by(person) %>% do(function(x) combn(x$foods, m = 2))
Error: Results are not data frames at positions: 1, 2
I tried several variations, but none of them works with do():
> eaten %>% group_by(person) %>% do(combn(.$foods, m = 2))
Error: Results are not data frames at positions: 1, 2
> eaten %>% group_by(person) %>% do(.$foods, combn, m =2)
Error: Arguments to do() must either be all named or all unnamed
> eaten %>% group_by(person) %>% do((combn(.$foods, m=2)))
Error: Results are not data frames at positions: 1, 2
Only the following seems to work, but it emits warnings:
> eaten %>% group_by(person) %>% do(as.data.frame(combn(.$foods, m = 2)))
#   person        V1        V2       V3
# 1  Grace     apple     apple   banana
# 2  Grace    banana  cucumber cucumber
# 3    Rob spaghetti spaghetti cucumber
# 4    Rob  cucumber    banana   banana
# Warning messages:
# 1: In rbind_all(out[[1]]) : Unequal factor levels: coercing to character
# 2: In rbind_all(out[[1]]) : Unequal factor levels: coercing to character
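The warning text suggests that the character matrix was turned into factor columns whose levels differ between the two groups. Before R 4.0.0, as.data.frame() converted strings to factors by default. A minimal sketch of keeping the columns as character, assuming that is the cause of the warnings (the names `m` and `df` are illustrative only):

```r
# All pairs of three foods as a 2 x 3 character matrix
m <- combn(c("apple", "banana", "cucumber"), m = 2)

# Keep the columns as character instead of letting them become factors
df <- as.data.frame(m, stringsAsFactors = FALSE)

# Inspect the class of each column
sapply(df, class)
```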
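I believe the behavior of do() must have changed in a newer version. What changed, and how should do() be used correctly? Thanks.
Edit: I installed the latest dplyr and ran the code suggested by @hadley.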
packageVersion("dplyr")
[1] ‘0.3.0.2’
eaten %>% group_by(person) %>% do(x = combn(.$foods, m = 2))
# Source: local data frame [2 x 2]
# Groups: <by row>
#
# person x
# 1 Grace <chr[2,3]>
# 2 Rob <chr[2,3]>
Edit 2: As @hadley suggested, the "x" column needs to be extracted.
eaten2 <- eaten %>% group_by(person) %>% do(x = combn(.$foods, m = 2))
eaten2[["x"]]
# [[1]]
#      [,1]     [,2]       [,3]
# [1,] "apple"  "apple"    "banana"
# [2,] "banana" "cucumber" "cucumber"
#
# [[2]]
#      [,1]        [,2]        [,3]
# [1,] "spaghetti" "spaghetti" "cucumber"
# [2,] "cucumber"  "banana"    "banana"
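For comparison, the same list of matrices can also be built without do(). This is a minimal base R sketch, not something taken from the original answers; it assumes only the `eaten` data frame defined at the top, and `by_person` and `pairs` are illustrative names.

```r
person <- c('Grace', 'Grace', 'Grace', 'Rob', 'Rob', 'Rob')
foods <- c('apple', 'banana', 'cucumber', 'spaghetti', 'cucumber', 'banana')
eaten <- data.frame(person, foods, stringsAsFactors = FALSE)

# Split the foods into one character vector per person
by_person <- split(eaten$foods, eaten$person)

# Apply combn() to each person's foods; every element is a 2 x 3 character matrix
pairs <- lapply(by_person, combn, m = 2)

# Look at one person's combinations
pairs[["Grace"]]
```

Unlike the do() version, the list here is named by person, so a single group can be looked up directly.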
do(as.data.frame(combn(.$foods, m = 2), stringsAsFactors = FALSE))
- Hope this helps. - talatx
column, and you will get what you want. - hadley