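I want to use dplyr's summarise to sum values by group. Specifically, if not all of the values in a group are NA, I want to drop the NAs; but if all of the values in a group are NA, I want the result to be NA. For example: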
name <- c("jack", "jack", "mary", "mary", "ellen", "ellen")
number <- c(1,2,1,NA,NA,NA)
df <- data.frame(name,number)
In this case, I would like to get the following result:
- Jack = 3
- Mary = 1
- Ellen = NA
If I set na.rm = F:
df %>% group_by(name) %>% summarise(number = sum(number, na.rm = F))
The result is:
- Jack = 3
- Mary = NA
- Ellen = NA
If I set na.rm = T:
df %>% group_by(name) %>% summarise(number = sum(number, na.rm = T))
The result is:
- Jack = 3
- Mary = 1
- Ellen = 0
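
For reference, the behaviour I am after could be sketched as follows. This is only a minimal illustration, not necessarily the idiomatic dplyr way; `sum_or_na` is a hypothetical helper name introduced here, and it assumes the column is numeric. It returns NA when every value in the group is NA, and otherwise sums with the NAs dropped.

```r
library(dplyr)

name <- c("jack", "jack", "mary", "mary", "ellen", "ellen")
number <- c(1, 2, 1, NA, NA, NA)
df <- data.frame(name, number)

# Hypothetical helper: NA if all values are NA, otherwise the sum ignoring NAs.
sum_or_na <- function(x) {
  if (all(is.na(x))) NA_real_ else sum(x, na.rm = TRUE)
}

result <- df %>%
  group_by(name) %>%
  summarise(number = sum_or_na(number))

print(result)
```

With the sample data above, this should give 3 for jack, 1 for mary, and NA for ellen.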