ggplot2中的stat_summary mean_sdl与mean +/- sd不同

7

我不确定为什么使用Hmisc中的mean_sdl函数在ggplot2中生成的误差条比手动计算并绘制平均值+标准差和平均值-标准差得到的误差条要显著宽。我的代码:

library(drc)
library(tidyverse)

test_dataset <- 
  structure(
    list(
      X = c(1e-10, 1e-08, 3e-08, 1e-07, 3e-07, 1e-06, 3e-06, 1e-05, 3e-05, 1e-04, 3e-04),
      AY1 = c(0, 11, 125, 190, 258, 322, 354, 348, NA, 412, NA),
      AY2 = c(3, 33, 141, 218, 289, 353, 359, 298, NA, 378, NA),
      AY3 = c(2, 25, 160, 196, 345, 328, 369, 372, NA, 399, NA),
      BY1 = c(3, NA, 11, 52, 80, 171, 289, 272, 359, 352, 389),
      BY2 = c(5, NA, 25, 55, 77, 195, 230, 333, 306, 320, 338),
      BY3 = c(4, NA, 28, 61, 44, 246, 243, 310, 297, 365, NA)
    ),
    class = c("tbl_df", "tbl", "data.frame"),
    row.names = c(NA,-11L),
    .Names = c("X", "AY1", "AY2", "AY3", "BY1", "BY2", "BY3")
  )

test_dataset2 <- test_dataset %>% 
  rename(conc = X) %>% 
  gather(-conc, key = "measurement", value = "signal") %>% 
  separate(col = measurement, into = c("mAb", "rep"), sep = "Y")

plot_with_mean_sdl <- ggplot(test_dataset2, aes(x = conc, y = signal, col = mAb)) + 
  scale_x_log10() +
  stat_summary(fun.data = mean_se, 
               geom = "point",
               size = 2
               ) +
  # geom_errorbar(data = (test_dataset2 %>% group_by(mAb, conc) %>% 
  # summarise(AVG = mean(signal), SD = sd(signal)) %>% 
  # dplyr::filter(AVG != "NA") %>% 
  # mutate(top = AVG + SD, bottom = AVG - SD)), aes(x = conc, y = AVG, ymin = bottom, ymax = top)) + 
  stat_summary(fun.data = mean_sdl, geom = "errorbar") +
  stat_smooth(method = "drm",
              method.args=list(fct = L.4()),
              se = F,
              n = 300
              )

plot_with_manual_errorbars <- ggplot(test_dataset2, aes(x = conc, y = signal, col = mAb)) + 
  scale_x_log10() +
  stat_summary(fun.data = mean_se, 
               geom = "point",
               size = 2
               ) +
  geom_errorbar(data = (test_dataset2 %>% group_by(mAb, conc) %>%
  summarise(AVG = mean(signal), SD = sd(signal)) %>%
  dplyr::filter(AVG != "NA") %>%
  mutate(top = AVG + SD, bottom = AVG - SD)), aes(x = conc, y = AVG, ymin = bottom, ymax = top)) +
  # stat_summary(fun.data = mean_sdl, geom = "errorbar") +
  stat_smooth(method = "drm",
              method.args=list(fct = L.4()),
              se = F,
              n = 300
              )

我以为Hmisc包中的smean_sdl函数应该绘制平均值加减一个常数倍标准偏差的范围,我错了什么?谢谢。

你可以让这个示例更加简洁。完全没有必要使用stat_smoothdrm包。 - Axeman
1个回答

11

?smean.sd(也在?hmisc中链接):

smean.sdl计算平均值加上或减去标准差的常数倍。

并且:

smean.sdl(x,mult = 2,na.rm = TRUE)

因此,默认值似乎是2个标准差。

使用fun.args = list(mult = 1),如stat_summary的示例所示。


太棒了,谢谢!我不知道默认值是2。毕竟,这不是置信区间... - biomiha

网页内容由stack overflow 提供, 点击上面的
可以查看英文原文,
原文链接