How can I change column names from suffix to prefix when using pivot_wider()?

17

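I am trying to figure out how to change the way tidyr's pivot_wider() function names the new variables in the wide dataset. Specifically, I want the "names_from" variable to be added to each new variable name as a prefix rather than a suffix.

My dataset looks like this: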

list(ID = c("A950", "A950", "A950", "A970", "A970", "A970", "A996", "A996", "A996"), 
Phase = c("P1", "P2", "P3", "P1", "P2", "P3", "P1", "P2", "P3"), 
A = c(23.5, 25.2, 21.9, 21.9, 21.1, 20.3, 19.5, 18.7, 17.9), 
B = c(21.9, 21.1, 20.3, 19.5, 18.7, 17.9, 17.1, 16.3, 15.5), 
C = c(25.2, 21.9, 20.3, 17.6, 15.1, 12.7, 10.3, 7.8, 5.4), 
D = c("M", "M", "M", "F", "F", "F", "N", "N", "N"))

After widening the dataset with pivot_wider(), using Phase as the "key", I get the following result:

ex_wide <- ex_long %>%
  pivot_wider(names_from = Phase, values_from = c(3:6))

list(ID = c("A950", "A970", "A996"), 
A_P1 = c(23.5, 21.9, 19.5), 
A_P2 = c(25.2, 21.1, 18.7), 
A_P3 = c(21.9, 20.3, 17.9), 
B_P1 = c(21.9, 19.5, 17.1), 
B_P2 = c(21.1, 18.7, 16.3), 
B_P3 = c(20.3, 17.9, 15.5), 
C_P1 = c(25.2, 17.6, 10.3), 
C_P2 = c(21.9, 15.1, 7.8), 
C_P3 = c(20.3, 12.7, 5.4), 
D_P1 = c("M", "F", "N"), 
D_P2 = c("M", "F", "N"), 
D_P3 = c("M", "F", "N"))
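
I would like the column names to look like P1_A instead of A_P1 (that is, phase_variable rather than variable_phase). This seems like it should be easy to solve, but I have not found a solution that fits my needs. Any help is much appreciated. Thanks in advance.

1 Answer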

23
You can use the names_glue argument to do this:
ex_wide <- ex_long %>%
  pivot_wider(names_from = Phase, values_from = c(3:6), names_glue = "{Phase}_{.value}")

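The glue template puts the Phase value first and .value (a placeholder for the name of each column given in values_from) second, separated by an underscore.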

Result

library(dplyr)
library(tidyr)

ex_wide <- ex_long %>%
  pivot_wider(names_from = Phase, values_from = c(3:6), names_glue = "{Phase}_{.value}")

ex_wide
#> # A tibble: 3 x 13
#>   ID     P1_A  P2_A  P3_A  P1_B  P2_B  P3_B  P1_C  P2_C  P3_C P1_D  P2_D  P3_D 
#>   <chr> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <chr> <chr> <chr>
#> 1 A950   23.5  25.2  21.9  21.9  21.1  20.3  25.2  21.9  20.3 M     M     M    
#> 2 A970   21.9  21.1  20.3  19.5  18.7  17.9  17.6  15.1  12.7 F     F     F    
#> 3 A996   19.5  18.7  17.9  17.1  16.3  15.5  10.3   7.8   5.4 N     N     N
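
In the output above, the columns are still grouped by variable (all A columns, then all B columns, and so on). If grouping by phase is preferred once the phase is the prefix, the names_vary argument of pivot_wider() may help. The following is a minimal sketch on a small hypothetical data frame (toy_long is not part of the question's data), and it assumes a tidyr release recent enough to provide names_vary (1.2.0 or later, as far as I know):

```r
library(tidyr)

# Hypothetical toy data with the same shape as ex_long, only smaller
toy_long <- data.frame(
  ID    = c("x", "x", "y", "y"),
  Phase = c("P1", "P2", "P1", "P2"),
  A     = c(1, 2, 3, 4),
  B     = c(5, 6, 7, 8)
)

toy_wide <- pivot_wider(
  toy_long,
  names_from  = Phase,
  values_from = c(A, B),
  names_glue  = "{Phase}_{.value}",  # phase first, variable name second
  names_vary  = "slowest"            # keep the columns of one phase next to each other
)

names(toy_wide)
```

With the default names_vary = "fastest", the same call would keep the variable-wise ordering shown in the result above.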

Data

ex_long <- structure(list(ID = c("A950", "A950", "A950", "A970", "A970", 
"A970", "A996", "A996", "A996"), Phase = c("P1", "P2", "P3", 
"P1", "P2", "P3", "P1", "P2", "P3"), A = c(23.5, 25.2, 21.9, 
21.9, 21.1, 20.3, 19.5, 18.7, 17.9), B = c(21.9, 21.1, 20.3, 
19.5, 18.7, 17.9, 17.1, 16.3, 15.5), C = c(25.2, 21.9, 20.3, 
17.6, 15.1, 12.7, 10.3, 7.8, 5.4), D = c("M", "M", "M", "F", 
"F", "F", "N", "N", "N")), class = "data.frame", row.names = c(NA, 
-9L))
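
If names_glue is not available in the installed tidyr version, a possible alternative is to pivot with the default names and swap the two parts afterwards with dplyr's rename_with(). This is only a sketch on a small hypothetical data frame (toy_long is an illustration, not the question's data), and it assumes every widened column name has the form variable_phase:

```r
library(dplyr)
library(tidyr)

# Hypothetical toy data with the same shape as ex_long, only smaller
toy_long <- data.frame(
  ID    = c("x", "x", "y", "y"),
  Phase = c("P1", "P2", "P1", "P2"),
  A     = c(1, 2, 3, 4),
  B     = c(5, 6, 7, 8)
)

toy_wide <- toy_long %>%
  # Default naming gives A_P1, A_P2, B_P1, B_P2
  pivot_wider(names_from = Phase, values_from = c(A, B)) %>%
  # Swap the parts around the last underscore; leave the ID column alone
  rename_with(~ sub("^(.*)_(.*)$", "\\2_\\1", .x), .cols = -ID)

names(toy_wide)
```

Because the first capture group is greedy, the split happens at the last underscore, so this works as long as the phase labels themselves contain no underscore.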
