如何在R中将数据框中的后缀替换为前缀?

3

我在R中有一个如下所示的数据框:

df <- data.frame(apple = c("apple", "apple", "applebad", "apple", "apple"),
                orange = c("orangebad", "orangebad", "orange", "orange", "orange"),
                pear = c("pear", "pear", "pear", "pearbad", "pear"))    

然而,我想从这个数据框中将“bad”的后缀更改为“rotten”的前缀,以便获得以下数据框:

df_new <- data.frame(apple = c("apple", "apple", "rottenapple", "apple", "apple"),
                orange = c("rottenorange", "rottenorange", "orange", "orange", "orange"),
                pear = c("pear", "pear", "pear", "rottenpear", "pear")) 

我知道可以使用str_replace(来自stringr)或gsub逐列执行此操作:

df$apple <- gsub("applebad", paste0("rottenapple"), df$apple)
df$apple <- str_replace(df$apple, "applebad", "rottenapple")

但我不确定如何做才能迭代数据框,而无需声明“rottenapple”、“rottenpear”,因为我的真实数据集有数百列。我尝试了以下方法,但并不起作用:

item <- c("apple", "orange", "pear")

df[,1] <- str_replace(df[,1], "bad", paste0("rotten", item[1]))

非常感谢任何帮助。

1个回答

4
我们可以将剩余的字符作为一个组((.*))捕获,接着跟上'bad'并用rotten子字符串和捕获组的反向引用(\\1)进行替换。
library(dplyr)
library(stringr)
df <- df %>% 
    mutate(across(everything(),  ~str_replace(.x, "(.*)bad", "rotten\\1")))

-检查

all.equal(df, df_new)
[1] TRUE

或者使用基础 R

 df[] <- lapply(df, gsub, pattern = "(.*)bad", replacement = "rotten\\1")
> df
        apple       orange       pear
1       apple rottenorange       pear
2       apple rottenorange       pear
3 rottenapple       orange       pear
4       apple       orange rottenpear
5       apple       orange       pear

网页内容由stack overflow 提供, 点击上面的
可以查看英文原文,
原文链接