I am trying to use tidyr's extract function to pull dates out of the Notes column. The data I'm working with looks like this:
dates <- data.frame(col1 = c("customer", "customer2", "customer3"),
Notes = c("DOB: 12/10/62
START: 09/01/2019
END: 09/01/2020", "
S/DATE: 28/08/19
R/DATE: 27/08/20", "DOB: 13/01/1980
Start:04/12/2018"),
End_date = NA,
Start_Date = NA )
I tried to extract the date that follows the string "S/DATE" like this:
extract <- extract(
dates,
col = "Notes",
into = "Start_date",
regex = "(?<=(S\\/DATE:)).*" # Using regex lookbehind
)
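However, this only extracts the string "S/DATE:" itself, not the date that follows it. When I try the same pattern on regex101.com, it works as expected.
Thanks, Ibrahim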
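
A likely explanation, offered as a sketch rather than a definitive diagnosis: tidyr's extract does not return the overall match (which is what regex101.com highlights). It returns the contents of the capturing groups, one per name in `into`. In the pattern above, the only capturing group is `(S\\/DATE:)` inside the lookbehind, so that text is what ends up in the new column. Moving the parentheses around the date itself should give the intended result. The example below rebuilds the Notes data with explicit `\n` line breaks. The date pattern `\\d{2}/\\d{2}/\\d{2,4}`, the `result` name, and `remove = FALSE` are assumptions made for illustration.

```r
library(tidyr)

# Same Notes values as in the question, written with explicit "\n" line breaks
dates <- data.frame(
  col1 = c("customer", "customer2", "customer3"),
  Notes = c(
    "DOB: 12/10/62\nSTART: 09/01/2019\nEND: 09/01/2020",
    "\nS/DATE: 28/08/19\nR/DATE: 27/08/20",
    "DOB: 13/01/1980\nStart:04/12/2018"
  ),
  stringsAsFactors = FALSE
)

# The parentheses wrap the date, so the date is the captured group.
# Assumed date shape: dd/mm/yy or dd/mm/yyyy.
result <- tidyr::extract(
  dates,
  col = "Notes",
  into = "Start_date",
  regex = "S/DATE:\\s*(\\d{2}/\\d{2}/\\d{2,4})",
  remove = FALSE  # keep the original Notes column next to the new one
)

result$Start_date
```

Rows whose Notes contain no "S/DATE:" entry should come back as NA. Storing the result under a name other than `extract` also avoids masking the function name in the workspace.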