什么是在R中匹配任何中文字符的正则表达式?
[\\p{Han}]
似乎不能按预期工作。v=c("a","b","c","中","e","文")
grep("[\\p{Han}]",v, value = TRUE)
[1] "a"
perl = T
应该会产生正确的结果。R的默认设置是Ville Laurikari的TRE引擎的修改版本(源代码):grep("[\\p{Han}]", v, value = T, perl = T)
#### OUTPUT ####
[1] "中" "文"
Filter(function(x) Encoding(x)=="UTF-8", v)
- NelsonGon