Most common element in a string array, MATLAB

Question

8

I have an array of strings, for example:

arr = ['hello'; 'world'; 'hello'; 'again'; 'I----'; 'said-'; 'hello'; 'again']

How can I extract the most common string, which in this example is 'hello'?

- newzad

2 Answers

-1

It is better to use a cell array and the regexp function; a string array may not behave the way you expect.

arr = {'hello', 'world'; 'hello', 'again'; 'I----', 'said-'; 'hello', 'again'};

If you use

hellos = sum(~cellfun('isempty', regexp(arr, 'hello')));

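it returns the number of cells in arr that contain 'hello'. For a 2-D cell array such as the one above, sum works column by column, so the result is one count per column.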

- innoSPG

2

-1: The question asks for the most frequent string, not for a specific predetermined string. - Eitan T

Even if you are looking for a specific string, regexp is a bit of overkill. strcmp can be used to identify equal strings in a cell array. - kwatford
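
A minimal sketch of the strcmp approach mentioned in the comment above, assuming the same 4-by-2 cell array as in the answer. The variable name is_hello is introduced here only for illustration.

```matlab
% Count exact matches of one known string with strcmp (no regular expression needed).
arr = {'hello', 'world'; 'hello', 'again'; 'I----', 'said-'; 'hello', 'again'};

is_hello = strcmp(arr, 'hello');   % logical array, same size as arr
hellos = sum(is_hello(:));         % (:) flattens so a single total is returned
```

Unlike regexp(arr, 'hello'), which also matches cells that merely contain 'hello' as a substring, strcmp is true only for cells that equal 'hello' exactly.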


- Hugh Nolan · Accepted Answer

First, use a cell array instead of a string array:

arr = {'hello', 'world'; 'hello', 'again'; 'I----', 'said-'; 'hello', 'again'};

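Next, use the unique function to get the unique strings (applied directly to a string array, unique returns unique characters rather than unique strings, which is why I suggest using cells):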

[unique_strings, ~, string_map]=unique(arr);

Then use the mode function to find the most frequent value in the string_map variable:

most_common_string=unique_strings(mode(string_map));
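
The steps above can be put together as a short sketch that starts from the character matrix in the question. It assumes every row has the same length, and that converting with cellstr is acceptable (cellstr strips trailing whitespace from each row). The names words and counts are introduced here for illustration only.

```matlab
% Char matrix from the question: every row has the same length (5 characters).
arr = ['hello'; 'world'; 'hello'; 'again'; 'I----'; 'said-'; 'hello'; 'again'];

words = cellstr(arr);                          % 8x1 cell array, one string per row
[unique_strings, ~, string_map] = unique(words);

% Occurrences of each unique string, in the same order as unique_strings.
counts = accumarray(string_map(:), 1);

% Brace indexing returns the char vector itself rather than a 1x1 cell.
most_common_string = unique_strings{mode(string_map)};
```

If several strings are tied for the highest count, mode returns the smallest index, so the tied string that comes first in the sorted unique_strings is selected.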