如何在ndarray中计算特定项的出现次数？

Question

如何在ndarray中计算特定项的出现次数？

625

如何统计以下数组中数字0和1的数量？

y = np.array([0, 0, 0, 1, 0, 1, 1, 0, 0, 0, 0, 1])

y.count(0) 的输出为：

numpy.ndarray 对象没有 count 属性。

- mflowww

4

在这种情况下，也可以简单地使用 numpy.count_nonzero 函数。 - Mong H. Ng

32个回答

网页内容由stack overflow 提供, 点击上面的

可以查看英文原文，
原文链接

- Thomas · Answer 1

这需要多一步操作，但更加灵活的解决方案也适用于2D数组和更复杂的过滤器，即创建布尔掩码，然后在掩码上使用.sum()。

>>>>y = np.array([0, 0, 0, 1, 0, 1, 1, 0, 0, 0, 0, 1])
>>>>mask = y == 0
>>>>mask.sum()
8

- sol · Answer 2

一般而言，简单的回答是：

numpy.sum(MyArray==x)   # sum of a binary list of the occurence of x (=0 or 1) in MyArray

这将导致这个完整的代码，例如：

import numpy
MyArray=numpy.array([0, 0, 0, 1, 0, 1, 1, 0, 0, 0, 0, 1])  # array we want to search in
x=0   # the value I want to count (can be iterator, in a list, etc.)
numpy.sum(MyArray==0)   # sum of a binary list of the occurence of x in MyArray

现在，如果MyArray是多维的，并且您想要计算一行（以下简称模式）中值的分布情况。

MyArray=numpy.array([[6, 1],[4, 5],[0, 7],[5, 1],[2, 5],[1, 2],[3, 2],[0, 2],[2, 5],[5, 1],[3, 0]])
x=numpy.array([5,1])   # the value I want to count (can be iterator, in a list, etc.)
temp = numpy.ascontiguousarray(MyArray).view(numpy.dtype((numpy.void, MyArray.dtype.itemsize * MyArray.shape[1])))  # convert the 2d-array into an array of analyzable patterns
xt=numpy.ascontiguousarray(x).view(numpy.dtype((numpy.void, x.dtype.itemsize * x.shape[0])))  # convert what you search into one analyzable pattern
numpy.sum(temp==xt)  # count of the searched pattern in the list of patterns

- deckard · Answer 3

对于通用条目：

x = np.array([11, 2, 3, 5, 3, 2, 16, 10, 10, 3, 11, 4, 5, 16, 3, 11, 4])
n = {i:len([j for j in np.where(x==i)[0]]) for i in set(x)}
ix = {i:[j for j in np.where(x==i)[0]] for i in set(x)}

将输出计数：

{2: 2, 3: 4, 4: 2, 5: 2, 10: 2, 11: 3, 16: 2}

还有索引：

{2: [1, 5],
3: [2, 4, 9, 14],
4: [11, 16],
5: [3, 12],
10: [7, 8],
11: [0, 10, 15],
16: [6, 13]}

- user7055304 · Answer 4

这可以很容易地通过以下方法完成。

y = np.array([0, 0, 0, 1, 0, 1, 1, 0, 0, 0, 0, 1])
y.tolist().count(1)

- The Guy · Answer 5

我这里有一些内容，通过它你可以计算特定数字的出现次数：根据你的代码

count_of_zero=list(y[y==0]).count(0) 

print(count_of_zero)

// according to the match there will be boolean values and according
// to True value the number 0 will be return.

- Sabeer Ebrahim · Answer 6

由于您的ndarray仅包含0和1，您可以使用sum()获取1的出现次数，len()-sum()获取0的出现次数。

num_of_ones = sum(array)
num_of_zeros = len(array)-sum(array)

- BendiXB · Answer 7

这个函数返回数组中变量出现的次数：

def count(array,variable):
    number = 0
    for i in range(array.shape[0]):
        for j in range(array.shape[1]):
            if array[i,j] == variable:
                number += 1
    return number

- Mauricio Arboleda-Zapata · Answer 8

如果你处理的是非常大的数组，使用生成器可能是一个选项。这种方法适用于数组和列表，并且不需要任何额外的软件包。此外，你不会使用太多内存。

my_array = np.array([0, 0, 0, 1, 0, 1, 1, 0, 0, 0, 0, 1])
sum(1 for val in my_array if val==0)
Out: 8

- JLT · Answer 9

如果您不想使用numpy或collections模块，可以使用字典：

d = dict()
a = [0, 0, 0, 1, 0, 1, 1, 0, 0, 0, 0, 1]
for item in a:
    try:
        d[item]+=1
    except KeyError:
        d[item]=1

结果：

>>>d
{0: 8, 1: 4}

当然你也可以使用if/else语句。我认为Counter函数几乎做了相同的事情，但这更透明。

- Haribk · Answer 10

最简单的，如果不必要请注释

import numpy as np
y = np.array([0, 0, 0, 1, 0, 1, 1, 0, 0, 0, 0, 1])
count_0, count_1 = 0, 0
for i in y_train:
    if i == 0:
        count_0 += 1
    if i == 1:
        count_1 += 1
count_0, count_1