Convert a binary string to a NumPy array

Question

34

Suppose I have the following string:

my_data = '\x00\x00\x80?\x00\x00\x00@\x00\x00@@\x00\x00\x80@'

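Where I got it from doesn't matter here, but to make things concrete, assume I read it from a binary file.

I know my string is the binary representation of four floats (4 bytes each). I would like to get those floats as a numpy array. I can do this: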

import struct
import numpy as np
tple = struct.unpack( '4f', my_data )
my_array = np.array( tple, dtype=np.float32 )

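But creating an intermediate tuple seems clumsy. Is there a way to do this without creating an intermediate tuple?

EDIT

I would also like to construct the array in such a way that I can specify the byte order of the string.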

- mgilson

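Possible duplicate of: How do I create a numpy array from a string? - Aurelius

@aurelius I'd say that question is close, but not quite a duplicate. The answers are similar, but this question is about floats and that one is about integers. - Uyghur Lives Matter

2 Answers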

0

np.fromstring() is deprecated for binary data; use np.frombuffer() instead.

import numpy as np

my_data = b'\x00\x00\x80?\x00\x00\x00@\x00\x00@@\x00\x00\x80@'

# np.fromstring is deprecated
# data = np.fromstring(my_data, np.float32)
data = np.frombuffer(my_data, np.float32)

print(data)

[1. 2. 3. 4.]
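One thing worth knowing when switching to np.frombuffer(): the result is a view onto the original bytes object rather than a copy, so it is read-only. The following is a minimal sketch of that behavior, assuming the same sample data as above; the names view and writable are illustrative only.

```python
import numpy as np

my_data = b'\x00\x00\x80?\x00\x00\x00@\x00\x00@@\x00\x00\x80@'

# frombuffer shares memory with the immutable bytes object,
# so the resulting array cannot be modified in place.
view = np.frombuffer(my_data, dtype='<f4')
print(view.flags.writeable)

# copy() produces an independent array that can be modified.
writable = view.copy()
writable[0] = 10.0
print(writable)
```

If the array only needs to be read, the view avoids copying the data; call copy() only when in-place modification is required.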

- yoonghm

- JAB · Accepted Answer

>>> np.frombuffer(b'\x00\x00\x80?\x00\x00\x00@\x00\x00@@\x00\x00\x80@', dtype='<f4') # or dtype=np.dtype('<f4'), or np.float32 on a little-endian system (which most computers are these days)
array([ 1.,  2.,  3.,  4.], dtype=float32)

Or, if you want big-endian:

>>> np.frombuffer(b'\x00\x00\x80?\x00\x00\x00@\x00\x00@@\x00\x00\x80@', dtype='>f4') # or dtype=np.dtype('>f4'), or np.float32  on a big-endian system
array([  4.60060299e-41,   8.96831017e-44,   2.30485571e-41,
         4.60074312e-41], dtype=float32)
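The tiny values above appear because the sample bytes are little-endian and are being read as big-endian. As a rough illustration, the sketch below builds genuinely big-endian data with struct.pack (an assumption made for the example, not part of the original question) and shows that '>f4' then decodes it correctly, while '<f4' misreads it. The variable names are hypothetical.

```python
import struct
import numpy as np

# Pack four floats in big-endian order to get test data.
big_endian_bytes = struct.pack('>4f', 1.0, 2.0, 3.0, 4.0)

# The dtype's byte-order prefix must match the data.
correct = np.frombuffer(big_endian_bytes, dtype='>f4')
misread = np.frombuffer(big_endian_bytes, dtype='<f4')
print(correct)
print(misread)

# astype converts the values into a native-order float32 copy.
native = correct.astype(np.float32)
print(native.dtype.isnative)
```

Converting to the native byte order with astype is optional, but some downstream code expects native-order arrays.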

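Before Python 3, the b prefix isn't necessary, of course.

In fact, if you really are loading the data from a binary file, you can skip the string step altogether and load the data directly from the file with numpy.fromfile().

Also, for reference on dtype, see: http://docs.scipy.org/doc/numpy/reference/arrays.dtypes.html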
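A minimal sketch of the numpy.fromfile() route, assuming the file holds nothing but raw little-endian 4-byte floats. The temporary file is created here only to make the example self-contained; in practice the path would point to an existing data file.

```python
import os
import tempfile
import numpy as np

my_data = b'\x00\x00\x80?\x00\x00\x00@\x00\x00@@\x00\x00\x80@'

# Write the sample bytes to a temporary file (stand-in for a real data file).
with tempfile.NamedTemporaryFile(suffix='.bin', delete=False) as f:
    f.write(my_data)
    path = f.name

try:
    # Read the whole file as little-endian float32 values.
    from_file = np.fromfile(path, dtype='<f4')
finally:
    os.remove(path)

print(from_file)
```

As with np.frombuffer(), the byte order is chosen through the dtype string, so '>f4' would be used for a big-endian file.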