您正在寻找np.vsplit
(将数组垂直(按行)分成多个子数组) -
np.vsplit(data,300)
样例运行 -
In [56]: data
Out[56]:
array([[ 0.46677419, 0.07402051, 0.87270029, 0.12481164],
[ 0.40789713, 0.36018843, 0.41731607, 0.17348898],
[ 0.4701256 , 0.10056201, 0.31289602, 0.18681709],
[ 0.52407036, 0.89913995, 0.59097535, 0.38376443],
[ 0.06734662, 0.24470334, 0.09523911, 0.35680219],
[ 0.91178257, 0.58710922, 0.75099017, 0.24929987]])
In [57]: np.vsplit(data,3)
Out[57]:
[array([[ 0.46677419, 0.07402051, 0.87270029, 0.12481164],
[ 0.40789713, 0.36018843, 0.41731607, 0.17348898]]),
array([[ 0.4701256 , 0.10056201, 0.31289602, 0.18681709],
[ 0.52407036, 0.89913995, 0.59097535, 0.38376443]]),
array([[ 0.06734662, 0.24470334, 0.09523911, 0.35680219],
[ 0.91178257, 0.58710922, 0.75099017, 0.24929987]])]
根据您的使用需求,您可以将2D输入数组重新塑形为一个3D数组,该数组在第一个轴上的长度必须为300
。这将在性能和内存方面更加高效。内存方面,它必须是免费的,因为reshaping
只创建了NumPy数组的一个视图。实现如下:
data.reshape(300,-1,data.shape[1])
样例运行 -
In [68]: data
Out[68]:
array([[ 0.46677419, 0.07402051, 0.87270029, 0.12481164],
[ 0.40789713, 0.36018843, 0.41731607, 0.17348898],
[ 0.4701256 , 0.10056201, 0.31289602, 0.18681709],
[ 0.52407036, 0.89913995, 0.59097535, 0.38376443],
[ 0.06734662, 0.24470334, 0.09523911, 0.35680219],
[ 0.91178257, 0.58710922, 0.75099017, 0.24929987]])
In [69]: data.reshape(3,-1,data.shape[1])
Out[69]:
array([[[ 0.46677419, 0.07402051, 0.87270029, 0.12481164],
[ 0.40789713, 0.36018843, 0.41731607, 0.17348898]],
[[ 0.4701256 , 0.10056201, 0.31289602, 0.18681709],
[ 0.52407036, 0.89913995, 0.59097535, 0.38376443]],
[[ 0.06734662, 0.24470334, 0.09523911, 0.35680219],
[ 0.91178257, 0.58710922, 0.75099017, 0.24929987]]])
这里有一些运行时测试,旨在比较实际的拆分和重塑方法的性能 -
In [72]: data = np.random.rand(6000,40)
In [73]: %timeit np.vsplit(data,300)
100 loops, best of 3: 7.05 ms per loop
In [74]: %timeit data.reshape(300,-1,data.shape[1])
1000000 loops, best of 3: 1.08 µs per loop
np.split()
函数:new_array=np.split(D,300)
。 - Mazdak