I'm having trouble grouping samples by hour. The data is structured as follows:
data = [
    {
        "pressure": "1009.7",
        "timestamp": "2019-09-03 08:03:00"
    },
    {
        "pressure": "1009.7",
        "timestamp": "2019-09-03 08:18:00"
    },
    {
        "pressure": "1009.8",
        "timestamp": "2019-09-03 08:33:00"
    },
    {
        "pressure": "1009.8",
        "timestamp": "2019-09-03 08:56:00"
    },
    {
        "pressure": "1009.8",
        "timestamp": "2019-09-03 09:03:00"
    },
    {
        "pressure": "1009.8",
        "timestamp": "2019-09-03 09:18:00"
    },
    {
        "pressure": "1009.8",
        "timestamp": "2019-09-03 09:33:00"
    },
    {
        "pressure": "1009.7",
        "timestamp": "2019-09-03 09:56:00"
    },
    {
        "pressure": "1009.6",
        "timestamp": "2019-09-03 10:03:00"
    }
]
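As you can see, there are 4 pressure measurements per hour, and I'd like to compute the average for each hour. I tried to do this with Pandas, but without success. I tried extracting the start and end timestamps, rounding them to the full hour, and passing them as the index to a DataFrame with the JSON as the data, but the shapes don't match (no surprise). I thought I could pass the data to the df like this and then compute the mean, but it seems some intermediate steps are needed.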
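For reference, the intermediate steps might look something like the sketch below. It is only one possible approach, and it assumes that the pressure strings can be converted to floats and that the timestamps parse cleanly with `pd.to_datetime`. Instead of building an hourly index by hand, it puts the parsed timestamps themselves on the index and lets `resample` do the hourly bucketing. The names `df` and `hourly_mean` are just illustrative.

```python
import pandas as pd

# Same sample data as in the question.
data = [
    {"pressure": "1009.7", "timestamp": "2019-09-03 08:03:00"},
    {"pressure": "1009.7", "timestamp": "2019-09-03 08:18:00"},
    {"pressure": "1009.8", "timestamp": "2019-09-03 08:33:00"},
    {"pressure": "1009.8", "timestamp": "2019-09-03 08:56:00"},
    {"pressure": "1009.8", "timestamp": "2019-09-03 09:03:00"},
    {"pressure": "1009.8", "timestamp": "2019-09-03 09:18:00"},
    {"pressure": "1009.8", "timestamp": "2019-09-03 09:33:00"},
    {"pressure": "1009.7", "timestamp": "2019-09-03 09:56:00"},
    {"pressure": "1009.6", "timestamp": "2019-09-03 10:03:00"},
]

# Build the frame from the list of dicts; every value starts out as a string.
df = pd.DataFrame(data)

# Intermediate step 1: convert the strings to proper dtypes
# (assumes every value is well-formed).
df["pressure"] = df["pressure"].astype(float)
df["timestamp"] = pd.to_datetime(df["timestamp"])

# Intermediate step 2: index by timestamp, then bucket into one-hour bins.
# Each bin is labelled with the start of the hour (08:00, 09:00, ...).
hourly_mean = (
    df.set_index("timestamp")["pressure"]
    .resample(pd.offsets.Hour())
    .mean()
)

print(hourly_mean)
```

Note that with `resample`, an hour that contains no samples at all would still appear in the result, with `NaN` as its mean. If that is not wanted, grouping on `df["timestamp"].dt.hour` (or on the date plus the hour) could be an alternative.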