我希望重新创建一个R脚本,但我不知道如何在Python中重新创建这个管道。我正在分析不同工厂的累计产量,并需要归一化它们的累计生产时间以进行比较。
这个管道看起来像这样:
Norm_hrs <- Cum_df%>%
group_by(Name)%>%
complete(Cum_hrs = seq(0,max(Cum_hrs),730.5))
它需要这个:
Name Cum_Hrs A B C
Factory 1 1 0 1.887861 3.775722
Factory 1 251 0 2104.335728 21932.57871
Factory 1 611 0 2324.586178 37498.99722
Factory 1 1208 0 4361.588197 65235.05541
Factory 2 48 0 1517.840244 6604.770432
Factory 2 163 0 3370.461172 17252.70972
Factory 2 822 0 13284.87786 71918.78308
Factory 2 1541 0 21476.93602 134569.0388
Factory 2 2285 0 32053.99192 225895.1477
Factory 2 3028 0 42299.41357 340798.6151
Factory 2 3699 0 50125.85599 462145.5438
Factory 2 4436 0 56715.74945 584474.9989
将其转换为以下内容:
并将它变成这样:
Name Cum_Hrs A B C
Factory 1 1 0 1.887861 3.775722
Factory 1 251 0 2104.335728 21932.57871
Factory 1 611 0 2324.586178 37498.99722
Factory 1 730.5 NA NA NA
Factory 1 1208 0 4361.588197 65235.05541
Factory 2 48 0 1517.840244 6604.770432
Factory 2 163 0 3370.461172 17252.70972
Factory 2 730.5 NA NA NA
Factory 2 822 0 13284.87786 71918.78308
Factory 2 1461 NA NA NA
Factory 2 1541 0 21476.93602 134569.0388
Factory 2 2091.5 NA NA NA
Factory 2 2285 0 32053.99192 225895.1477
Factory 2 2922 NA NA NA
Factory 2 3028 0 42299.41357 340798.6151
这反过来使得我可以在规范化的时间步骤中插值DataFrame中NA的值。