import pandas as pd

df = pd.DataFrame({
    "A": ["2002-01-12", "2002-01-12", "2002-01-12", "2002-01-13", "2002-01-13", "2002-01-13", "2002-01-16", "2002-01-16", "2002-01-16"],
    "B": ["12:00:00", "13:00:00", "14:00:00", "11:00:00", "12:00:00", "13:00:00", "10:00:00", "11:00:00", "12:00:00"],
    "C": [3, 19, 15, 6, 1, 5, 3, 12, 8],
})
            A         B   C
0  2002-01-12  12:00:00   3
1  2002-01-12  13:00:00  19
2  2002-01-12  14:00:00  15
3  2002-01-13  11:00:00   6
4  2002-01-13  12:00:00   1
5  2002-01-13  13:00:00   5
6  2002-01-16  10:00:00   3
7  2002-01-16  11:00:00  12
8  2002-01-16  12:00:00   8
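I want to create two new columns, df['D'] and df['E'], computed per A group according to the following conditions:
df['D']: the value of C at B == 12:00:00 from the previous day (that is, the previous A group).
df['E']: the mean of the C values from the previous day (that is, the previous A group).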
            A         B   C  D     E
0  2002-01-12  12:00:00   3  0     0
1  2002-01-12  13:00:00  19  0     0
2  2002-01-12  14:00:00  15  0     0
3  2002-01-13  11:00:00   6  3  12.3
4  2002-01-13  12:00:00   1  3  12.3
5  2002-01-13  13:00:00   5  3  12.3
6  2002-01-16  10:00:00   3  1   4.0
7  2002-01-16  11:00:00  12  1   4.0
8  2002-01-16  12:00:00   8  1   4.0
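One possible way to get the table above, offered as a minimal sketch rather than a definitive answer: summarise each A group once, shift the summary down by one group, and join it back onto the rows. The names per_day, prev and result are made up for illustration. The sketch assumes "previous day" means the previous A group present in the data (so 2002-01-16 looks back to 2002-01-13, as in the expected output), that the ISO date strings in A sort chronologically, and that the first group is filled with 0.

```python
import pandas as pd

df = pd.DataFrame({
    "A": ["2002-01-12"] * 3 + ["2002-01-13"] * 3 + ["2002-01-16"] * 3,
    "B": ["12:00:00", "13:00:00", "14:00:00",
          "11:00:00", "12:00:00", "13:00:00",
          "10:00:00", "11:00:00", "12:00:00"],
    "C": [3, 19, 15, 6, 1, 5, 3, 12, 8],
})

# One row per A group: C at 12:00:00 (D) and the mean of C (E).
# groupby sorts the keys, which is chronological for ISO date strings.
per_day = pd.DataFrame({
    "D": df[df["B"] == "12:00:00"].groupby("A")["C"].first(),
    "E": df.groupby("A")["C"].mean(),
})

# Shift by one group so each day sees the previous group's summary.
# The first group has no predecessor; fill with 0 (assumption taken
# from the expected output).
prev = per_day.shift(1).fillna(0).astype({"D": "int64"})

# Broadcast the shifted summary back to every row of its group.
result = df.join(prev, on="A")
print(result)
```

E is left unrounded here (12.333... rather than 12.3); add .round(1) if the one-decimal display matters. A day with no 12:00:00 row would also end up with D == 0 for the following group under this sketch, which may or may not be what is wanted.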
... the 12:00:00 value? - jezrael