在pandas中,基于另一个dataframe的掩码值,使得一个dataframe中的值变为NaN

3

我有一个数据框

df1 = pd.DataFrame([["A",1,98,56,51], ["B",1,99,74,36], ["C",1,97,82,83],["B",1,96,31,90], ["C",1,45,92,12], ["A",1,67,33,55]], columns=["id","date","c1","c2","c3"])

我有另一个数据框,它具有相同的列名

df2 = pd.DataFrame([["A",1,False,False,True], ["B",1,False,False,True], ["C",1,False,False,False],["B",1,False,True,False], ["C",1,True,False,True], ["A",1,False,True,False]], columns=["id","date","c1","c2","c3"])

我希望得到一个名为df_out的数据框,其中df2中值为True的替换为空白/NaN在df1中,如果为False,则保持不变。

期望输出:

df_out = pd.DataFrame([["A",1,98,56, ""], ["B",1,99,74,""], ["C",1,97,82,83],["B",1,96,"",90], ["C",1,"",92,""], ["A",1,67,"",55]], columns=["id","date","c1","c2","c3"])

如何做到呢?

1个回答

5

试试使用where

l = ['c1','c2','c3']
df1[l] = df1[l].where(df2[l]==False,'')
df1
Out[199]: 
  id  date  c1  c2  c3
0  A     1  98  56    
1  B     1  99  74    
2  C     1  97  82  83
3  B     1  96      90
4  C     1      92    
5  A     1  67      55

网页内容由stack overflow 提供, 点击上面的
可以查看英文原文,
原文链接