Python Pandas Lambda：在DataFrame中使用多个变量Lambda

Question

Python Pandas Lambda：在DataFrame中使用多个变量Lambda

5

我有一个如下所示的系列：

example = pd.Series([[1.0, 1209.75, 1207.25],
 [1.0, 1211.0, 1207.5],
 [-1.0, 1211.25, 1205.75],
 [0, 1207.25, 1206.0],
 [1.0, 1206.25, 1201.0],
 [-1.0, 1205.75, 1202.75],
 [0, 1205.5, 1203.75]])

这个系列基本上是每个单元格中有3个数字的列表。我将其转换为DataFrame并添加了一个新列：

example = example.to_frame(name="input")
example["result"]=np.NaN

现在我想对它执行以下操作：

example["result"] = example["input"].apply(lambda x,y,z: y if x==1 else z if x==-1 else NaN)

尝试执行时，我收到以下错误消息： 缺少 2 个必需的位置参数：'y' 和 'z'

- jim jarnac

2个回答

0

这是一个矢量化的解决方案：

In [30]: example
Out[30]:
                      input
0   [1.0, 1209.75, 1207.25]
1     [1.0, 1211.0, 1207.5]
2  [-1.0, 1211.25, 1205.75]
3      [0, 1207.25, 1206.0]
4    [1.0, 1206.25, 1201.0]
5  [-1.0, 1205.75, 1202.75]
6      [0, 1205.5, 1203.75]

In [31]: example['result'] = np.where(np.isclose(example.input.str[0], 1),
    ...:                              example.input.str[1],
    ...:                              np.where(np.isclose(example.input.str[0], -1),
    ...:                                       example.input.str[2],
    ...:                                       np.nan))
    ...:

In [32]: example
Out[32]:
                      input   result
0   [1.0, 1209.75, 1207.25]  1209.75
1     [1.0, 1211.0, 1207.5]  1211.00
2  [-1.0, 1211.25, 1205.75]  1205.75
3      [0, 1207.25, 1206.0]      NaN
4    [1.0, 1206.25, 1201.0]  1206.25
5  [-1.0, 1205.75, 1202.75]  1202.75
6      [0, 1205.5, 1203.75]      NaN

- MaxU - stand with Ukraine

这并未处理 example.str[0] 为 -1 的情况。 - Moses Koledoye

@MaxU 这很有趣，不过 .isclose 有点不幸。 - jim jarnac

@MosesKoledoye，感谢您指出这个问题！我已经更正了我的答案。 - MaxU - stand with Ukraine

@jimbasquiat，当使用float数据类型时，最好使用isclose或allclose来比较值。 - MaxU - stand with Ukraine

网页内容由stack overflow 提供, 点击上面的

可以查看英文原文，
原文链接

- Moses Koledoye · Accepted Answer

Lambda只接受一个参数，这里的参数是一个列表。只需对列表进行索引即可：

>>> example["result"] = example["input"].apply(lambda lst: lst[1] if lst[0]==1 else lst[2] if lst[0]==-1 else np.NaN)
>>> example
                      input   result
0   [1.0, 1209.75, 1207.25]  1209.75
1     [1.0, 1211.0, 1207.5]  1211.00
2  [-1.0, 1211.25, 1205.75]  1205.75
3      [0, 1207.25, 1206.0]      NaN
4    [1.0, 1206.25, 1201.0]  1206.25
5  [-1.0, 1205.75, 1202.75]  1202.75
6      [0, 1205.5, 1203.75]      NaN

轻松一点，您可以将嵌套的三元运算符重构为带有嵌套if语句的函数，以使您的代码更易读：

def func(lst):
    x, y, z = lst
    if x == 1:
        return y
    elif x == -1:
        return z
    else:
        return np.NaN


example["result"] = example["input"].apply(func)