在polars中，与pandas的DataFrame.drop_duplicates()等效的函数是什么？

Question

在polars中，与pandas的DataFrame.drop_duplicates()等效的函数是什么？

18

在 Polars 中，与 pandas 中的 drop_duplicates() 等价的函数是什么？

import polars as pl
df = pl.DataFrame({"a":[1,1,2], "b":[2,2,3], "c":[1,2,3]})
df

输出：

shape: (3, 3)
┌─────┬─────┬─────┐
│ a   ┆ b   ┆ c   │
│ --- ┆ --- ┆ --- │
│ i64 ┆ i64 ┆ i64 │
╞═════╪═════╪═════╡
│ 1   ┆ 2   ┆ 1   │
├╌╌╌╌╌┼╌╌╌╌╌┼╌╌╌╌╌┤
│ 1   ┆ 2   ┆ 2   │
├╌╌╌╌╌┼╌╌╌╌╌┼╌╌╌╌╌┤
│ 2   ┆ 3   ┆ 3   │
└─────┴─────┴─────┘

代码：

df.drop_duplicates(["a", "b"])

出现以下错误：

属性错误：未找到drop_duplicates

- keiv.fly

2个回答

2

这个函数已经改名为.unique()

请参考他们的Polars文档

- Claus8528

网页内容由stack overflow 提供, 点击上面的

可以查看英文原文，
原文链接

- keiv.fly · Accepted Answer

正确的函数名称是 .unique()。

import polars as pl
df = pl.DataFrame({"a":[1,1,2], "b":[2,2,3], "c":[1,2,3]})
df.unique(subset=["a","b"])

而且这会输出正确的结果：

shape: (2, 3)
┌─────┬─────┬─────┐
│ a   ┆ b   ┆ c   │
│ --- ┆ --- ┆ --- │
│ i64 ┆ i64 ┆ i64 │
╞═════╪═════╪═════╡
│ 1   ┆ 2   ┆ 1   │
├╌╌╌╌╌┼╌╌╌╌╌┼╌╌╌╌╌┤
│ 2   ┆ 3   ┆ 3   │
└─────┴─────┴─────┘