ntile function not working in the latest version of R


My data is:

my_basket <- data.frame(ITEM_GROUP = c("Fruit","Fruit","Fruit","Fruit","Fruit","Vegetable","Vegetable","Vegetable","Vegetable","Dairy","Dairy","Dairy","Dairy","Dairy"), 
                       ITEM_NAME = c("Apple","Banana","Orange","Mango","Papaya","Carrot","Potato","Brinjal","Raddish","Milk","Curd","Cheese","Milk","Paneer"),
                       Price = c(100,80,80,90,65,70,60,70,25,60,40,35,50,120)) 

I want to use the ntile function to compute a percentile column:
library(dplyr)
df1 <- mutate(my_basket, percentile_rank = ntile(Price, 100))

It should give me a data frame like correct_df:
correct_df<- data.frame(ITEM_GROUP = c("Fruit","Fruit","Fruit","Fruit","Fruit","Vegetable","Vegetable","Vegetable","Vegetable","Dairy","Dairy","Dairy","Dairy","Dairy"), 
                       ITEM_NAME = c("Apple","Banana","Orange","Mango","Papaya","Carrot","Potato","Brinjal","Raddish","Milk","Curd","Cheese","Milk","Paneer"),
                       Price = c(100,80,80,90,65,70,60,70,25,60,40,35,50,120),
                       percentile_rank=c(86,65,72,79,43,51,29,58,1,36,15,8,22,93))

But the data frame I actually get looks like wrong_df:

wrong_df<- data.frame(ITEM_GROUP = c("Fruit","Fruit","Fruit","Fruit","Fruit","Vegetable","Vegetable","Vegetable","Vegetable","Dairy","Dairy","Dairy","Dairy","Dairy"), 
                        ITEM_NAME = c("Apple","Banana","Orange","Mango","Papaya","Carrot","Potato","Brinjal","Raddish","Milk","Curd","Cheese","Milk","Paneer"),
                        Price = c(100,80,80,90,65,70,60,70,25,60,40,35,50,120),
                        percentile_rank=c(13,10,11,12,7,8,5,9,1,6,3,2,4,14)) 

This problem only appeared after I updated R to version 4.0.2.

1 Answer


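I don't think this is an R problem. It seems to be related to dplyr 1.0.0, as mentioned in this open GitHub issue. Compare the output of the two functions extracted from that issue (presumably the ntile implementations from dplyr 0.8.3 and 1.0.0, going by their names):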

ntile_083(my_basket$Price,100)
#[1] 86 65 72 79 43 51 29 58  1 36 15  8 22 93
ntile_100(my_basket$Price,100)
#[1] 13 10 11 12  7  8  5  9  1  6  3  2  4 14

For now, you can use ntile_083 to get the previous behavior.
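
The definition of ntile_083 is not reproduced here, so below is a minimal base-R sketch of the older behavior. It assumes that the pre-1.0.0 ntile ranked the values with ties broken by order of appearance and then mapped rank r to bucket floor(n * (r - 1) / len + 1), where len is the number of non-missing values. The name ntile_legacy is hypothetical and not part of dplyr. Under that assumption it reproduces the first output above for this data.

```r
# Hypothetical helper (not part of dplyr): assumed pre-1.0.0 bucketing rule.
ntile_legacy <- function(x, n) {
  # Rank values 1..len, breaking ties by order of appearance; NAs stay NA.
  r <- rank(x, ties.method = "first", na.last = "keep")
  len <- sum(!is.na(x))
  if (len == 0L) {
    return(rep(NA_integer_, length(x)))
  }
  # Spread the ranks evenly over buckets 1..n, even when n > len.
  as.integer(floor(n * (r - 1) / len + 1))
}

Price <- c(100, 80, 80, 90, 65, 70, 60, 70, 25, 60, 40, 35, 50, 120)
legacy_rank <- ntile_legacy(Price, 100)
print(legacy_rank)
```

With dplyr loaded, the same helper should drop into the original call, for example mutate(my_basket, percentile_rank = ntile_legacy(Price, 100)).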

