我有一个包含12个数据框的列表。列表的名称为kvish_1_10t.tables。每个数据框都有一个名为“day_mean”的列(在所有数据框中始终是第7列)。需要注意的是,所有数据框看起来完全相同。以下是其中一个表格的示例:
X2014_kvish_1_10t
kvish keta maslul yom nefah date day_mean
1 1 10 1 1 1936 2014-09-07 00:00:00 2910.958
2 1 10 1 1 966 2014-09-07 01:00:00 2910.958
3 1 10 1 1 737 2014-09-07 02:00:00 2910.958
4 1 10 1 1 596 2014-09-07 03:00:00 2910.958
5 1 10 1 1 479 2014-09-07 04:00:00 2910.958
6 1 10 1 1 765 2014-09-07 05:00:00 2910.958
7 1 10 1 1 2082 2014-09-07 06:00:00 2910.958
8 1 10 1 1 3624 2014-09-07 07:00:00 2910.958
9 1 10 1 1 3847 2014-09-07 08:00:00 2910.958
10 1 10 1 1 2960 2014-09-07 09:00:00 2910.958
11 1 10 1 1 2871 2014-09-07 10:00:00 2910.958
12 1 10 1 1 3149 2014-09-07 11:00:00 2910.958
13 1 10 1 1 3615 2014-09-07 12:00:00 2910.958
14 1 10 1 1 3943 2014-09-07 13:00:00 2910.958
15 1 10 1 1 4079 2014-09-07 14:00:00 2910.958
16 1 10 1 1 4856 2014-09-07 15:00:00 2910.958
17 1 10 1 1 5010 2014-09-07 16:00:00 2910.958
18 1 10 1 1 4783 2014-09-07 17:00:00 2910.958
19 1 10 1 1 4684 2014-09-07 18:00:00 2910.958
20 1 10 1 1 4478 2014-09-07 19:00:00 2910.958
21 1 10 1 1 3610 2014-09-07 20:00:00 2910.958
22 1 10 1 1 2799 2014-09-07 21:00:00 2910.958
23 1 10 1 1 2346 2014-09-07 22:00:00 2910.958
24 1 10 1 1 1648 2014-09-07 23:00:00 2910.958
25 1 10 1 2 1145 2014-09-08 00:00:00 2745.917
26 1 10 1 2 671 2014-09-08 01:00:00 2745.917
...
168 rows total
现在,我改变了“day_mean”列(右侧的第7列),使得在 seq(1,168,24)位置上的值(即1,25,49,73,97,121,145)保持不变,其余位置将变为NA。因此,我编写了以下内容来定义一个数字向量,表示将获得NA值的“day_mean”列中的位置:
aa = seq(1, 168 , 24)
bb = rep(T, 168)
bb[aa] = F
cc= (which(bb))
X2014_kvish_1_10t[,7][cc] = NA
现在,您可以看到,我更改了“day_mean”列,以便只保留相关值,并使其余部分变为NA。就像这样:
> X2014_kvish_1_10t
kvish keta maslul yom nefah date day_mean
1 1 10 1 1 1936 2014-09-07 00:00:00 2910.958
2 1 10 1 1 966 2014-09-07 01:00:00 NA
3 1 10 1 1 737 2014-09-07 02:00:00 NA
4 1 10 1 1 596 2014-09-07 03:00:00 NA
5 1 10 1 1 479 2014-09-07 04:00:00 NA
6 1 10 1 1 765 2014-09-07 05:00:00 NA
7 1 10 1 1 2082 2014-09-07 06:00:00 NA
8 1 10 1 1 3624 2014-09-07 07:00:00 NA
9 1 10 1 1 3847 2014-09-07 08:00:00 NA
10 1 10 1 1 2960 2014-09-07 09:00:00 NA
11 1 10 1 1 2871 2014-09-07 10:00:00 NA
12 1 10 1 1 3149 2014-09-07 11:00:00 NA
13 1 10 1 1 3615 2014-09-07 12:00:00 NA
14 1 10 1 1 3943 2014-09-07 13:00:00 NA
15 1 10 1 1 4079 2014-09-07 14:00:00 NA
16 1 10 1 1 4856 2014-09-07 15:00:00 NA
17 1 10 1 1 5010 2014-09-07 16:00:00 NA
18 1 10 1 1 4783 2014-09-07 17:00:00 NA
19 1 10 1 1 4684 2014-09-07 18:00:00 NA
20 1 10 1 1 4478 2014-09-07 19:00:00 NA
21 1 10 1 1 3610 2014-09-07 20:00:00 NA
22 1 10 1 1 2799 2014-09-07 21:00:00 NA
23 1 10 1 1 2346 2014-09-07 22:00:00 NA
24 1 10 1 1 1648 2014-09-07 23:00:00 NA
25 1 10 1 2 1145 2014-09-08 00:00:00 2745.917
26 1 10 1 2 671 2014-09-08 01:00:00 NA
27 1 10 1 2 497 2014-09-08 02:00:00 NA
...
168 rows total
到目前为止,一切都进行得很顺利,但当我尝试对列表中的所有数据框执行相同的操作时,它失败了。我尝试编写以下命令,但它并不好用,我创建了一个函数,使每个数据框中的所有7列都获得新值:
func = function(x) (x[,7][cc] = NA)
lapply(kvish_1_10t.tables, func)
我该如何更改每个数据框中所有的 day_mean 列?
Error in
[<-.data.frame(
tmp, cc, value = NA) : new columns would leave holes after existing columns
。 - Michael Spector