我的情况:
我有一个“主”数据框,其中包含以下列:
userid, condition
由于有四种实验条件,我也有四个数据框承载答案信息,每个数据框都包括以下列:
userid, condition, answer1, answer2
现在,我想将这些内容合并,以便将用户ID、条件和它们对应的答案组合成所有可能的组合。每一行中,每个条件只应该有正确的答案出现在相应的列中。
简短、自包含的示例:
master = data.frame(userid=c("foo","foo","foo","foo","bar","bar","bar","bar"), condition=c("A","B","C","D","A","B","C","D"))
cond_a = data.frame(userid=c("foo","bar"), condition="A", answer1=c("1","1"), answer2=c("2","2"))
cond_b = data.frame(userid=c("foo","bar"), condition="B", answer1=c("3","3"), answer2=c("4","4"))
cond_c = data.frame(userid=c("foo","bar"), condition="C", answer1=c("5","5"), answer2=c("6","6"))
cond_d = data.frame(userid=c("foo","bar"), condition="D", answer1=c("7","7"), answer2=c("8","8"))
我应如何将所有条件合并到主表中,使主表看起来像下面这样?
userid condition answer1 answer2
1 bar A 1 2
2 bar B 3 4
3 bar C 5 6
4 bar D 7 8
5 foo A 1 2
6 foo B 3 4
7 foo C 5 6
8 foo D 7 8
我已经尝试了以下方法:
temp = merge(master, cond_a, all.x=TRUE)
这让我得到:
userid condition answer1 answer2
1 bar A 1 2
2 bar B <NA> <NA>
3 bar C <NA> <NA>
4 bar D <NA> <NA>
5 foo A 1 2
6 foo B <NA> <NA>
7 foo C <NA> <NA>
8 foo D <NA> <NA>
但是一旦我这样做了...
merge(temp, cond_b, all.x=TRUE)
条件B
没有值。为什么会这样?
userid condition answer1 answer2
1 bar A 1 2
2 bar B <NA> <NA>
3 bar C <NA> <NA>
4 bar D <NA> <NA>
5 foo A 1 2
6 foo B <NA> <NA>
7 foo C <NA> <NA>
8 foo D <NA> <NA>
merge(temp, cond_b, all=TRUE)
,但这会给我带来额外的带有NA
的行。不是很理想。 - slhcktemp <-rbind(cond_a,cond_b,cond_c,cond_d) temp[order(temp["userid"]),]
还是说有什么特定的与主内容相关的关系吗? - A_K