在R中进行分面或分组相关性和自相关图的绘制

4

我正在尝试从数据框中按组/分面绘制相关图。如果我为每个变量子集数据,我可以做到这一点。如何同时为所有变量执行此操作,以基于每个变量生成分面图?

###Load libraries
library(gdata)
library(corrplot)
library(ggplot2)
library(gtable)
library(ggpmisc)
library(grid)
library(reshape2)
library(plotly)
packageVersion('plotly')

##Subset ample data from the "iris" data set in R
B<-iris[iris$Species == "virginica", ]

##calculate correlation for numeric columns only
M<-cor(B[,1:4])
head(round(M,2))

###calculate significance
cor.mtest <- function(mat, ...) {
mat <- as.matrix(mat)
n <- ncol(mat)
p.mat<- matrix(NA, n, n)
diag(p.mat) <- 0
for (i in 1:(n - 1)) {
    for (j in (i + 1):n) {
        tmp <- cor.test(mat[, i], mat[, j], ...)
        p.mat[i, j] <- p.mat[j, i] <- tmp$p.value
    }
}
colnames(p.mat) <- rownames(p.mat) <- colnames(mat)
p.mat
}
# matrix of the p-value of the correlation
p.mat <- cor.mtest(B[,1:4])

###plot
#color ramp
col<- colorRampPalette(c("red","white","blue"))(40)
corrplot(M, type="upper",tl.col="black", tl.cex=0.7,tl.srt=45, col=col,
p.mat = p.mat, insig = "blank", sig.level = 0.01)

这很好,因为我从数据框中只拿出了一个变量“virginica”。如何自动化这个过程,对于所有单独的变量进行唯一的相关计算,然后绘制corrplot作为单独的面板?
2个回答

3

我理解您想要为每个物种级别创建相关性图。因此,您可以尝试以下步骤:

library(Hmisc) # this package has implemented a cor function calculating both r and p.  
library(corrplot)
# split the data 
B <- split(iris[,1:4], iris$Species)
# Calculate the correlation in all data.frames using lapply 
M <- lapply(B, function(x) rcorr(as.matrix(x)))

# Plot three pictures
par(mfrow=c(1,3))
col<- colorRampPalette(c("red","white","blue"))(40)
lapply(M, function(x){
corrplot(x$r, type="upper",tl.col="black", tl.cex=0.7,tl.srt=45, col=col,
         p.mat = x$P, insig = "blank", sig.level = 0.01)
})

enter image description here


1

@Jimbou,感谢您的代码。我稍微编辑了一下,添加了相关分析、唯一的R和绘图功能,并为每个图表添加了一个唯一的名称。带标题的图表

library(ggplot2)
library(Hmisc) 
library(corrplot)
# split the data 
B <- split(iris[,1:4], iris$Species)
##extract names
nam<-names(B)
# Plot three pictures
par(mfrow=c(1,3))
col<- colorRampPalette(c("red","white","blue"))(40)
for (i in seq_along(B)){
# Calculate the correlation in all data.frames using lapply 
M<-rcorr(as.matrix(B[[i]]))
corrplot(M$r, type="upper",tl.col="black", tl.cex=0.7,tl.srt=45, col=col,
 addCoef.col = "black", p.mat = M$P, insig = "blank",sig.level = 0.01)
mtext(paste(nam[i]),line=1,side=3)}

网页内容由stack overflow 提供, 点击上面的
可以查看英文原文,
原文链接