二分图投影错误(igraph,RStudio)

3
我正在使用 版本 0.99.879 和 包版本 1.0.1。我的问题与这个问题密切相关:RStudio中的igraph包:双部分图投影错误
然而,我的问题更与数据结构相关。以下是我使用的 csv 文件示例的链接:https://workupload.com/file/6qhyZqc 以及下面的代码:
# Start
set.seed(7)
setwd("C:/Users/Stefan/Desktop/")
data <- read.csv("example.csv", sep=";")
summary(data)

library(igraph)

## using subset function to select 2 variables
data_new <- subset(data, select=c(justification, claimant_function))
data_new

g <- graph.data.frame(data_new, directed = FALSE)
g

col <- c("steelblue1", "white")
shape <- c("circle", "square")

# creating bipartite network
V(g)$type <- FALSE
V(g)$type[V(g)$name %in% data_new[, 1]] <- TRUE
is.bipartite(g)
# TRUE

plot(g, layout = layout.kamada.kawai,
 vertex.shape = shape[as.numeric(V(g)$type) + 1],
 vertex.color = col[as.numeric(V(g)$type) + 1],
 edge.color = 'gray')

# bipartiate projection
one_mode_networks <- bipartite.projection(g)
# Error in .Call("R_igraph_bipartite_projection", graph, types, 
# as.integer(probe1),  : 
# At bipartite.c:198 : Non-bipartite edge found in bipartite projection, 
# Invalid value

除了投影命令外,一切都正常。因此,代码不是问题。也许错误/问题可能在数据本身上。由于我已经使用数据相当长一段时间,所以我认为我专业上有些迷糊。如果有人能查看发布的示例数据并提出问题可能是什么的建议,那将是很好的。任何帮助都非常受欢迎!
1个回答

3

我猜你在同一个从属关系的两个节点之间存在关联。经过一番搜索,果然如此。

请注意,data_new两列 中都有字符串“other”:

data_new[which(data_new[,1] %in% data_new[,2]),1]
[1] "other"

igraph读取该字符串作为网络中的一个节点。 V(g)$type[V(g)$name %in% data_new[, 1]] <- TRUE 将其type属性值设置为TRUE

我们可以查看连接两个type == 'TRUE'的顶点的连结:

> i <- which(V(g)$type[match(ends(g,1:ecount(g))[,1],V(g)$name)] == V(g)$type[match(ends(g,1:ecount(g))[,2],V(g)$name)])
> ends(g, i)
     [,1]                 [,2]   
[1,] "financial solidity" "other"

第12个边缘,两个顶点的type==TRUE。 只需重新赋值字符串使它们不相等,一切都会顺利进行。
data_new <- subset(data, select=c(justification, claimant_function))
data_new[which(data_new[,1]=="other"),1] <- "other just"
data_new[which(data_new[,2]=="other"),2] <- "other claim"

g <- graph.data.frame(data_new, directed = FALSE)

# creating bipartite network
V(g)$type <- FALSE
V(g)$type[V(g)$name %in% data_new[, 1]] <- TRUE
is.bipartite(g)

one_mode_networks <- bipartite_projection(g)

检查:

> one_mode_networks
$proj1
IGRAPH UNW- 16 72 -- 
+ attr: name (v/c), weight (e/n)
+ edges (vertex names):
 [1] business        --expert/scientist       business        --public figure         
 [3] business        --media/journalist       business        --citizen               
 [5] business        --legislative            business        --ECB                   
 [7] government      --media/journalist       government      --expert/scientist      
 [9] government      --other claim            government      --legislative           
[11] government      --ECB                    government      --European Commission   
[13] government      --other politician/party government      --European Parliament   
[15] government      --citizen                government      --European Council      
+ ... omitted several edges

$proj2
IGRAPH UNW- 16 83 -- 
+ attr: name (v/c), weight (e/n)
+ edges (vertex names):
 [1] political solidarity--monetary solidarity  political solidarity--financial solidity  
 [3] political solidarity--no justification     political solidarity--cultural solidarity 
 [5] political solidarity--sovereignty          political solidarity--self interest       
 [7] political solidarity--economic solidarity  political solidarity--human solidarity    
 [9] financial solidity  --social solidarity    financial solidity  --misuse of solidarity
[11] financial solidity  --economic solidarity  financial solidity  --cultural solidarity 
[13] financial solidity  --self interest        financial solidity  --legal regulations   
[15] financial solidity  --necessity            financial solidity  --conditionally       
+ ... omitted several edges

太好了。您的回答正是我所寻找的(并指向了数据集中其他“编码相似性”)。非常感谢! - Stefan_W
这些错误暴露了我们数据集中潜在问题的价值不言而喻! - tgraybam

网页内容由stack overflow 提供, 点击上面的
可以查看英文原文,
原文链接