使用Python将RDa文件读取为Pandas数据框。

3

我有一个在R中创建的RDa文件。我想将其作为pandas dataframe在Python中读取。我有以下代码实现相同的功能:

import rpy2.robjects as robjects
import numpy as np
from rpy2.robjects import pandas2ri
pandas2ri.activate()

# load your file
robjects.r['load']('Data.RDa')

matrix = robjects.r['data']

matrix

我得到了以下结果:
R object with classes: ('data.frame',) mapped to:
<DataFrame - Python:0x0CF46F58 / R:0x0ED0F200>
[Float..., Float..., Float..., ..., Float..., Float..., Float...]
  area: <class 'rpy2.robjects.vectors.FloatVector'>
  R object with classes: ('numeric',) mapped to:
<FloatVector - Python:0x0CF56A80 / R:0x0F281898>
[NA_real_, NA_real_, NA_real_, ..., NA_real_, NA_real_, NA_real_]
  i: <class 'rpy2.robjects.vectors.FloatVector'>
  R object with classes: ('numeric',) mapped to:
<FloatVector - Python:0x0CF68E68 / R:0x0F2B9520>
[NA_real_, NA_real_, NA_real_, ..., NA_real_, NA_real_, NA_real_]
  s: <class 'rpy2.robjects.vectors.FloatVector'>
  R object with classes: ('numeric',) mapped to:
<FloatVector - Python:0x0CF68940 / R:0x0F380008>
[NA_real_, NA_real_, NA_real_, ..., NA_real_, NA_real_, NA_real_]
  ...
  upslope_area: <class 'rpy2.robjects.vectors.FloatVector'>
  R object with classes: ('numeric',) mapped to:
<FloatVector - Python:0x0D03FDA0 / R:0x0FE87C90>
[NA_real_, NA_real_, NA_real_, ..., 292.256494, NA_real_, NA_real_]
  i: <class 'rpy2.robjects.vectors.FloatVector'>
  R object with classes: ('numeric',) mapped to:
<FloatVector - Python:0x0D03FC88 / R:0x0FEBF918>
[331347.500000, 331352.500000, 331357.500000, ..., 332187.500000, 332192.500000, 332197.500000]
  s: <class 'rpy2.robjects.vectors.FloatVector'>
  R object with classes: ('numeric',) mapped to:
<FloatVector - Python:0x0D03FE68 / R:0x0FEF75A0>
[4554812.500000, 4554812.500000, 4554812.500000, ..., 4553982.500000, 4553982.500000, 4553982.500000]

我该如何将它转换为pandas数据框?

1个回答

0

这看起来像是在从搜索路径中检索第一个具有符号“data”的R对象时(简而言之,在执行robjects.r [“data”]时)缺少对当前转换的调用。如果尚未打开问题,请在rpy2跟踪器上打开问题,或者如果未解决或被认为过早解决,则在已经打开的问题的评论中发出噪音。

显式调用仅限于代码块的转换规则应该是一种简单的解决方法,并且可能有助于确保良好的性能。转换机制提供了方便,但往往以性能为代价,因为每次在转换的任何方向上进行数据框的复制。

以下是示例:

from rpy2.robjects import default_converter
from rpy2.robjects import pandas2ri
from rpy2.robjects.conversion import localconverter

# use the default conversion rules to which the pandas conversion
# is added
with localconverter(default_converter + pandas2ri.converter) as cv:
    dataf = robjects.r["data"]

这是文档中的内容:http://rpy2.readthedocs.io/en/version_2.8.x/robjects_convert.html#local-conversion-rules


网页内容由stack overflow 提供, 点击上面的
可以查看英文原文,
原文链接