如何从DWARF信息中获取结构体成员偏移量?

4

我正在尝试从dwarf信息中打印DW_AT_data_member_location属性值,以获取结构体变量的偏移量,但是没有任何辅助函数能够解决问题。

这是来自objdump的dwarf_info:

<1><1bf>: Abbrev Number: 5 (DW_TAG_structure_type)
<1c0>   DW_AT_name        : (indirect string, offset: 0xf0): class  
<1c4>   DW_AT_byte_size   : 208 
<1c5>   DW_AT_decl_file   : 1   
<1c6>   DW_AT_decl_line   : 10  
<1c7>   DW_AT_sibling     : <0x1f8> 
<2><1cb>: Abbrev Number: 6 (DW_TAG_member)
<1cc>   DW_AT_name        : (indirect string, offset: 0xc7): schools    
<1d0>   DW_AT_decl_file   : 1   
<1d1>   DW_AT_decl_line   : 11  
<1d2>   DW_AT_type        : <0x1f8> 
<1d6>   DW_AT_data_member_location: 2 byte block: 23 0  (DW_OP_plus_uconst: 0)
<2><1d9>: Abbrev Number: 6 (DW_TAG_member)
<1da>   DW_AT_name        : (indirect string, offset: 0xd8): size   
<1de>   DW_AT_decl_file   : 1   
<1df>   DW_AT_decl_line   : 12  
<1e0>   DW_AT_type        : <0x159> 
<1e4>   DW_AT_data_member_location: 3 byte block: 23 c8 1   (DW_OP_plus_uconst: 200)
<2><1e8>: Abbrev Number: 6 (DW_TAG_member)
<1e9>   DW_AT_name        : (indirect string, offset: 0xf6): record 
<1ed>   DW_AT_decl_file   : 1   
<1ee>   DW_AT_decl_line   : 13  
<1ef>   DW_AT_type        : <0x1b8> 
<1f3>   DW_AT_data_member_location: 3 byte block: 23 cc 1   (DW_OP_plus_uconst: 204)

我能够获取属性代码,但无法获取属性值:

 if(dwarf_whatattr(attrs[i],&attrcode,&error) != DW_DLV_OK)
                      printf("Error");

 printf("Attrcode: %d\n",attrcode); // This one works

 if(attrcode==DW_AT_data_member_location) 
           dwarf_formudata(attrs[i],&offset,0) // This one does not work              

这个变量的类型是什么? 如何获取它的值? 哪个辅助函数可以使用?


我希望你展示的代码片段只是伪代码,否则将变量与字面字符串进行比较是非常可疑的。实际上,如果那是真正的代码,条件将永远不会成立。此外,如果那是真正的代码,上面的printf调用也是可疑的。 - Some programmer dude
@JoachimPileborg 感谢您指出来.. 我已经编辑了代码。 - Udit Gupta
2个回答

4

DWARF调试格式允许将成员的偏移量表示为简单常量或需要评估以计算偏移量的表达式。由于某种原因,您正在尝试处理的调试信息已将简单常量偏移量表示为表达式形式。您需要“评估”表达式以确定偏移量是多少,类似以下内容:

if (attrcode == DW_AT_data_member_location) {
    Dwarf_Half form;
    dwarf_whatform(attrs[i], &form, &error);
    if (form == DW_FORM_data1 || form == DW_FORM_data2
        form == DW_FORM_data2 || form == DW_FORM_data4
        form == DW_FORM_data8 || form == DW_FORM_udata) {
        dwarf_formudata(attrs[i], &offset, 0);
    } else if (form == DW_FORM_sdata) {
        Dwarf_Signed soffset;
        dwarf_formsdata(attrs[i], &soffset, 0);
        if (soffset < 0) {
             printf("unsupported negative offset\n");
             /* FAIL */
        }
        offset = (Dwarf_Unsigned) soffset;
    } else {
        Dwarf_Locdesc **locdescs;
        Dwarf_Signed len;
        if (dwarf_loclist_n(attrs[i], &locdescs, &len,  &error) == DW_DLV_ERROR) {
             printf("unsupported member offset\n");
             /* FAIL */
        }
        if (len != 1
            || locdescs[0]->ld_cents != 1
            || (locdescs[0]->ld_s[0]).lr_atom != DW_OP_plus_uconst) {
             printf("unsupported location expression\n");
             /* FAIL */
        }
        offset = (locdescs[0]->ld_s[0]).lr_number;
    }
}

非常感谢。我现在明白了。我对你的代码进行了一些编辑。也许你没有运行代码,所以错误地将某些(指向结构体的指针)视为(结构体)本身。再次万分感谢 :-) - Udit Gupta
1
我根本没有测试它,因为我需要编写一堆其他代码才能让这段代码工作。感谢您更新了正确的代码答案。 - Ross Ridge
你能否解释一下如何获取一个变量的位置(地址)? - Udit Gupta
代码示例在最后一个分支中存在内存泄漏。以下释放操作将释放由libdwarf分配的内存: dwarf_dealloc(debug,locdescs [0]-> ld_s,DW_DLA_LOC_BLOCK); dwarf_dealloc(debug,locdescs [0],DW_DLA_LOCDESC); dwarf_dealloc(debug,locdescs,DW_DLA_LIST); - Herman Narkaytis
或者将最后一个分支替换为: if (form == DW_FORM_block1) { Dwarf_Block *block; int rv = dwarf_formblock(dw_attribute, &block, NULL); assert(rv == DW_DLV_OK); if (DW_OP_plus_uconst == *(uint8_t*) block->bl_data) { uint8_t* leb128 = block->bl_data + sizeof(uint8_t); int i, shift = 0; offset = 0; for (i = 1; i < block->bl_len; ++i) { offset |= (*leb128++ & ((1 << 7) - 1)) << shift; shift += 7; } } } - Herman Narkaytis

1
如果您喜欢使用JVM,我写了一个简单的用于解析dwarf的库,它创建了一个基于EMF的模型。以下是打印结构体成员位置的示例代码:
def someFunction() {
  var elf = new Elf32Context(buffer)
  var dwarf = new Dwarf32Context(elf)
  var model = DwarfModelFactory::createModel(dwarf)
  model.eAllContents.filter(StructureType).forEach[it.dumpStruct]
}


def dumpStruct(CompositeType struct) '''
  «switch(struct) {StructureType:'struct' UnionType:'union' default:'composite?'}» «struct.typedef?.name» {
    «FOR m : struct.members»
        «m.type.dumpType» «m.name»  @«m.dataMemberLocation»
    «ENDFOR»
  }
'''

网页内容由stack overflow 提供, 点击上面的
可以查看英文原文,
原文链接