如何基于lxml中的子元素选择父元素？

Question

如何基于lxml中的子元素选择父元素？

5

我有这段代码：

<table cellspacing="1" cellpadding="1" border="0">
  <tbody>
   <tr>
    <td>Something else</td>
   </tr>
   <tr>
    <td valign="top">
      <a href="http://exact url">Something</a>
    </td>
    <td valign="top">Something else</td>
   </tr>
  </tbody>
</table>

我想找到表格，但是很难定位它（相同的代码使用了10次）。但我知道URL中的内容。那么如何获取父表格？

- acheruns

4个回答

2

一个纯XPath的解决方案。

用途：

(//a[@href = "http://exact url"])[1]/ancestor::table[1]

该XPath表达式选取XML文档中第一个a元素的第一个祖先table元素，其href属性值为字符串"http://exact url"。

即使存在嵌套表格，此方法也可以准确地选择所需的a元素作为后代的每个表格元素，并选择最内层的table元素，而不是当前已接受的答案，该答案获得最外层的table祖先。

- Dimitre Novatchev

2

使用[]筛选表格。请注意，该属性是一个grandchild //table[.//@href="blah"]

或者//a[@href="blah"]//ancestor::table

- artificialidiot

1

//a[@href="http://exact url"]/../../..

你需要使用3个..才能到达表格元素。

- beerbajay

啊，仍然不是一个特别漂亮或通用的解决方案。 - Fred Foo

同意，你的解决方案更优雅。 - beerbajay

网页内容由stack overflow 提供, 点击上面的

可以查看英文原文，
原文链接

- Fred Foo · Accepted Answer

如果t是这个XML片段的etree，那么您要查找的链接就是：

t.xpath('//a[@href = "http://exact url"]')[0]

接着，您可以使用祖先轴（ancestor axis）获取到 table：

t.xpath('//a[@href = "http://exact url"]/ancestor::table')[-1]