Java中带命名空间的XPath

29

我想获取在标签之间的所有内容,但由于urn:命名空间的存在,我不知道该如何做。

<urn:ResponseStatus version="1.0" xmlns:urn="urn:camera-org">

<urn:requestURL>/CAMERA/Streaming/status</urn:requestURL>
<urn:statusCode>4</urn:statusCode>
<urn:statusString>Invalid Operation</urn:statusString>
<urn:id>0</urn:id>

</urn:ResponseStatus>

有什么想法吗?


1
你正在使用哪个库?你尝试在查询前加上 urn: 了吗? - Olivier.Roger
使用javax.xml.xpath.XPath; 我尝试了urn:,但没有帮助。 - Sergiu
2个回答

70
  1. 简短回答:使用XPath中的local-name()。像这样:xPathFactory.newXPath().compile("//*[local-name()='requestURL']/text()");将返回/CAMERA/Streaming/status
  2. 或者您可以实现一个NamespaceContext,将命名空间名称和URI映射,并在查询之前将其设置在XPath对象上。
  3. 请查看这篇博客文章更新:文章已下线,您可以在WebArchive上查看。

解决方案1示例:

XPath xpath = XPathFactory.newInstance().newXPath();
String responseStatus = xpath.evaluate("//*[local-name()='ResponseStatus']/text()", document);
System.out.println("-> " + responseStatus);

方案2示例:

// load the Document
Document document = ...;
NamespaceContext ctx = new NamespaceContext() {
    public String getNamespaceURI(String prefix) {
        return prefix.equals("urn") ? "urn:camera-org" : null; 
    }
    public Iterator getPrefixes(String val) {
        return null;
    }
    public String getPrefix(String uri) {
        return null;
    }
};
XPath xpath = XPathFactory.newInstance().newXPath();
xpath.setNamespaceContext(ctx);
String responseStatus = xpath.evaluate("//urn:ResponseStatus/text()", document);
System.out.println("-> " + responseStatus);

编辑

这是一个完整的示例,它正确地检索了元素:

String xml = "<urn:ResponseStatus version=\"1.0\" xmlns:urn=\"urn:camera-org\">\r\n" + //
        "\r\n" + //
        "<urn:requestURL>/CAMERA/Streaming/status</urn:requestURL>\r\n" + //
        "<urn:statusCode>4</urn:statusCode>\r\n" + //
        "<urn:statusString>Invalid Operation</urn:statusString>\r\n" + //
        "<urn:id>0</urn:id>\r\n" + //
        "\r\n" + //
        "</urn:ResponseStatus>";
DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
factory.setNamespaceAware(true);
DocumentBuilder builder = factory.newDocumentBuilder();
Document doc = builder.parse(new java.io.ByteArrayInputStream(xml.getBytes()));
XPath xpath = XPathFactory.newInstance().newXPath();
xpath.setNamespaceContext(new NamespaceContext() {
    public String getNamespaceURI(String prefix) {
        return prefix.equals("urn") ? "urn:camera-org" : null;
    }

    public Iterator<?> getPrefixes(String val) {
        return null;
    }

    public String getPrefix(String uri) {
        return null;
    }
});
XPathExpression expr = xpath.compile("//urn:ResponseStatus");
Object result = expr.evaluate(doc, XPathConstants.NODESET);
NodeList nodes = (NodeList) result;
for (int i = 0; i < nodes.getLength(); i++) {
    Node currentItem = nodes.item(i);
    System.out.println("found node -> " + currentItem.getLocalName() + " (namespace: " + currentItem.getNamespaceURI() + ")");
}

1
好的回答!(如果代码缩进并且示例 XML 没有被作者自己的服务器吃掉,那篇博客文章会更有用。不过,最后的列表仍然很有帮助。) - Ti Strga
1
非常感谢!我使用这个逻辑解析了一个巨大复杂的XML文件,它对我很有效。 - Sandeep Singh
getNamespaceURI没有匹配任何已注册的前缀时,应该返回XMLConstants.NULL_NS_URI(即空字符串),而不是null。 - Jules
http://blog.davber.com/ 网站现在只返回一个内部服务器错误。 - hooknc
@hooknc,你仍然可以在WebArchive上查看它:https://web.archive.org/web/20070328212209/http://blog.davber.com/2006/09/17/xpath-with-namespaces-in-java/ - Alex
1
大家好:我在第三点中更新了答案,并通过Webarchive添加了指向原博客文章的链接。 - Alex

1

使用xpath命名空间的XML

简单XML

String namespaceXML = "<?xml version='1.0' ?><information><person id='1'><name>Deep</name><age>34</age><gender>Male</gender></person>  <person id='2'><name>Kumar</name><age>24</age><gender>Male</gender></person> <person id='3'><name>Deepali</name><age>19</age><gender>Female</gender></person><!-- more persons... --></information>";
String jsonString = "{}";
String expression = "//information";

命名空间XML

String namespaceXML = "<soap:Envelope xmlns:soap=\"http://schemas.xmlsoap.org/soap/envelope/\"><soap:Body><m:NumberToDollarsResponse xmlns:m=\"http://www.dataaccess.com/webservicesserver/\"><m:NumberToDollarsResult>nine hundred and ninety nine dollars</m:NumberToDollarsResult></m:NumberToDollarsResponse></soap:Body></soap:Envelope>";
String jsonString = "{'soap':'http://schemas.xmlsoap.org/soap/envelope/', 'm':'http://www.dataaccess.com/webservicesserver/'}";
String expression = "//m:NumberToDollarsResponse/m:NumberToDollarsResult/text()";

将命名空间 XML 文件作为字符串提供给 asscerionXpath(namespaceXML, jsonString, expression) 方法,并以文本/节点形式获取结果。

文本: 九百九十九美元

节点: <m:NumberToDollarsResult xmlns:m="http://www.dataaccess.com/webservicesserver/"> 九百九十九美元 </m:NumberToDollarsResult>

public static String asscerionXpath(String namespaceXML, String jsonString, String expression){
    if(namespaceXML.indexOf("><") > -1) namespaceXML = namespaceXML.replace("><", ">\r\n<");
    if(jsonString.indexOf("'") > -1)    jsonString = jsonString.replace("'", "\"");

    System.out.println("namespaceXML : \n"+namespaceXML);
    System.out.println("nsmespaces : \n"+jsonString);

    DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
    factory.setValidating(false);
    factory.setNamespaceAware(true);
    factory.setIgnoringComments(true);
    factory.setIgnoringElementContentWhitespace(true);
    try {
        DocumentBuilder builder = factory.newDocumentBuilder();
        Document source = builder.parse( string2Source(namespaceXML) );
        XPath xpath = XPathFactory.newInstance().newXPath();

        addNameSpaces(jsonString, xpath);
        // An XPath expression is not thread-safe. Make sure it is accessible by only one Thread.
        XPathExpression expr = xpath.compile(expression);

        // The NodeList interface provides the abstraction of an ordered collection of nodes,
        NodeList nodes = (org.w3c.dom.NodeList) expr.evaluate(source, XPathConstants.NODESET);;
        Node tree_base = nodes.item(0);
        return document2String(tree_base);
    } catch (UnsupportedEncodingException e) {
        e.printStackTrace();
    } catch (SAXException e) {
        e.printStackTrace();
    } catch (IOException e) {
        e.printStackTrace();
    } catch (ParserConfigurationException e) {
        e.printStackTrace();
    } catch (XPathExpressionException e) {
        System.out.println("If the expression cannot be evaluated.");
    }
    return "";
}
static InputSource string2Source( String str ) {
    InputSource inputSource = new InputSource( new StringReader( str ) );
    return inputSource;
}
static void addNameSpaces(String jsonString, XPath xpath) {
    JSONParser parser = new JSONParser();
    try {
        JSONObject namespaces = (JSONObject) parser.parse(jsonString);

        if (namespaces.size() > 0) {
            final JSONObject declaredPrefix = namespaces; // To access in Inner-class.
            NamespaceContext nameSpace = new NamespaceContext() {
                // To get all prefixes bound to a Namespace URI in the current scope, XPath 1.0 specification
                // --> "no prefix means no namespace"
                public String getNamespaceURI(String prefix) {
                    Iterator<?> key = declaredPrefix.keySet().iterator();
                    System.out.println("Keys : "+key.toString());
                    while (key.hasNext()) {
                        String name = key.next().toString();
                        if (prefix.equals(name)) {
                            System.out.println(declaredPrefix.get(name));
                            return declaredPrefix.get(name).toString();
                        }
                    }
                    return "";
                }
                public Iterator<?> getPrefixes(String val) {
                    return null;
                }
                public String getPrefix(String uri) {
                    return null;
                }
            };// Inner class.

            xpath.setNamespaceContext( nameSpace );
        }

    } catch ( org.json.simple.parser.ParseException e) {
        e.printStackTrace();
    }
}

网页内容由stack overflow 提供, 点击上面的
可以查看英文原文,
原文链接