我正在尝试从某些网页中获取"link"元素。但是我无法弄清楚我做错了什么。我得到了以下错误:
严重性: 警告
消息: DOMDocument::loadHTML() [domdocument.loadhtml]: htmlParseEntityRef: no name in Entity, line: 536
文件名: controllers/test.php
行号: 34
代码中的第34行如下:
我的代码:$dom->loadHTML($html);
$url = "http://www.amazon.com/"; $ch = curl_init(); curl_setopt($ch, CURLOPT_URL, $url); curl_setopt($ch, CURLOPT_RETURNTRANSFER, 1); curl_setopt($ch, CURLOPT_CONNECTTIMEOUT, 10); if($html = curl_exec($ch)){ // parse the html into a DOMDocument $dom = new DOMDocument(); $dom->recover = true; $dom->strictErrorChecking = false; $dom->loadHTML($html); $hrefs = $dom->getElementsByTagName('a'); echo "<pre>"; print_r($hrefs); echo "</pre>"; curl_close($ch); }else{ echo "The website could not be reached."; }