我希望能够检索html页面中所有<h3>
标签的href属性,目前我已经能够获取其innerText,但是我不知道如何访问href属性。文档中有多个<h3>
标签,但目前只需要第一个。稍后我会处理其余部分...
这是我到目前为止得到的代码:
Sub Scrap()
Dim IE As New InternetExplorer
Dim sDD As String
Dim Doc As HTMLDocument
IE.Visible = True
IE.navigate "https://www.oneoiljobsearch.com/senior-reservoir-engineer-jobs/?page=1"
Do
DoEvents
Loop Until IE.readyState = READYSTATE_COMPLETE
Set Doc = IE.document
sDD = Trim(Doc.getElementsByTagName("h3")(0).innerText)
'sDD contains the string "Senior Reservoir Engineer"
End Sub
以下是要提取数据的HTML文档部分:
<div class="front_job_details">
<h3>
<a href="/jobs/senior-reservoir-engineer-oslo-norway-7?cmp=js&from=job-search-form-2" target="_blank">
Senior Reservoir Engineer
</a>
</h3>
我需要检索的文本是:“/jobs/senior-reservoir-engineer-oslo-norway-7?cmp=js&from=job-search-form-2”
非常感谢您的帮助。