I am trying to write some Python (3.3.2) code that logs in to a website using the Requests module. Here is the form section of the login page:
<form method="post" action="https://www.ibvpn.com/billing/dologin.php" name="frmlogin">
<input type="hidden" name="token" value="236647d2da7c8408ceb78178ba03876ea1f2b687" />
<div class="logincontainer">
<fieldset>
<div class="clearfix">
<label for="username">Email Address:</label>
<div class="input">
<input class="xlarge" name="username" id="username" type="text" />
</div>
</div>
<div class="clearfix">
<label for="password">Password:</label>
<div class="input">
<input class="xlarge" name="password" id="password" type="password"/>
</div>
</div>
<div align="center">
<p>
<input type="checkbox" name="rememberme" /> Remember Me
</p>
<p><a href="pwreset.php">Request a Password Reset</a></p>
</div>
</fieldset>
</div>
<div class="actions">
<input type="submit" class="btn primary" value="Login" />
</div>
</form>
Here is my code, which tries to handle the hidden input:
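import requests
from bs4 import BeautifulSoup

url = 'https://www.ibvpn.com/billing/clientarea.php'
body = {'username':'my email address','password':'my password'}
s = requests.Session()
# Fetch the login page so the session receives its cookies and the hidden token.
loginPage = s.get(url)
soup = BeautifulSoup(loginPage.text)
# Copy every hidden input (such as the token) into the POST body.
hiddenInputs = soup.findAll(name = 'input', type = 'hidden')
for hidden in hiddenInputs:
    name = hidden['name']
    value = hidden['value']
    body[name] = value
# Post the form data back to the same URL the page was fetched from.
r = s.post(url, data = body)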
This just returns the login page. If I post my login data to the URL in the form's "action" attribute instead, I get a 404 error.
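For reference, the two candidate targets differ: the page is fetched from clientarea.php, while the form's "action" attribute points at dologin.php. The following is a minimal sketch, not a confirmed fix, of pulling both the action URL and the hidden fields out of the page in one pass. It uses the standard library's html.parser instead of BeautifulSoup so that it runs without extra dependencies; the names LoginFormParser and build_login_request are made up for illustration, and the sample HTML is a trimmed copy of the form above.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class LoginFormParser(HTMLParser):
    """Collect the first form's action URL and its hidden input fields."""

    def __init__(self):
        super().__init__()
        self.action = None
        self.hidden = {}
        self._in_form = False
        self._done = False

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "form" and not self._done:
            self._in_form = True
            self.action = attrs.get("action")
        elif tag == "input" and self._in_form and attrs.get("type") == "hidden":
            if "name" in attrs:
                self.hidden[attrs["name"]] = attrs.get("value") or ""

    def handle_endtag(self, tag):
        # Stop collecting once the first form is closed.
        if tag == "form" and self._in_form:
            self._in_form = False
            self._done = True


def build_login_request(page_url, html, credentials):
    """Return (post_url, payload) for the first form found in html."""
    parser = LoginFormParser()
    parser.feed(html)
    payload = dict(credentials)
    payload.update(parser.hidden)
    # urljoin resolves a relative action against the page URL and
    # leaves an absolute action unchanged.
    post_url = urljoin(page_url, parser.action or "")
    return post_url, payload


sample_html = """
<form method="post" action="https://www.ibvpn.com/billing/dologin.php" name="frmlogin">
<input type="hidden" name="token" value="236647d2da7c8408ceb78178ba03876ea1f2b687" />
<input class="xlarge" name="username" id="username" type="text" />
<input class="xlarge" name="password" id="password" type="password"/>
</form>
"""

post_url, payload = build_login_request(
    "https://www.ibvpn.com/billing/clientarea.php",
    sample_html,
    {"username": "my email address", "password": "my password"},
)
print(post_url)
print(payload)
```

The resulting pair could then be passed to s.post(post_url, data=payload) on the same Session that fetched the page. Whether the server accepts that request may still depend on headers or cookies, which this sketch does not address.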
I have seen other posts on StackExchange where automatic cookie handling did not seem to work, so I also tried handling the cookies manually:
cookies = dict(loginPage.cookies)
r = s.post(url, data = body, cookies = cookies)
But this also just returns the login page.
I don't know whether this is related to the problem, but after running any variant of the code above, entering r.cookies returns <<class 'requests.cookies.RequestsCookieJar'>[]>.
If anyone has any suggestions, I would be glad to hear them.
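A note on the empty r.cookies mentioned above: in Requests, a Response's cookies attribute holds only the cookies set by that particular response, while cookies collected over earlier requests are kept on the Session's cookies attribute. An empty r.cookies therefore does not by itself show that cookie handling failed. The sketch below illustrates the difference without touching the network; the cookie name and value are hypothetical, and a bare Response object stands in for a server reply that set no cookies of its own.

```python
import requests

s = requests.Session()
# Hypothetical cookie, standing in for one the server set on an earlier request.
s.cookies.set("session_id", "abc123", domain="www.ibvpn.com", path="/")

# A bare Response stands in for a reply that carried no Set-Cookie header.
r = requests.Response()

response_cookie_names = [cookie.name for cookie in r.cookies]
session_cookie_names = [cookie.name for cookie in s.cookies]

print(response_cookie_names)  # cookies set by this response only
print(session_cookie_names)   # cookies accumulated on the session
```

In the code from the question, inspecting s.cookies after the initial s.get(url) would be the equivalent check.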
Apart from setting the User-Agent, everything is the same as above: s.headers['User-Agent'] = 'Mozilla/5.0 (Macintosh; Intel Mac OS X 10_7_5) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/31.0.1650.57 Safari/537.36'
and, as suggested in the answer, I posted to the "action" URL: s.post(soup.form['action'], data = body, headers = {'Referer':'http://www.ibvpn.com/index/'})
- Tom Harrop