无法使用Python通过Google Drive API下载Google电子表格

3

我正在尝试将电子表格文件从我的云端硬盘下载到我的计算机上。 我能够成功地进行身份验证,获取文件列表,甚至成功获取元数据。 但当我尝试下载文件时,出现以下错误:

downloading file starts
An error occurred: <HttpError 400 when requesting https://www.googleapis.com/dri
ve/v2/files/1vJetI_p8YEYiKvPVl0LtXGS5uIAx1eRGUupsXoh7UbI?alt=media returned "The
 specified file does not support the requested alternate representation.">
downloading file ends

我在stackoverflow上没有找到任何相关的问题或疑问,而且stackoverflow上提供的其他方法和解决方案已经过时。这些方法已被Google弃用。

以下是我用于下载文件的代码:

import httplib2
import os    
from apiclient import discovery
import oauth2client
from oauth2client import client
from oauth2client import tools


from apiclient import errors
from apiclient import http

try:
    import argparse
    flags = argparse.ArgumentParser(parents=[tools.argparser]).parse_args()
except ImportError:
    flags = None

#SCOPES = 'https://www.googleapis.com/auth/drive.metadata.readonly'
SCOPES = 'https://www.googleapis.com/auth/drive'
CLIENT_SECRET_FILE = 'client_secrets.json'
APPLICATION_NAME = 'Drive API Quickstart'


def get_credentials():
    home_dir = os.path.expanduser('~')
    credential_dir = os.path.join(home_dir, '.credentials')
    if not os.path.exists(credential_dir):
        os.makedirs(credential_dir)
    credential_path = os.path.join(credential_dir,
                                   'drive-quickstart.json')

    store = oauth2client.file.Storage(credential_path)
    credentials = store.get()

    if not credentials or credentials.invalid:
        flow = client.flow_from_clientsecrets(CLIENT_SECRET_FILE, SCOPES)
        flow.user_agent = APPLICATION_NAME
        if flags:
            credentials = tools.run_flow(flow, store, flags)
        else: # Needed only for compatability with Python 2.6
            credentials = tools.run(flow, store)
        print 'Storing credentials to ' + credential_path
    return credentials

def main():
    credentials = get_credentials()
    http = credentials.authorize(httplib2.Http())
    service = discovery.build('drive', 'v2', http=http)

    file_id = '1vJetI_p8YEYiKvPVl0LtXGS5uIAx1eRGUupsXoh7UbI'

    print "downloading file starts"
    download_file(service, file_id)
    print "downloading file ends "

def download_file(service, file_id):

    local_fd = open("foo.csv", "w+")
    request = service.files().get_media(fileId=file_id)
    media_request = http.MediaIoBaseDownload(local_fd, request)

    while True:
        try:
            download_progress, done = media_request.next_chunk()
        except errors.HttpError, error:
            print 'An error occurred: %s' % error
            return
        if download_progress:
            print 'Download Progress: %d%%' % int(download_progress.progress() * 100)
        if done:
            print 'Download Complete'
            return

if __name__ == '__main__':
    main()
2个回答

1

谷歌电子表格没有媒体,而是有导出链接。获取文件元数据,然后查看导出链接并选择适当的URL。


我得到了以下元数据,但没有导出链接或下载链接: { 'selfLink': 'parentLink': 'iconLink': 'embedLink': 'alternateLink': 'selfLink': 'selfLink': } - kadamb
请粘贴在 "files.get Try-It" 中看到的元数据,以便我们可以查看。 - pinoyyid

0

这段代码对我很有效。我只需要从Google开发者控制台下载client_secret.json文件,并将其保存在与Python脚本相同的目录中。

在list_of_lists变量中,我得到了一个包含每行列表的列表。

import gspread
import json
from oauth2client.client import SignedJwtAssertionCredentials


json_key = json.load(open('client_secret.json'))
scope = ['https://spreadsheets.google.com/feeds']
credentials = SignedJwtAssertionCredentials(json_key['client_email'], json_key['private_key'], scope)

gc = gspread.authorize(credentials)
sht1 = gc.open_by_key('<id_of_sheet>')
worksheet_list = sht1.worksheets()
worksheet = sht1.sheet1
list_of_lists = worksheet.get_all_values()

for row in list_of_lists :
    print row

网页内容由stack overflow 提供, 点击上面的
可以查看英文原文,
原文链接