我正在生成一个大的数据框(以CSV格式保存时为1.5 GB),需要将其存储在Excel文件的工作表中,同时还有一个较小的数据框需要保存在另一个工作表中。
print('Reading temporaty files for variable {}:'.format(Var))
print(' Reading stations')
s=pd.read_csv(StatFile,sep=':',dtype={'ID': 'str'},encoding='utf-8')
print(' Reading data')
d=pd.read_csv(DataFile,sep=':',dtype='str',encoding='utf-8').transpose()
d.columns = d.iloc[0]
d=d[1:].astype('float')
d.reindex_axis(sorted(d.columns), axis=1)
print('Writing out Excel file for variable {}'.format(Var))
writer = pd.ExcelWriter(Path + Var + '.xlsx', engine='xlsxwriter')
d.to_excel(writer, sheet_name='Data')
OutStatCol=['ID','Name','Longitude','Latitude','GRS','OriginalVariable','VariableUnits','URL','JsonNode']
s.to_excel(writer, columns=OutStatCol, index=False, sheet_name='Stations')
writer.save()
我的代码能够正常处理小的数据框,但是对于大的数据框,则会出现以下错误:
Traceback (most recent call last):
File "./Test2.py", line 29, in <module>
writer.save()
File "/home/user/miniconda2/lib/python2.7/site-packages/pandas/io/excel.py", line 1413, in save
return self.book.close()
File "/home/user/miniconda2/lib/python2.7/site-packages/xlsxwriter/workbook.py", line 297, in close
self._store_workbook()
File "/home/user/miniconda2/lib/python2.7/site-packages/xlsxwriter/workbook.py", line 624, in _store_workbook
xlsx_file.write(os_filename, xml_filename)
File "/home/user/miniconda2/lib/python2.7/zipfile.py", line 1148, in write
self._writecheck(zinfo)
File "/home/user/miniconda2/lib/python2.7/zipfile.py", line 1114, in _writecheck
" would require ZIP64 extensions")
zipfile.LargeZipFile: Filesize would require ZIP64 extensions
我能否在ExcelWriter声明或to_excel()方法中指定类似allowZip64=True
的内容?
谢谢!
pd.ExcelWriter(...)
中使用一个关键字。 - AaronallowZip64=True
和.use_zip64()
两种方式,但都返回了错误。 - user6357781