我尝试了不同的方法来压缩文件夹。我的理解是,Python内置模块总是比subprocess.call("Linux命令")更快。但是我刚刚做了一些演示。tarfile模块比subprocess.call("tar")慢。有人能给我解释一下吗?
#!/usr/bin/python
import os
import time
import tarfile
import subprocess
tStart1 = time.time()
TestFolder = ["Jack", "Robin"]
for folder in TestFolder:
name = "/mnt/ShareDrive/Share/ExistingUsers/"+folder
path = "/mnt/TEST2/"
tar = tarfile.open(path+folder+".tar.gz", "w:gz")
tar.add(name)
tar.close()
tEnd1 = time.time()
time.sleep(2)
tStart2 = time.time()
for folder in TestFolder:
path = "/mnt/TEST1/"
subprocess.call(["tar", "zcvf", path+folder+".tar.gz", "-P", "/mnt/ShareDrive/Share/ExistingUsers/"+folder])
tEnd2 = time.time()
print "The module cost %f sec" % (tEnd1 - tStart1)
print "The subprocess cost %f sec" % (tEnd2 - tStart2)
tarfile模块花费63秒。子进程只花费32秒。
两个文件夹的总大小为433 MB。