I have a CSV file (file 1) in the following format:
FLAGSTAFF AZ 50244.67 5.02 KA1_Podium_Garage_S
FLAGSTAFF AZ 33752.13 3.38 KA1_Podium_Garage_S
FLAGSTAFF AZ 11965.5 1.2 KA1_Podium_Garage_S
FLAGSTAFF AZ 3966.48 0.4 KA1_Podium_Garage_S
SEATTLE WA 12646.9 1.26 KA1_Podium_Garage_S
SEATTLE WA 225053.92 22.51 KA1_Podium_Garage_S
SEATTLE WA 23974.3 2.4 KA1_Podium_Garage_S
SEATTLE WA 7036.4 0.7 KA1_Podium_Garage_S
SEATTLE WA 3021.93 0.3 KA1_Podium_Garage_S
I have a second CSV file (file 2) in the following format:
Alabama AL 1
Alaska AK 2
Arizona AZ 4
Arkansas AR 5
California CA 6
Colorado CO 8
Connecticut CT 9
SEATTLE WA 53
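Now I need to add the third-column value from CSV file 2 to CSV file 1, matching rows on the second column.
For example, it should work like this:
The code for AZ is 4 and the code for WA is 53. Wherever AZ or WA appears in my CSV file 1, the corresponding code should be appended as a new column.
My output should look like this: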
FLAGSTAFF AZ 50244.67 5.02 KA1_Podium_Garage_S 4
FLAGSTAFF AZ 33752.13 3.38 KA1_Podium_Garage_S 4
FLAGSTAFF AZ 11965.5 1.2 KA1_Podium_Garage_S 4
FLAGSTAFF AZ 3966.48 0.4 KA1_Podium_Garage_S 4
SEATTLE WA 12646.9 1.26 KA1_Podium_Garage_S 53
SEATTLE WA 225053.92 22.51 KA1_Podium_Garage_S 53
SEATTLE WA 23974.3 2.4 KA1_Podium_Garage_S 53
SEATTLE WA 7036.4 0.7 KA1_Podium_Garage_S 53
SEATTLE WA 3021.93 0.3 KA1_Podium_Garage_S 53
This is the code I tried:
import csv

with open("/home/sumit/Desktop/CSV_FILE1.csv", "r") as f:
    first = {rows[1]: rows[0:] for rows in list(csv.reader(f))}
# compare second csv and append Code
with open("CSVFILE2.csv", "r") as f1:
    for row in csv.reader(f1):
        if row[1] in first:
            first[row[1]].append(row[2])
# convert dict back to list
merged = [(k,) + tuple(v) for k, v in first.items()]
# write list to output csv
with open("output.csv", "w") as f1:
    csv.writer(f1).writerows(merged)
The output I get looks like this:
AZ FLAGSTAFF AZ 44230.4 4.42 KA1_Podium_Garage_S 4
WA SEATTLE WA 45329.3 4.53 KA1_Podium_Garage_S 53
A dict can only store one value per key; if you try to add the same key multiple times with different values, only the latest value is kept. - R Nar
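Following up on that comment, one possible way around the problem is to turn the lookup around: build the dict from file 2 (state abbreviation to code), where each key really is unique, and then walk through file 1 row by row so no rows are collapsed. The snippet below is only a minimal sketch. The helper name `append_state_codes` is made up for illustration, the in-memory samples stand in for the real files, and it assumes the files are comma-delimited, which is what `csv.reader` expects by default.

```python
import csv
import io


def append_state_codes(data_rows, code_rows):
    """Return a copy of data_rows with the matching code appended to each row.

    Hypothetical helper: column 2 of both inputs is assumed to hold the
    state abbreviation, and column 3 of code_rows the code to append.
    """
    # Each abbreviation appears once in file 2, so a dict is safe here.
    codes = {row[1]: row[2] for row in code_rows if len(row) >= 3}
    # Keep every data row; append an empty string when no code is known.
    return [row + [codes.get(row[1], "")] for row in data_rows]


# In-memory samples standing in for the two files (comma-delimited is an assumption).
file1 = io.StringIO(
    "FLAGSTAFF,AZ,50244.67,5.02,KA1_Podium_Garage_S\n"
    "SEATTLE,WA,12646.9,1.26,KA1_Podium_Garage_S\n"
    "SEATTLE,WA,225053.92,22.51,KA1_Podium_Garage_S\n"
)
file2 = io.StringIO("Arizona,AZ,4\nSEATTLE,WA,53\n")

merged = append_state_codes(list(csv.reader(file1)), csv.reader(file2))

# Write the merged rows back out as CSV text.
out = io.StringIO()
csv.writer(out, lineterminator="\n").writerows(merged)
print(out.getvalue())
```

With real files, the `io.StringIO` objects would be replaced by `open(...)` handles (ideally opened with `newline=""`, as the `csv` module documentation recommends). If the files are actually space-separated as displayed above, passing `delimiter=" "` to `csv.reader` and `csv.writer` should be enough.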