I am reading the following columns from an Excel sheet:
CM              FOTO 1              FOTO 2              FOTO 3              FOTO 4
some text some  744905_L050_01.jpg  NaN                 744905_B001_01.jpg  NaN
some text some  NaN                 NaN                 NaN                 NaN
some text some  621059_W034_01.jpg  621059_W034_02.jpg  621059_W034_03.jpg  NaN
some text some  NaN                 NaN                 NaN                 NaN
I then build a new DataFrame for a CSV file and create the following columns:
df_["DESCRIPTION"] = df["CM "]
df_["IMAGES_URL"] = df[["FOTO 1 ", "FOTO 2 ", "FOTO 3 ", "FOTO 4 "]].apply(lambda x: x.str.cat(sep='|'), axis=1)
df_["WAREHOUSE"] = "D"
Finally, I save it to a CSV file, using a semicolon as the separator:
df_.to_csv('path/my_file.csv', encoding='utf-8', index=False, sep=';')
The output looks like this:
DESCRIPTION;IMAGES_URL;WAREHOUSE
some text some;744905_L050_01.jpg|744905_B001_01.jpg;D
some text some;;D
some text some;621059_W034_01.jpg|621059_W034_02.jpg|621059_W034_03.jpg;D
some text some;;D
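However, when all four FOTO columns are NaN, a double semicolon (;;) is written to the CSV file. How can I remove the double semicolon and replace it with a single one?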
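For context, the ;; here is not a duplicated separator: it is the empty IMAGES_URL field sitting between its two delimiters. Collapsing it to a single ; would leave those rows with two fields instead of three, so WAREHOUSE would shift into the IMAGES_URL column. If the goal is only to avoid the empty field, one option is to write a placeholder instead. The following is a minimal sketch under assumptions: the DataFrame is a hand-built stand-in for the Excel data, the column names are written without trailing spaces, the row-wise join uses dropna() rather than the original x.str.cat call, and the NO_IMAGE value is hypothetical.

```python
import io

import numpy as np
import pandas as pd

# Hypothetical stand-in for the Excel sheet (column names simplified).
df = pd.DataFrame({
    "CM": ["some text some"] * 4,
    "FOTO 1": ["744905_L050_01.jpg", np.nan, "621059_W034_01.jpg", np.nan],
    "FOTO 2": [np.nan, np.nan, "621059_W034_02.jpg", np.nan],
    "FOTO 3": ["744905_B001_01.jpg", np.nan, "621059_W034_03.jpg", np.nan],
    "FOTO 4": [np.nan] * 4,
})
foto_cols = ["FOTO 1", "FOTO 2", "FOTO 3", "FOTO 4"]

# Join the non-missing file names of each row with '|'.
# A row with no images yields an empty string.
images = df[foto_cols].apply(
    lambda row: "|".join(row.dropna().astype(str)), axis=1
)

# Assumption: a placeholder is acceptable to whatever consumes the CSV.
NO_IMAGE = "NO_IMAGE"  # hypothetical value, pick what the consumer expects
images = images.where(images != "", NO_IMAGE)

out = pd.DataFrame({
    "DESCRIPTION": df["CM"],
    "IMAGES_URL": images,
    "WAREHOUSE": "D",
})

# Write to an in-memory buffer here; pass a file path instead in real use.
buf = io.StringIO()
out.to_csv(buf, index=False, sep=";")
csv_text = buf.getvalue()
print(csv_text)
```

Every row still has exactly three fields, so the file stays parseable with sep=';'. If the consumer accepts empty fields, leaving ;; as it is may be the simplest choice.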