CSV to JSON converter (grouping rows by the same key value)

Question


I'm trying to convert a CSV file to JSON. I searched Google but couldn't find a way to modify my code to get the result I want.

This is my Python code:

import csv
import json

def csv_to_json(csvFilePath, jsonFilePath):
    jsonArray = []

    # Read the CSV (the encoding matters)
    with open(csvFilePath, encoding='utf-8') as csvf:
        # DictReader yields one dictionary per CSV row
        csvReader = csv.DictReader(csvf)

        # Append each row dictionary to the JSON array
        for row in csvReader:
            jsonArray.append(row)

    # Conversion: serialize the list and write it to the output file
    with open(jsonFilePath, 'w', encoding='utf-8') as jsonf:
        jsonString = json.dumps(jsonArray, indent=4)
        jsonf.write(jsonString)

csvFilePath='example.csv'
jsonFilePath='output.json'
csv_to_json(csvFilePath, jsonFilePath)

This is my CSV file format:

My actual JSON output:

[
    {
        "Area": "IT",
        "Employee": "Carl"
    },
    {
        "Area": "IT",
        "Employee": "Walter"
    },
    {
        "Area": "Financial Resources",
        "Employee": "Jennifer"
    }
]

My expected JSON output:

[
    {
        "Area": "IT",
        "Employee": ["Carl", "Walter"]
    },
    {
        "Area": "Financial Resources",
        "Employee": ["Jennifer"]
    }
]

Thanks in advance!

- Mauricio Reyes

2 Answers

The convtools library provides many reduce operations for handling aggregation (I must confess, I'm the author):

from convtools import conversion as c
from convtools.contrib.tables import Table

# generates an ad-hoc function, which aggregates data
converter = (
    c.group_by(c.item("Area"))
    .aggregate(
        {
            "area": c.item("Area"),
            "employees": c.ReduceFuncs.Array(c.item("Employee")),
        }
    )
    .gen_converter()
)

result = converter(
    Table.from_csv("tmp/in.csv", header=True).into_iter_rows(dict)
)
assert result == [
    {"area": "IT", "employees": ["Carl", "Walter"]},
    {"area": "Financial Resources", "employees": ["Jennifer"]},
]
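
Note that the converter returns a Python list with lowercase keys (`area`, `employees`) rather than a file. A minimal sketch of writing it out in the shape the question asks for, assuming `result` looks like the list in the assertion above; `write_grouped_json` is a hypothetical helper name, not part of convtools:

```python
import json

def write_grouped_json(result, json_file_path):
    """Rename keys to the question's spelling and write the list as JSON."""
    # Assumes each item has the "area" and "employees" keys shown above.
    renamed = [
        {"Area": item["area"], "Employee": item["employees"]}
        for item in result
    ]
    with open(json_file_path, "w", encoding="utf-8") as jsonf:
        json.dump(renamed, jsonf, indent=4)
    return renamed
```

Alternatively, the keys could simply be spelled `"Area"` and `"Employee"` in the `aggregate` dictionary, which would make the renaming step unnecessary.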

- westandskif

- Alexander · Accepted Answer

Something like this should work.

import csv
import json

def csv_to_json(csvFilePath, jsonFilePath):
    areas = {}
    with open(csvFilePath, encoding='utf-8') as csvf:
        csvReader = csv.DictReader(csvf)
        for row in csvReader:
            area, employee = row["Area"], row["Employee"]  # pull out the two values
            if area in areas:  # area already seen: add the employee to its list
                areas[area].append(employee)
            else:  # first time this area appears: start a new list
                areas[area] = [employee]
    # Convert the dictionary to the desired output format.
    jsonArray = [{"Area": k, "Employee": v} for k, v in areas.items()]
    with open(jsonFilePath, 'w', encoding='utf-8') as jsonf:
        jsonString = json.dumps(jsonArray, indent=4)
        jsonf.write(jsonString)
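
The same grouping can also be written with `collections.defaultdict`, which removes the need for the `if`/`else` branch. This is only a sketch, assuming the CSV has `Area` and `Employee` header columns; `group_employees` is an illustrative name, and the sample rows are inferred from the JSON output in the question. It reads from any iterable of lines, so it can be tried without a file on disk:

```python
import csv
import io
import json
from collections import defaultdict

def group_employees(lines):
    """Group the Employee column by Area, keeping first-seen area order."""
    areas = defaultdict(list)  # a missing key starts out as an empty list
    for row in csv.DictReader(lines):
        areas[row["Area"]].append(row["Employee"])
    return [{"Area": area, "Employee": names} for area, names in areas.items()]

# Sample data assumed from the JSON output shown in the question.
sample = io.StringIO(
    "Area,Employee\n"
    "IT,Carl\n"
    "IT,Walter\n"
    "Financial Resources,Jennifer\n"
)
grouped = group_employees(sample)
print(json.dumps(grouped, indent=4))
```

For a real file, an open file object can be passed instead of the `StringIO`, for example `open(csvFilePath, encoding='utf-8', newline='')`.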